[go: up one dir, main page]

Academia.eduAcademia.edu
An empirical analysis of poverty, inequality and the labour market in Malawi by Anderson Sawira Gondwe Dissertation presented for the degree of Doctor of Philosophy (Economics) in the Faculty of Economic and Management Sciences at Stellenbosch University Supervisor: Prof. Servaas van der Berg December 2016 Stellenbosch University https://scholar.sun.ac.za Declaration By submitting this dissertation electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the sole author thereof (save to the extent explicitly otherwise stated), that reproduction and publication thereof by Stellenbosch University will not infringe any third party rights and that I have not previously in its entirety or in part submitted it for obtaining any qualification. December 2016 _____________________________ (Anderson Gondwe) Copyright © 2016 Stellenbosch University All rights reserved i Stellenbosch University https://scholar.sun.ac.za Abstract This thesis is a consolidation of three related studies on Malawi. The first study contains spatial and temporal comparisons of poverty and inequality in Malawi using two non-monetary dimensions, namely an asset index and child nutritional status. Through stochastic dominance tests, the study establishes that poverty and inequality are unambiguously higher in rural areas, which contain 85% of the population, in the Southern region and among households headed by females. Results indicate that poverty has significantly declined over time and that the gains from growth have been pro-poor. We show that welfare does not vary much across regions and areas with respect to child nutritional status but there are large differences in asset poverty. Stunting is a bigger problem among children under the age of five than body wasting and being underweight. Econometric analysis shows that asset ownership is positively associated with household size, the age of household head and education attainment. Age dependency ratio and incidence of sickness are negatively associated with asset ownership. Multivariate analysis of child nutrition reveals that malnutrition first worsens before improving at some critical age. This is consistent with possible recovery found in some of the studies that track children over time. Also in accordance with some literature, we find that boys have weaker nutritional status than girls. The second study looks at the role of education in poverty reduction identified through the labour market. This study contributes to research on returns to education by including self-employment activities and non-farm business enterprises. Unlike previous studies, this study uses panel data which has many advantages, as acknowledged in the literature. We find large and positive returns to education in Malawi suggesting that education is a good investment. The returns increase with the levels of education. Interestingly, females have higher returns to education than males with similar skills. Since the Malawian labour market is not homogeneous, our analysis distinguishes between the formal and informal employment sectors. Furthermore, studying Malawi’s informal sector is important as it accounts for 78% of total employment. Our results show that education externalities exist and play an important role in non-farm enterprises. The findings are robust to sample selection and treatment of outliers. We further show that dealing with inconsistencies in the data helps improves the quality and reliability of the results. The third study applies spatial panel data econometric techniques to the study of migration and employment in Malawi. The study shows that the magnitudes of coefficients drop after taking into account spatial dependencies. This confirms that studies that fail to take into account the spatial effects tend to overstate the results. By matching geographical codes that are consistent over time, it is now feasible to integrate census data with other data for similar spatial analysis. The study further evaluates the impact of land reform policy on spatial migration and employment using a difference-in-difference ii Stellenbosch University https://scholar.sun.ac.za estimation strategy. Results show that the policy has had significant effects on migration and employment patterns in Malawi. iii Stellenbosch University https://scholar.sun.ac.za Opsomming Hierdie tesis is a konsolidasie van drie verwante studies oor Malawi. Die eerste studie ondersoek armoede en ongelykheid in Malawi oor tyd en ruimte heen deur twee nie-monetêre dimensies, naamlik 'n bate-indeks en die voedingstatus van kinders, te gebruik. Deur middel van stogastiese dominansietoetse word ondubbelsinnig getoon dat armoede en ongelykheid hoër is in landelike gebiede, wat 85% van die bevolking huisves, in die Suidelike streek en onder huishoudings met vroue as hoof van die huis. Resultate toon dat armoede beduidend afgeneem het en dat groei tot voordeel van die armes strek. Ons resultate toon weinig verskille in welsyn tussen streke en gebiede met betrekking tot die voeding status van kinders, maar groot verskille in bate-armoede . Vertraagde groei is 'n groter probleem by kinders onder die ouderdom van vyf jaar as kwyning en ondergewig. Ekonometriese ontleding toon dat batebesit positief verband hou met die grootte van die huishouding en die ouderdom en opvoedingsvlak van die hoof van die huishouding . Die ouderdom-afhanklikheidslas en die voorkoms van siekte hou negatief verband met bate-besit. Regressie-analise wys dat wanvoeding onder kinders eers met ouderdom toeneem voordat dit by hoër ouderdomme afneem, wat konsekwent is met die moontlikheid van herstel soos party studies wat kinders oor 'n tydperk volg bevind. Ook, in ooreenstemming met party studies, word bevind dat die voedingstatus van dogters beter is as dié van seuns. Die tweede studie bestudeer die rol van onderwys in die vermindering van armoede in die arbeidsmark. Deur die insluiting van selfwerksaamheidsaktiwiteite en nie-landbou sakeondernemings dra die studie by tot navorsing oor die voordele van opvoeding in Malawi. Anders as in vorige studies, gebruik hierdie studie paneeldata, wat baie voordele inhou, soos in die literatuur bevestig. Ons vind groot en positiewe opbrengste op onderwys, wat daarop dui dat dit 'n goeie belegging is. Opbrengste neem toe met vlakke van onderwys. Interessant genoeg, ervaar vroue hoër opbrengste op belegging in onderwys as mans met dieselfde vaardighede. Aangesien die arbeidsmark in Malawi nie homogeen is nie, tref ons analise ‘n onderskeid tussen die formele en informele indiensnemingsektore. Dit belangrik om Malawi se informele sektor in ag te neem, aangesien dit 78% van die totale indiensneming uitmaak. Ons resultate wys dat daar eksternaliteite van onderwys bestaan wat 'n belangrike rol speel in nie-landbou ondernemings. Ons resultate is robuust virsteekproefseleksie en die hantering van uitskieters. Die uitstryk van data-onreëlmatighede dra tot 'n verbetering in die kwaliteit en betroubaarheid van die resultate by. Die derde studie pas ruimtelike paneeldata ekonometriese tegnieke toe op migrasie en indiensneming in Malawi. Die grootte van koëffisiënte neem af as ruimtelike afhanklikhede in ag geneem word. Dit bevestig dat studies wat nalaat om ruimtelike aspekte in berekening te bring geneig is om effekte te oorskat. Deur konsekwente geografiese kodes oor tyd te verbind is dit nou moontlik om sensusdata met ander data te integreer vir verdere ruimtelike analise. Die studie evalueer ook die uitwerking van die iv Stellenbosch University https://scholar.sun.ac.za grondhervormingsbeleid op ruimtelike migrasie en indiensneming deur die gebruik van 'n verskil-inverskille metodeevalueer. Die resultate dui daarop dat hierdie beleid 'n beduidende uitwerking op migrasie en werkloosheid in Malawi het. v Stellenbosch University https://scholar.sun.ac.za Acknowledgements Firstly, I would like to thank Professor Servaas van der Berg from whom I have received tremendous feedback and guidance throughout my PhD study. I will forever remember him for his positive contribution to my life and for opening my mind to the big picture of research. I know Servaas as a very humble, patient and wise man with outstanding fatherly care. Considering my non-academic background, the fact that I am completing my studies within three years speaks volumes about how Servaas is able to identify and develop talent in people. Secondly, I acknowledge the contribution from Research on Social Economic Policy (RESEP) which is led by Servaas. RESEP has been important to my progress and timely completion of the studies. Apart from the networking, the organisation has been the source of the much needed additional funding for my studies and academic conferences. The RESEP team also boasts an excellent pool of researchers from which I have greatly benefited. It has provided a platform for the transfer of new skills and refining my work through the weekly departmental seminars and training workshops. I particularly thank Dr Dieter von Fintel for his insights that have also helped shape my study. Thirdly, I also say my heart-felt thanks to The Stellenbosch Institute for Advanced Study (STIAS) for the scholarship funding provided through The Graduate School of Economic and Management Sciences (GEMS). The GEMS cohort system, which recruits PhD researchers from different parts of the world, has enabled me to develop important networks with colleagues from three different continents, namely Africa, Asia and Europe. In this regard, special mention goes to Dr Jaco Franken, the manager of the graduate school programme and fellow PhD students in the programme. Lastly, I thank my family and friends who have provided encouragement to me during my physical absence from them. I have made it to the glory of the LORD and Jesus Christ. vi Stellenbosch University https://scholar.sun.ac.za Dedication I dedicate this work to my family and particularly Mary Gondwe. My wish is that there may never cease to be people who attain PhD education throughout our family generations. vii Stellenbosch University https://scholar.sun.ac.za Table of Contents Declaration ............................................................................................................................................... i Abstract ................................................................................................................................................... ii Opsomming ............................................................................................................................................ iv Acknowledgements ................................................................................................................................ vi Dedication ............................................................................................................................................. vii Table of Contents ................................................................................................................................. viii List of Figures ........................................................................................................................................ xi List of Tables ......................................................................................................................................... xii Chapter 1 ................................................................................................................................................. 1 1.1 Introduction ............................................................................................................................. 1 1.2 Geography and history ............................................................................................................ 1 1.3 Economy.................................................................................................................................. 2 1.4 Problem statement ................................................................................................................... 5 1.5 National data sources............................................................................................................... 6 1.6 Thesis structure........................................................................................................................ 7 Chapter 2 ............................................................................................................................................... 10 2.1 Introduction ........................................................................................................................... 10 2.2 Theoretical considerations in poverty measurement ............................................................. 12 2.3 Inequality measurement ........................................................................................................ 14 2.4 Stochastic dominance analysis .............................................................................................. 15 2.5 Poverty and inequality decomposition .................................................................................. 17 2.6 Pro-poor growth analysis....................................................................................................... 17 2.7 Data ....................................................................................................................................... 20 2.8 Poverty lines .......................................................................................................................... 27 2.9 Cumulative density curves .................................................................................................... 27 2.10 FGT poverty estimates .......................................................................................................... 28 2.11 Poverty dominance analysis .................................................................................................. 30 2.12 Gini and GE inequality estimates .......................................................................................... 31 2.13 Inequality dominance analysis .............................................................................................. 33 2.14 Poverty decomposition .......................................................................................................... 34 2.15 Subgroup inequality decomposition ...................................................................................... 34 2.16 Spatial distribution of poverty and inequality ....................................................................... 37 2.17 Factors affecting asset poverty .............................................................................................. 40 viii Stellenbosch University https://scholar.sun.ac.za 2.18 Child nutritional status in Malawi ......................................................................................... 42 2.19 Multivariate analysis of child nutrition ................................................................................. 44 2.20 Asset index and pro-poor growth analysis ............................................................................ 47 2.21 Pro-poor growth in child nutritional status............................................................................ 52 2.22 Conclusions ........................................................................................................................... 55 2.23 Policy discussion ................................................................................................................... 55 Chapter 3 ............................................................................................................................................... 57 3.1 Introduction ........................................................................................................................... 57 3.2 Methodology ......................................................................................................................... 59 3.2.1 Theoretical framework .................................................................................................. 59 3.2.2 Estimating returns to education ..................................................................................... 62 3.2.3 Sample selection ............................................................................................................ 63 3.2.4 Modelling unobserved heterogeneity ............................................................................ 64 3.3 Description of the data .......................................................................................................... 65 3.3.1 Work and non-work activities of the employed ............................................................ 66 3.3.2 Describing employment structure and hours worked .................................................... 67 3.3.3 Treatment of outliers, missing data and zero earnings .................................................. 69 3.3.4 Dealing with inconsistencies ......................................................................................... 71 3.4 Labour force participation ..................................................................................................... 71 3.4.1 Size of labour force and labour force participation rates............................................... 72 3.4.2 Changes in the labour force according to background characteristics .......................... 73 3.4.3 Shares in the labour force .............................................................................................. 75 3.4.4 Multivariate analysis of labour force participation........................................................ 76 3.5 Unemployment ...................................................................................................................... 78 3.6 Employment trends and characteristics ................................................................................. 79 3.6.1 Employment shares and growth rates ............................................................................ 79 3.6.2 Multivariate analysis of employment likelihood ........................................................... 80 3.7 Earnings and changes in employment status ......................................................................... 81 3.7.1 Employed in either wave ............................................................................................... 82 3.7.2 Employed in both waves ............................................................................................... 82 3.7.3 Identifying sources of increases in earnings .................................................................. 83 3.8 Econometric analysis of returns to education........................................................................ 85 3.8.1 Wage employment ......................................................................................................... 85 3.8.2 Household non-farm enterprise earnings ....................................................................... 97 3.9 Measurement error using panel data...................................................................................... 99 3.10 Comparing income and consumption .................................................................................. 100 ix Stellenbosch University https://scholar.sun.ac.za 3.11 Dynamics of household percapita consumption .................................................................. 102 3.12 Conclusion and policy implications .................................................................................... 104 Chapter 4 ............................................................................................................................................. 106 4.1 Introduction ......................................................................................................................... 106 4.2 Data and methods ................................................................................................................ 107 4.2.1 Data ............................................................................................................................. 107 4.2.2 Theoretical framework ................................................................................................ 108 4.2.3 Spatial autocorrelation ................................................................................................. 109 4.2.4 Spatial panel data models ............................................................................................ 111 4.3 Descriptive analysis ............................................................................................................. 114 4.3.1 Spatial distribution of employment, education, assets and fertility ............................. 114 4.3.2 Fertility trends ............................................................................................................. 116 4.3.3 Changes in population age structure and labour supply .............................................. 118 4.3.4 Spatial and temporal patterns of migration.................................................................. 122 4.3.5 Spatial autocorrelation in variables ............................................................................. 125 4.4 Spatial panel regression results ........................................................................................... 127 4.4.1 Effects of land reform policy on migration ................................................................. 128 4.4.2 Effects of land reform policy on employment ............................................................. 130 4.5 Conclusions ......................................................................................................................... 133 Chapter 5 ............................................................................................................................................. 135 5.1 Introduction ......................................................................................................................... 135 5.2 Summary of findings ........................................................................................................... 135 5.3 Conclusions ......................................................................................................................... 138 5.4 Implications of the research ................................................................................................ 139 5.5 Summary of contributions ................................................................................................... 143 5.6 Future research .................................................................................................................... 144 List of references ................................................................................................................................. 146 Appendices .......................................................................................................................................... 154 Appendix A1: Coefficient comparison test ................................................................................. 154 Appendix A2: Dealing with data inconsistencies ........................................................................ 154 x Stellenbosch University https://scholar.sun.ac.za List of Figures Figure 1.1: Political map of Malawi ........................................................................................................ 2 Figure 1.2: Malawi’s GDP per capita annual growth rates (1961-2015) ................................................ 5 Figure 2.1: Adjusted and unadjusted asset indices by population subgroups........................................ 24 Figure 2.2: Distribution of anthropometric Z-scores for HAZ, WAZ and WHZ .................................. 25 Figure 2.3: MCA asset index cumulative density curves by population groups ................................... 28 Figure 2.4: MCA asset index Lorenz curves by population subgroups................................................. 31 Figure 2.5: Spatial distribution of poverty using asset index and child-nutritional status .................... 37 Figure 2.6: Spatial distribution of inequality for asset index and child-nutritional status ..................... 39 Figure 2.7: Asset poverty incidence curves by survey year .................................................................. 48 Figure 2.8: Access to assets by type and survey period ........................................................................ 52 Figure 2.9: Poverty incidence curves for HAZ by DHS survey year .................................................... 53 Figure 3.1: Histogram of real monthly earnings by survey year and occupation .................................. 84 Figure 3.2: Rates of return on education by gender .............................................................................. 90 Figure 3.3: Kernel densities for annual household percapita consumption and income by year ........ 101 Figure 4.1: Map of Malawi showing administrative boundaries ......................................................... 108 Figure 4.2: Spatial distribution of status in employment for 2008 ...................................................... 115 Figure 4.3: Spatial distribution of employment type for 2008 ............................................................ 115 Figure 4.4: Spatial distribution of educational attainment for 2008 .................................................... 116 Figure 4.5: Spatial distribution of asset index, schooling and fertility for 2008 ................................. 116 Figure 4.6 : Pyramids showing populations and labour force participation by sex (1987) ................. 120 Figure 4.7: Pyramids showing populations and labour force participation by sex (1998) .................. 120 Figure 4.8: Pyramids showing populations and labour force participation by sex (2008) .................. 121 Figure 4.9: Proportions that moved between districts ......................................................................... 124 Figure 4.10: Spatial autocorrelation for employment .......................................................................... 126 Figure 4.11: Spatial autocorrelation for migration .............................................................................. 126 Figure 4.12: Measures of local spatial autocorrelation for employment and migration ...................... 127 Figure A 1: FISP evaluation panel data on real ganyu wages ............................................................. 155 xi Stellenbosch University https://scholar.sun.ac.za List of Tables Table 1.1: Malawi’s HDI trends based on consistent methodology and data ......................................... 4 Table 2.1: Summary of Malawi DHS data sets used ............................................................................. 21 Table 2.2: Descriptive statistics for the poverty and asset indices ........................................................ 23 Table 2.3: Child malnutrition rates by population groups ..................................................................... 26 Table 2.4: Poverty headcount, average poverty gap and poverty severity estimates ............................ 29 Table 2.5: Poverty stochastic dominance test results for population subgroups ................................... 30 Table 2.6: Inequality estimates across population subgroups ............................................................... 32 Table 2.7: Generalised Lorenz dominance test results across population subgroups............................ 33 Table 2.8: FGT poverty sub-group decomposition ............................................................................... 35 Table 2.9: GE inequality decomposition for asset index and child nutrition ........................................ 36 Table 2.10: Summary descriptive statistics for the asset model ............................................................ 40 Table 2.11: OLS regression results for asset poverty ............................................................................ 42 Table 2.12: Descriptive statistics for the nutritional models ................................................................. 43 Table 2.13: OLS regression results for child nutritional status ............................................................. 45 Table 2.14: Descriptive statistics for asset index scores ....................................................................... 47 Table 2.15: Differences in poverty headcount indices for household asset ownership ......................... 49 Table 2.16: Indices of pro-poorness in child nutritional status between 1992 and 2010 ...................... 49 Table 2.17: Pooled mean access of assets by area and region, all periods ............................................ 51 Table 2.18: Descriptive statistics for HAZ ............................................................................................ 52 Table 2.19: Differences in the FGT poverty headcount index for HAZ ............................................... 54 Table 2.20: Indices of pro-poorness for HAZ ....................................................................................... 54 Table 3.1: Employment and occupation structures ............................................................................... 67 Table 3.2: Changes in the average weekly hours worked and years of education by year ................... 69 Table 3.3: Labour force participation rates according to characteristics ............................................... 73 Table 3.4: Changes in the labour force according to characteristics ..................................................... 74 Table 3.5: Shares of working-age population and labour force according to characteristics ................ 75 Table 3.6: Probit regressions on labour force participation................................................................... 77 Table 3.7: Unemployment shares and rates by year .............................................................................. 78 Table 3.8: Broad and narrow employment shares and growth by year ................................................. 80 Table 3.9: Two-step Heckman probit results on employment likelihood ............................................. 81 Table 3.10: Mean monthly total wages by employment status and survey period ................................ 82 Table 3.11: Mean monthly ganyu wages by employment status and survey period ............................. 83 Table 3.12: Employment status and education attainment of individuals ............................................. 83 Table 3.13: OLS results for log of real monthly wages ........................................................................ 86 Table 3.14: OLS, Fixed effects and random effects results for log of real monthly wages .................. 87 Table 3.15: OLS and random effects results for log of monthly wages using education categories ..... 89 Table 3.16: Regressions of log monthly wages by gender and employees in both waves .................... 91 Table 3.17: Random effect results based on alternative treatment of outliers....................................... 92 Table 3.18: Wage functions corrected for sample selection.................................................................. 93 Table 3.19: Probit on choice of employment sector .............................................................................. 94 Table 3.20: Regression results on log of monthly wages by sector without sample selection .............. 95 Table 3.21: Regression results on log of monthly wages by sector with sample selection ................... 96 Table 3.22: Regressions for monthly household non-farm enterprise earnings .................................... 98 Table 3.23: Average household percapita consumption by components and year .............................. 102 xii Stellenbosch University https://scholar.sun.ac.za Table 3.24: Consumption transition matrix ......................................................................................... 103 Table 4.1: Summary of spatial panel model options ........................................................................... 114 Table 4.2: Age-specific fertility rates (ASFR) and total fertility rates (TRF) ..................................... 117 Table 4.3: Changes in economically active population and labour force by sex (1987-2008)............ 119 Table 4.4: Proportions of population and labour force by age group .................................................. 122 Table 4.5: Inter-regional migration transition frequencies by gender ................................................. 123 Table 4.6: Inter-regional migration transition probabilities by gender ............................................... 123 Table 4.7: Results showing spatial dependencies in variables (pooled 1987, 1998 and 2008) ........... 125 Table 4.8: Effects of land reform policy on migration ........................................................................ 129 Table 4.9: Balancing tests for base period........................................................................................... 130 Table 4.10: Effects of land reform policy on agricultural employment .............................................. 131 Table 4.11: Effects of land reform policy on government employment ............................................. 132 Table A 1: Table showing the distribution of households by year and population subgroups ............ 156 Table A 2: Table showing the distribution of children by year and population subgroups................. 156 Table A 3: WHZ regression results by age category ........................................................................... 157 Table A 4: List of regions, districts and traditional areas consistent over time................................... 158 Table A 5: Results showing spatial dependencies in variables for 1987............................................. 163 Table A 6: Results showing spatial dependencies in variables for 1998............................................. 163 Table A 7: Results showing spatial dependencies in variables for 2008............................................. 163 Table A 8: Population figures for individuals aged (15 years and older)............................................ 164 Table A 9: Number of people who migrated to other districts (15 years and older) ........................... 165 Table A 10: Proportions of people who migrated to other districts (15 years and older) ................... 166 Table A 11: Effects of land reform policy on migration ..................................................................... 167 Table A 12: Effects of land reform policy on agricultural employment ............................................. 168 Table A 13: Effects of land reform policy on government employment ............................................. 169 Table A 14: Effects of land reform policy on self, wage and private employment (OLS only) ......... 170 xiii Stellenbosch University https://scholar.sun.ac.za Chapter 1 Introduction 1.1 Introduction This Chapter provides some background information on Malawi to help the reader contextualise the study before turning to the topics addressed in this thesis. In Section 1.2, we look at Malawi’s geography and history. Section 1.3 discusses the economy. The problem statement is discussed in Section 1.4. The background information on the national data sources is given in Section 1.5. Finally, Section 1.6 gives the thesis structure. 1.2 Geography and history Malawi is a landlocked country located in Southern Africa. It is bordered by Zambia to the north-west, Tanzania to the northeast and Mozambique to the west, south and east as shown in Figure 1.1. The country is a long strip of about 901 km and ranges between 80 and 161 km in width1. The total surface area is about 118,484 km2 of which 94,276 km2 is made up of land. The remaining area is largely made up of Lake Malawi, Africa’s third-largest fresh-water lake, about 475 km long (National Statistical Office & ICF Macro, 2011). Administratively, Malawi is divided into three regions, namely the Northern, Central and Southern regions. Regions are also divided into districts and there are 28 districts in total: six, nine and 13 districts in the North, Centre and South, respectively. Each of the 28 districts is further subdivided into traditional authorities (TAs), ruled by senior chiefs. The smallest units of administration are villages typically governed by village headmen or women. Only a small proportion (about 15%) of Malawi’s population resides in urban areas (National Statistical Office & ICF Macro, 2011). The geographical division of Malawi into regions, districts and traditional authorities provides an important dimension for decomposition analysis as we will see later. Malawi became a protectorate of Britain in 1891. From 1953 to 1963, Malawi (formerly called Nyasaland) was part of the Federation of Rhodesia and Nyasaland together with Zambia (formerly Northern Rhodesia) and Zimbabwe (formerly Southern Rhodesia). Malawi became independent in 1964 and gained the status of a republic in 1966 (National Statistical Office & ICF Macro, 2011). Although politically independent for 52 years, the country is still highly dependent on foreign aid; the United Kingdom, the European Commission, the Global Fund and the World Bank continue to make up the four largest donors (Organisation for Economic Co-operation and Development, 2008). Multiparty democracy was abolished in 1966 but later reintroduced in 1993 after a national referendum. During the 1 In Chapter 4 we compute distance matrices for spatial analysis. 1 Stellenbosch University https://scholar.sun.ac.za first year of a multiparty democracy in 1994, primary education in government schools became free and this resulted in a large increase in school enrolment (Kadzamira & Rose, 2015). Figure 1.1: Political map of Malawi Source: Own construction from country shapefiles 1.3 Economy The Malawian economy is largely dependent on agriculture, which made up about 30% of the gross domestic product (GDP) in 2015 and continues to directly benefit more than 75% of the population. The agricultural sector in Malawi consists of the smallholder sector mainly for subsistence production and the estate sector for exportation. The main food crops are maize, rice and cassava. In 2014, tobacco generated about 64% of foreign exchange. Other important export crops in Malawi are tea and sugar, accounting for about 9% and 8% of total export value, respectively (Reserve Bank of Malawi, 2015). However, more recently, tobacco, which is the main cash crop for export, has come under threat because of the world-wide anti-smoking campaign. 2 Stellenbosch University https://scholar.sun.ac.za According to Food and Agriculture Organisation (FAO) AQUASTAT (2015), less than 1% of cultivated areas is under irrigation2. Since much of Malawi’s agriculture is dependent on rains, food security and household incomes are threatened by flooding and droughts. Irrigation development, if combined with advancements in good cropping systems and inputs, has the potential of improving farm incomes and food security for the majority of the population who are involved in subsistence agriculture. The Green Belt Irrigation is one of the key priority areas identified in the national development agenda with the aim of facilitating growth and development in the coming years by taking advantage of the country’s abundant water resources. Potentially, 1 million hectares can be irrigated in Malawi. The current project is expected to expand the amount of land under irrigation from 90,000 to 400,000 hectares (Government of Malawi, 2007, 2012). The development policy agenda for Malawi is focussed on poverty reduction and is summarised in the Malawi Growth and Development Strategy (MGDS), a five-year strategy. The first programme (MGDS I) was launched in July 2007 and ran through to 2011. The second programme, the Malawi Growth and Development Strategy II (MGDS II), was being implemented from 2011 and expired on 30 June 2016. Some of the elements of MGDSII have been rolled over by the government. MGDS I and MGDS II share the same themes but the latter has six themes instead of five. The themes or thematic areas are Sustainable Economic Growth, Social Development, Social Support and Disaster Risk Management, Infrastructure Development, Improved Governance and Cross-Cutting Issues. MGDS II includes a cross-cutting theme that deals with gender imbalances, capacity development, and research and development, HIV and AIDS, nutrition, environment, climate change, population and science and technology (Government of Malawi, 2007, 2012). Malawi’s national strategies are developed by the Government of Malawi in consultation with key stakeholders, particularly the International Monetary Fund (IMF) and the World Bank, from whom the country continues to receive technical support. In the 1980s and 1990s, Malawi embarked on economic and structural reforms under the Structural Adjustment Policies (SAPS) of the World Bank and IMF. The country was also granted debt relief under the 1996 Heavily Indebted Poor Countries (HIPC) initiative which the IMF and World Bank implemented for a number of developing countries. Despite these reforms and other targeted interventions, Malawi is still one of the least developed countries in the world and its economy is still undiversified (Government of Malawi, 2012; Organisation for Economic Co-operation and Development, 2008). 2 Available at http://www.fao.org/nr/water/aquastat/main/index.stm. 3 Stellenbosch University https://scholar.sun.ac.za According to the Human Development Report (2015), the Human Development Index (HDI) value for Malawi in 2014 was 0.445 and this puts the country on a ranking of 173 out of 188 countries and regions recognised by the United Nations (UN). Table1.1 provides a review of Malawi’s progress between 1980 and 2014 in each of the three indicators that make up the HDI. The three dimensions of the HDI are a long and healthy life (measured by life expectancy), access to knowledge (measured by expected and mean years of schooling) and a decent standard of living (measured by Gross National Income (GNI))3. Table 1.1: Malawi’s HDI trends based on consistent methodology and data Year 1980 1985 1990 1995 2000 2005 2010 2011 2012 2013 2014 Life expectancy at birth 44.8 45.1 43.8 43.5 44.1 48.3 56.9 58.6 60.1 61.5 62.8 Expected years of Mean years of schooling schooling 4.8 1.8 4.6 2.1 5.3 2.5 11.1 2.7 10.3 3.1 9.6 3.4 10.6 4.3 10.8 4.3 10.8 4.3 10.8 4.3 10.8 4.3 GNI per capita (2011 PPP$) 705 643 612 556 613 601 722 732 717 726 747 HDI Value 0.278 0.278 0.284 0.334 0.340 0.355 0.420 0.429 0.433 0.439 0.445 Source: Human Development Report (2015) The table shows that from 1980 to 2014, life expectancy improved by 18 years, expected years of schooling by 6 years, mean years of schooling by 2.5 years and GNI per capita by about 6%. Comparing HDI changes over time using previously published reports would be erroneous due to the revisions and updates that take place from time to time. However, the comparisons provided in the table are based on consistent indicators and methodology developed by the United Nations Development Programme (UNDP) for the purpose of analysis over time. Therefore, the figures in the tables show real changes in values and Malawi’s actual progress over time (Human Development, 2015). In Figure 1.2 we show that the annual GDP per capita growth for Malawi between 1961 and 2015 has been both volatile and dismal in some of the years. The average number of years of education is among individuals aged 25 years and older while the expected years of schooling are the total number of years of schooling a child of school-entry age is expected to receive assuming that the prevailing patterns of age-specific enrolment rates stay the same during the child's life. 3 4 Stellenbosch University https://scholar.sun.ac.za Figure 1.2: Malawi’s GDP per capita annual growth rates (1961-2015) Source: Own computation from World Bank country data4 Population growth, alongside nutrition, is another critical area identified under the MGDS II’s theme of Social Development. Due to high levels of fertility rates, averaging 5.05 children per woman in 2008, Malawi’s population is rising rapidly. During the last population and housing census in 2008, the population was estimated at 13 million and projected at 17 million in 2015 (National Statistical Office, 2008). Malawi is now ranked at number 62 out of 196 countries by population with 149 people per km2. This expanding population has implications for the labour market as well as the economy’s ability to support jobs. Furthermore, in the MGDS II, labour and employment are identified as some of the subthemes for sustainable economic development. Within this framework, the focus is placed on the creation of employment with a view to poverty reduction, incorporating gender specific issues in all labour initiatives and interventions, reducing practices of discrimination in the labour market and improved statistics on the labour market (Government of Malawi, 2012). 1.4 Problem statement Our study is placed within the context of the critical issues facing Malawi as outlined in the national development plan, namely the MGDS. The government identifies malnutrition as a crisis particularly for the rural areas where most of the children suffer from high levels of stunting, wasting and underweight. Agriculture naturally comes on the national development agenda because it is the major source of employment for the Malawian population, as earlier stated. It is for this reason that the government continues to invest in agriculture with the aim of improving food security which has implications for nutritional status for the population. Land is identified as one of the critical resources for agricultural development and the Government of Malawi recently embarked on the land reform policy with the aim of improving production and incomes for the rural population which makes up 85% of the population (World Bank, 2012). A related project is the Integrated Rural Development (IRD) 4 Available at http://www.worldbank.org/en/country/malawi. 5 Stellenbosch University https://scholar.sun.ac.za programme which seeks to promote economic development in rural areas through the establishment of rural growth centres and provision of social services. Other critical areas in the national development plan include labour and employment, addressing gender imbalances and improving education (Government of Malawi, 2012). It is against this background that our study focuses on poverty, child-nutritional status, education, employment and migration. The overarching theme in the thesis is poverty reduction and economic development. This study’s contribution is towards an improved empirical understanding of the aforementioned economic phenomena. It, therefore, can inform and have implications for policy and development strategy. 1.5 National data sources This study makes use of multiple data sources, all of which complement each other in the understanding of the issues discussed in the thesis. The main agency for data collection in Malawi is the National Statistical Office (NSO), responsible inter alia for conducting nationally representative surveys in the country. Malawi has an Integrated Household Survey programme consisting of national censuses, the Integrated Household Surveys (IHS) and Demographic Health Surveys (DHS). We identify these data sets as the most suitable for this thesis. Conducted every ten years since 1966, the national censuses provide a unique source of data for understanding long-term patterns of economic phenomena in Malawi. Furthermore, they consist of information at small geographical areas which is important for understanding issues and planning at low levels of administration. Information collected in the censuses includes literacy, education, migration and employment, among other population characteristics (National Statistical Office, 2008). The IHSs are conducted every five years and collect information on consumption expenditure, education, time use and labour, agriculture, health and child anthropometry. The first IHS was carried out in 1990 and was called the Household Expenditure and Small Scale Economic Activities Survey (HESSEA). Three rounds of integrated household surveys have been conducted after HESSEA, namely IHS1 conducted in 1997/8, IHS2 conducted in 2004/5 and recently IHS3 conducted between March 2010 and March 2011 (National Statistical Office, 2012). In between the surveys are the Welfare Monitoring Survey (WMS) normally conducted every year with the aim of tracking the living conditions of people, identification of the vulnerable population groups and collecting indicators for monitoring the attainment of national goals to which Malawi has committed itself, such as the MGDS and the 6 Stellenbosch University https://scholar.sun.ac.za Millennium Development Goals (MDGs)5. So far, seven rounds of WMSs have been conducted in Malawi, namely for 2005, 2006, 2007, 2008, 2009, 2011 and 2014 (National Statistical Office, 2015). An alternative data source for the understanding of labour markets issues is the 2013 Malawi Labour Force Survey (MLFS), which was conducted to provide a situational analysis of employment and unemployment in Malawi. The previous labour force survey was conducted in 1983 but was not publicly made available (National Statistical Office, 2012). Although it complements data from the IHSs and WMSs, we do not use data from the labour force survey as it is stand-alone and, therefore, inadequate for comparisons over time. The DHSs are conducted every four years with main emphasis on health and nutrition, which is an area of focus of study in this thesis. The first Malawi DHS was conducted in 1992 (National Statistical Office & ICF Macro, 2011). Although the integrated household surveys also collect information on assets, health and child-anthropometry, the DHSs provide a better source because of the detailed extent to which these issues are dealt with in the questionnaire. In the context of Malawi, Verduzo-Gallo, Ecker, and Pauw (2014) point out some serious inconsistencies and data quality issues in child-nutrition estimates obtained using IHS data sets compared to anthropometric records from other nationally representative data sets. Specifically, they find that while the 2010 DHS and the 2009 National Micronutrient Survey (NMS) yield national child stunting rates of between 47% and 49%, estimates from the 2010/11 IHS3 suggest a prevalence rate of only about 30%. Similarly, estimates of child stunting levels based on the IHS2 data in 2004/05 are about 9 percentage points below the incidence rates based on the 2004/05 DHS and the 2006 Multiple Indicator Cluster Survey (MICS). 1.6 Thesis structure This thesis is a consolidation of three related studies on Malawi. The introductory discussion provided in the previous sections of this Chapter covers the issues related to all the three studies. In order to allow for a detailed analysis, each of the studies forms a separate chapter. Similarly, the theoretical underpinnings of the studies are also discussed separately in each of the main chapters. In Chapter 2, we provide some theoretical considerations on poverty and inequality measurement, stochastic dominance analysis and measurement of pro-poor growth. In Chapter 3, two main competing groups of theories for explaining labour market outcomes are discussed, namely the traditional neoclassical model of labour supply and the segmented labour market hypothesis. Examples of the theories discussed are the human capital theory, Roy’s (1951) two sector model and the Harris-Todaro (1970) model of migration, among others. Furthermore, we discuss the different theories or explanations as to what constitutes or gives rise 5 MDGs have now replaced with Sustainable Development Goals (SDGs) since 2015. 7 Stellenbosch University https://scholar.sun.ac.za to informal labour markets. In Chapter 4, we discuss the theoretical perspectives of migration, broadly grouped under either the disequilibrium or equilibrium perspectives. The specific theories discussed in the chapter are the gravity models of migration, the human capital theory and the ‘spatial job-search models’. Chapter 2 contains spatial and temporal comparisons of multidimensional poverty and inequality in Malawi based on two non-monetary dimensions of welfare, namely an asset index and child nutritional status. Data for this study are drawn from the DHSs. Through this paper, we attempt to present Malawi’s profile of poverty and inequality, including the progress made over time. We also show the extent to which the observed changes over time have been pro-poor. Child nutritional status is identified as a problem requiring attention in the country’s national development agenda (MGDS). The first part of the chapter deals with the derivation of our two dimensions of poverty. The second part conducts robust comparisons of economic welfare and an econometric analysis of underlying possible reasons behind the observed changes. The third and final part analyses pro-poor changes in poverty over time. In Chapter 3, we look at the role of education in poverty reduction in Malawi by using data from the Malawi Integrated Household Panel Survey (IHPS). The linkage between education and poverty reduction is identified through the labour market. The argument made here is twofold, namely that education improves an individual’s chances of getting employment and that education positively impacts on earnings. This partly justifies why governments invest in education, which is acknowledged to have positive externalities on households and communities (e.g. Basu & Foster, 1998). In the context of Malawi, primary education was universally made free in 1994, which is before the MDGs, while university education is either directly subsidised or students are granted study loans. The overall objective of this chapter is to estimate returns to education. Our analysis distinguishes between wage employment and self-employment activities (household enterprises), which make up a large percentage of total employment in Malawi. With respect to self-employment, the returns to education are calculated at the household level using the maximum level of education in the household. Prior to the analysis, we also conduct some consistency checks in the data to ensure data quality and meaningful comparability over time. The theme of employment continues through Chapter 4, where emphasis is now placed on gender related issues and the role of spatial effects in employment. The data used are from the national censuses. Segregation of results by gender is important because women form a large percentage of the labour force in Malawi, with the majority of them engaged in the agricultural sector. Specifically, Chapter 4 applies spatial panel data econometric techniques to the study of migration and employment in Malawi. It is widely recognised in the literature that both geography (space) and time are important to the understanding of economic phenomena. However, only few studies incorporate spatiotemporal analysis 8 Stellenbosch University https://scholar.sun.ac.za and within the context of Malawi, this study is the first-time attempt. First, we first match geographical codes so that they are consistent over time. Once this is done, we analyse long-term patterns of migration and employment in Malawi. In 2004, the Government of Malawi introduced land reform with the aim of increasing the incomes of poor rural families in four Southern region districts of the country. This policy was aimed at poverty reduction through increase of incomes and improvement in food security for the participating families. To be specific, willing individuals purchased agricultural land from willing sellers and resettled in the new areas. Therefore, the second part of this study is dedicated to the analysis of the effects that the land reform policy had on migration and employment. The findings from this study are important because of the future joint plans by the World Bank and Malawi Government to scale up the project to the rest of Malawi. Finally, Chapter 5 concludes the thesis. The chapter provides a discussion on how the thesis addresses the research questions developed in each of the studies. We also look at the significance of the thesis in terms of the contributions made to research, implications of the research, its limitations and suggestions for future study. 9 Stellenbosch University https://scholar.sun.ac.za Chapter 2 Measuring poverty and pro-poor growth in Malawi 2.1 Introduction Poverty and inequality remain big concerns in Malawi, a very poor country. Deprivation exists in a number of dimensions such as education, consumption, child nutritional status and assets. Based on household per capita consumption estimates from the Third Household Integrated Survey (IHS3), about 51% of households in Malawi are poor. In addition, the Gini coefficient shows that inequality has increased over the past five survey years from 0.390 in 2005 to 0.452 in 2011. Estimates based on the 2010 Malawi Demographic Health Survey (MDHS) indicate that the incidence of child stunting in Malawi stands at 47% (National Statistical Office & ICF Macro, 2011). Following the works of Sen (1985, 1987), a number of approaches have been developed to measure multidimensional poverty. A multidimensional view of poverty considers more than one aspect of deprivation. Conventionally and for a long time, poverty has been looked at in terms of either income or consumption. However, this view of poverty has been criticised for ignoring other important dimensions of well-being such as health, education, empowerment and freedom of association. Based on the literature, one can group the existing approaches to the measurement of multidimensional poverty into three alternatives. The first approach aggregates a number of dimensions of poverty such as life expectancy, literacy and Gross Domestic Product (GDP) into a single one-dimensional index. Examples include the Human Development Index (HDI) and the Multidimensional Poverty Index (MPI). Ravallion (2011) raises questions as to whether such single-one dimensional indices (which he refers to as “mashup” indices) are sufficient for poverty measurements as opposed to developing a set of poverty indicators that are relevant within a particular setting. Specifically, Ravallion (2011) criticises the composite indices for not being so useful for sound development policy making because they essentially ‘collapse’ important dimensions into a single index which is difficult to interpret. Another criticism of the multidimensional indices raised by Ravallion (2011) is how weights are applied in the construction of multidimensional indices. Specifically, in as much as it is recognised that poverty is multidimensional, weights need not be determined by the poverty analyst but should rather be consistent with the choices made by the poor people. The second approach considers two or more dimensions of poverty such as income, education, health, etc. but analyses each dimension independently, without taking into account the possible correlations which may exist between the dimensions (e.g., Sahn & Stifel, 2003; Mussa, 2013). The third approach also considers two or more dimensions but unlike the second approach takes into account the potential interrelationships among dimensions (e.g., Gondwe, 2011; Duclos, Sahn, & Younger, 2006; Batana, 2008; Batana & Duclos, 2008). In this approach, a poverty line is set for each of the 10 Stellenbosch University https://scholar.sun.ac.za dimensions and then a decision is made as to whether an individual is to be considered poor if deprived in just one, some or all of the chosen indicators. In the literature, poverty has been found to be higher in distributions with higher correlations between the measures of well-being than those with lower correlations. Therefore, it is possible that univariate and multivariate analyses of poverty produce different rankings of poverty between distributions (Bourguignon & Chakravarty, 2003). The third approach, therefore, looks at poverty measures that make for possible substitutions and complementarities between the levels of dimensions. Assuming two dimensions, for the substitutability assumption, we expect that the more someone has of one dimension of poverty, the less is overall poverty deemed to be reduced if their value of the other dimension is increased. On the other hand, for the complementarity assumption, increasing one dimension would reduce overall poverty. For example, transferring education from the poorly nourished to the better nourished would reduce overall poverty because better-nourished children learn better (Bourguignon & Chakravarty, 2003). Most previous studies on poverty in Malawi concentrated on unidimensional poverty analysis (e.g., Murkherjee & Benson, 2003; Bokosi, 2006). Mussa (2013) considered three dimensions of poverty and inequality in Malawi, namely household per capita consumption, education and health. However, the study looked at the three dimensions one-at-a-time (or independently) without taking into account the correlations that exist between the dimensions of well-being. Gondwe (2011) did account for the possible correlations that exist between the dimensions of poverty. Two dimensions were used, household per capita consumption and education. This study conducts spatial comparisons of multidimensional poverty and inequality using two nonmonetary dimensions, namely an asset index and child nutritional status. We look at the two dimensions separately. It is the first time attempt to apply the asset index approach to the measurement of poverty in Malawi and uses a more recent DHS data set compared to the 2004 DHS data used by Alkire and Santos (2010). Also, the study conducts pro-poor growth analysis in the selected dimensions of living standards over two decades, from 1992 to 2010. As pointed out in Grosse et al. (2008), pro-poor growth analysis has recently become important to researchers and policy makers particularly with respect to monitoring progress towards the attainment of MDGs (now SDGs). However, the current efforts of propoor growth analysis have largely focussed on monetary dimensions of poverty thereby ignoring the non-monetary aspects of well-being. This study tries to reduce this shortcoming in the literature of the current pro-poor growth analysis by using two non-monetary dimensions of poverty, namely assets and nutritional status which are also central to the attainment of SDGs. 11 Stellenbosch University https://scholar.sun.ac.za Household income, consumption expenditures and assets are the three main indicators of economic status that exist in the literature. The use of assets has gained popularity in recent years (Filmer & Pritchett, 1998; Sahn & Stifel, 2000; Booysen, Van der Berg, Burger, Von Maltiz, & Du Rand, 2008). Using data from Demographic and Health Surveys (DHS), an index is computed from a number of asset variables and this forms the basis for ranking households by their long-run socio-economic status. Furthermore, assets as a measure of economic status have been found to have more advantages than both income and expenditure. We discuss four main advantages. Firstly, unlike assets, both income and household expenditures are associated with measurement problems. For example, many respondents hide their incomes and only provide income figures in ranges. Income and consumption are also associated with seasonality and, therefore, unreliable as long term measures of status. Secondly, unearned income such as interest on loans, gambling, etc. is not reported. Income on home production and self-employment activities is usually excluded just as expenditure on non-routine goods and services. Thirdly, it is usually the income or consumption expenditure of the respondent (in most cases household head) that is recorded as opposed to the rest of the household members. Fourthly, data collected on income and expenditure is usually over the past month, week or day thereby raising questions as to what period of time should be covered (Rutstein & Johnson, 2004). Based on the foregoing discussion, wealth is not only said to represent a more permanent status than income and expenditure but also more easily measured with only a single respondent required in most cases. In addition, the collection of asset information requires fewer questions than in income and expenditure surveys (Rutstein & Johnson, 2004). The study achieves five objectives. First, it presents spatial poverty and inequality comparisons in assets and child nutritional status across population groups (areas, regions and sex of household head) in Malawi. Related to the first objective, we conduct poverty and inequality decompositions to see the relative contributions by the respective population groups or distributions. Second, it establishes a robust ranking of poverty and inequality across the groups that are compared. Third, it identifies the factors associated with asset poverty and child nutritional status in Malawi. Fourth, it tracks the incidences of asset poverty and child malnutrition in Malawi over the past two decades using a series of cross-sectional data sets. Finally, it establishes if the observed changes in living standards and child nutritional status over time have been pro-poor, absolutely and relatively speaking. Relative pro-poor changes in welfare have implications for inequality since poor people benefit more from the changes than the rich. 2.2 Theoretical considerations in poverty measurement Three conditions are necessary for poverty measurement, namely a set of welfare indicators, poverty line and poverty measure (World Bank, 2004). The first condition is the choice of the welfare indicator which can be grouped into two, namely monetary (e.g. consumption or income) and non-monetary 12 Stellenbosch University https://scholar.sun.ac.za dimensions (e.g. assets or child-nutritional status). There is debate in the literature regarding which is a better indicator of welfare. The second condition is the choice of the poverty which can be looked at as the threshold separating the poor from the non-poor with the former falling below it. There exist two definitions of the poverty line. On the one hand, we have the absolute poverty line which is set for a particular group without reference to other members in the population. This poverty line is determined with respect to the basic needs needed for a living by a household or individual. On the other hand, we have the relative poverty line set with reference to the population, say at 60% of the average percapita consumption. The choice of which poverty line to use depends on the population we are studying. For poor countries, an absolute poverty line seems appropriate since emphasis is to ensure that the basic needs of the population are met. However, for richer countries that have met the basic needs a relative poverty line would make sense. Having chosen the measure of welfare and poverty line, the third step is the choice of the poverty measure to use (Haughton & Khandker, 2009). Several poverty measures are available in the literature such as the Watts index and the Sen-Shorrocks-Thon index, among others. A good poverty measure is supposed to satisfy some basic axioms to be considered reliable (see for example Sen, 1976; Kakwani, 1980; Foster, Greer, & Thorbecke, 1984). In this study, we use the Foster-Greer-Thorbecke (FGT) measures because of the decomposability property which they possess in addition to other favourable characteristics. We consider three FGT indices, namely the headcount index, poverty gap index and the squared poverty gap or poverty severity index (Foster, Greer, & Thorbecke, 1984). The FGT measures are given as: 1 P z ,    N   z  yi   I  yi  z    z  i 1  N (2.1) Where: yi , z, N , are the welfare indicator, poverty line, population size and measure of poverty aversion, respectively. I . is an indicator function that takes on a value of 1 if the expression is true and 0 otherwise. When   0 , the result is the poverty headcount index which is a measure of the proportion of the population that is poor. For   1 we have the poverty gap index which indicates the extent to which individuals on average fall below the poverty line and expresses it as a percentage of the poverty line. Finally, when   2 , we have the squared gap index which averages the squares of the poverty gaps relative to the poverty line. Although the headcount index P0  is easy to understand and measure, it does not indicate how poor the poor are. Unlike the headcount index, the poverty gap index P1  measures the extent to which 13 Stellenbosch University https://scholar.sun.ac.za individuals fall below the poverty line (the poverty gaps) as a proportion of the poverty line. The main limitation of the poverty gap index is that it does not reflect changes in inequality among the poor. By averaging the squares of the poverty gaps relative to the poverty line, the squared poverty gap index (also called the poverty severity index, P2 ) is able to show the changes in inequality among the poor. 2.3 Inequality measurement Unlike poverty analysis which only focuses on the poor individuals or households, inequality is defined over the entire population and takes into account both the rich and the poor. Here we only consider the Gini and Theil indices measures due to their desirable properties. The other measures of inequality discussed in the literature include Decile Dispersion Ratio, Atkinson’s inequality measures and Coefficient of Variation (see Haughton & Khandker, 2009; Duclos & Araar, 2009, for discussion). The Theil indices are advantageous because they are additive across different population subgroups and enable us to see between and with group inequalities. On the other hand, the Gini is easy to understand and has a desirable graphical representation. It is for this reason that the Gini is preferred in most studies. The Gini coefficient varies between 0 (representing equal distribution) and 1 (representing a complete inequality). On the other hand, Theil index values vary between zero and infinity, which reflect complete equality and inequality, respectively. Graphically, the Gini coefficient is calculated as the area above the curve but below the line of perfect equality divided by the total area below the line of perfect equality. Apart from measuring the level of inequality, the Lorenz curve is also be used to test for inequality dominance between two distributions. The Gini coefficient is calculated by the following formula: N G  1    xi  xi 1  yi  yi 1  (2.2) i 1 Where: G refers to the Gini coefficient; x i is the cumulative proportion of the population (represented on the x-axis) and y i be the cumulative proportion of the welfare indicator (in our case child-nutritional status and asset index). If there are N equal intervals on the x-axis, equation (2.2) collapses to: Gini  1  1 N N  y i 1 i  y i 1  (2.3) 14 Stellenbosch University https://scholar.sun.ac.za The Gini satisfies four main properties and these are mean independence, population size independence and the Pigou-Dalton Transfer sensitivity. However, the Gini does not satisfy two important characteristics, namely decomposability by population groups, dimensions or sources and statistical testability over time although this is less problematic now due to the fact that confidence intervals can typically be obtained through the use of bootstrap techniques. The Theil index measures satisfy all of the six properties. The Theil indices are part of a larger family of measures referred to as the Generalised Entropy (GE) class of indices. The general specification of the GE measures is given as:   1  1 N  yi       1 GE      1  N i 1  y    (2.4) Where: y is the selected welfare indicator or dimension and y is its average or mean. The parameter  gives the weight given to distances between values of a given indicator at different parts of the distribution, and can take any real value. The GE index is more sensitive to changes in the lower tail of the distribution for lower values of  , and for higher values, GE is more sensitive to changes occurring at the upper tail. When   0 , we have the Theil-L index, also called mean the mean log deviation measure, and when   1 , the result is the Theil-T index. 2.4 Stochastic dominance analysis Dominance tests are necessary because poverty or inequality ranking can be reversed by different choices of poverty lines, measures, aggregation procedures and samples. Stochastic dominance analysis seeks to achieve non-ambiguous ranking in terms of welfare and inequality between any two distributions (Araar, 2006; Davidson & Duclos, 2000). First, we discuss poverty dominance. Assuming two distributions, A and B, for our dimensions of poverty, namely asset index and child nutritional status, F A and F B will be the cumulative density functions (CDFs). Distribution B is said to dominate distribution A stochastically at first order if, for any argument y , F A  y   F B  y  . In terms of poverty, this means that there is (weakly) more poverty in distribution A than there is in B. Higher orders of stochastic dominance are obtained through repeated integrals of the CDF of each distribution (Davidson & Duclos, 2000). Generally, we have: y D 1  y   F  y , D s 1  y    D s  z  dz, for s  1,2,3,... 0 15 (2.5) Stellenbosch University https://scholar.sun.ac.za Where: D 1 is the CDF of the distribution under study; D 2  y  is the integral of D 1 from 0 to y; D 3  y  is the integral of D 2 from 0 to y, and so on. By definition, distribution B dominates A at order s if D A  y   DB  y  for all arguments y  [0, z max ] s s . The lower limit of 0 represents the lowest value of the welfare indicator while zmax is the maximum acceptable poverty line for each welfare indicator. First-order dominance implies dominance at all higher orders (Davidson & Duclos, 2000). Where first order dominance is not established, we proceed to higher levels but stop at third order dominance as is the practice in the literature (e.g., Mussa, 2013). Lorenz curves are the widely used approach to testing stochastic dominance in inequality (Araar, 2006). A given distribution is said to Lorenz dominate another distribution if the Lorenz curve of the first distribution lies everywhere above that of the latter. We then say that there is less inequality in the distribution with the higher curve than in that with a lower curve. Simply put, inequality is higher in A than in B if L B  p  is everywhere above L A  p  . Distribution B dominates distribution A in inequality, with the second order, if L A  p   L B  p  p  0,1 (2.6) Where p is the percentile. The Lorenz curve for the percentile p can be defined as follows: p  Qq dq  1 Qq dq L p     Qq dq  0 1 p (2.7) 0 0 L  p  is the cumulative proportion of the welfare indicator (asset index or child-nutritional status) held by a cumulative percentage p of the population, when individuals are ordered in increasing asset or the bottom p p  Qq dq gives the sum of the values of the welfare indicator of proportion (the poorest 100 p % ) of the population.  Q q dq gives the sum the welfare child-nutritional values. The integral 0 1 0 indicator values of all (Duclos & Araar, 2006). The inequality dominance tests used in this study are based on Araar’s (2006) theoretical developments. Specifically, generalised Lorenz dominance tests are used, and these turn out to be the same thing as second-order stochastic poverty dominance (Araar & Duclos, 2013). 16 Stellenbosch University https://scholar.sun.ac.za 2.5 Poverty and inequality decomposition As indicated in Sections 2.2 and 2.3, the FGT and GE indices are decomposable by population groups. In this study, we follow decompositions based on Araar and Duclos (2013). The decomposition of the FGT index enables us to determine the absolute or relative contribution of each group such as area, region or etc. It takes the following form: G Pˆ  z ;     ˆ g Pˆ z ;  ; g  (2.8) g 1 Where: G refers to the total number of population groups; Pˆ  z ;  ; g  and ˆg  are the estimated FGT ˆ g Pˆ z;  ; g  are the estimated index and population share of subgroup g ; ˆ g Pˆ  z;  ; g  and Pˆ z;   absolute and relative contributions to total poverty by subgroup g . GE decomposition takes the following form:  K  ˆ k   ˆ   I k ;    I   Iˆ    ˆk  k 1  ˆ   (2.9) Where: K refers to the total number of population groups; ˆk  is population share of subgroup k ; ̂ k   is the mean of the selected indicator subgroup k ; Iˆk ;  is the inequality within subgroup k ; I   is population inequality if each individual in subgroup k is given the mean for the poverty indicator of subgroup k , ̂ k  . 2.6 Pro-poor growth analysis In the literature, outcomes of pro-poor growth between any two given periods are analysed by calculating the growth rate (g) and five different pro-poor indices (Duclos & Verdier-Chouchane, 2010). The first three of these indices are measures of absolute pro-poorness and they are: the Ravallion and Chen (2003) index, the Kwakwani and Pernia (2000) index and the PEGR index. The other two indices namely, the Ravallion and Chen (2003) index minus (g) and the PEGR index minus (g) are indices of relative pro-poor growth. There exist two different approaches to the definition of pro-poor growth, namely a relative and an absolute approach. Growth is defined as pro-poor in the absolute sense if it reduces absolute poverty. 17 Stellenbosch University https://scholar.sun.ac.za Using the relative approach, growth is pro-poor if reduces inequality and relative poverty. In this sense, the poor proportionately benefit more from growth than the non-poor. If the growth rate and the Ravallion and Chen (2003), the Kwakwani and Pernia (2000) and the Poverty Equivalent Growth Rate (PEGR) indices are positive, there is absolute pro-poor growth from one period to another. When g is positive and the Kwakwani and Pernia (2000) is negative or when g is negative and the Kwakwani and Pernia (2000) index is positive, then the distributive change has increased absolute poverty. Growth is said to be anti-poor when this is the case. When the Ravallion and Chen (2003) minus g and the PEGR minus g are positive, the distributive change is considered to be relatively pro-poor. A similar conclusion is arrived at if the Kakwani and Pernia (2000)'s index is larger than 1. In this case, growth among the poor is higher than average growth. The poor have, therefore, been favourably affected by the change. In order to understand Ravallion and Chen (2003)'s growth incidence curves, we, first of all, explain what a “quantile” is. Suppose there are n incomes in a given distribution ranked from the lowest to the highest. A quantile of a given population is given by the income level that is found at a particular rank in that distribution. The rank of the level of income yi will be given by i / n . Growth incidence in the population can be understood by comparing quantile curves before and after a change in a distribution has taken place. Let the pre-change distribution be given by y A and the post-change distribution be given by y B , each of equal size n . We can build quantile curves for each of these distributions; these are given by the incomes y iA and y iB found at different ranks i / n . We can then assess the incidence of growth at any particular rank i / n by comparing the quantile curves at the point i / n . The absolute value change is given by y iB  y iA . The proportional change is given by y iB  y iA y iA . The Ravallion and Chen (2003) growth incidence curve is a plot of the proportional change against all possible values of ranks i / n . The incidence curve shows the rates of growth for various ranks in the distribution. Absolute pro-poorness of growth is obtained when the absolute value change is everywhere positive for the range of ranks over which the initially poor individuals or households are located. Relative pro-poorness of growth is obtained when the growth incidence curve is everywhere above the proportional change in the mean income. 18 Stellenbosch University https://scholar.sun.ac.za The Kakwani and Pernia (2000) index compares the actual poverty outcome of a distributive change to the outcome that would have been observed if the change had been distribution-neutral (Kakwani & Pernia, 2000). Two main distribution-neutrality criteria are provided. The first one assumes that everyone's income has changed by the same absolute amount while the second one considers that everyone's income has changed by the same proportional amount. There exist several views on what that proportion should be but the most common one is the proportional change in average income. Suppose P A and P B be the actual pre- and post-change poverty levels, and P BN let be post change PA  PB is the ratio of the actual change in poverty to poverty under distribution neutrality. Then, P A  P BN the change that would have been observed under distribution neutrality. Several poverty indices can be chosen for P . In the main text, P is specified as the headcount ratio. Several scenarios of distribution neutrality can also serve to specify BN . Let for instance A  y 1A , y 2A ,..., y nA and B  y 1B , y 2B ,..., y nB . Kakwani and Pernia (2000)'s index uses the following definition for BN :     BN   B y1A , B y 2A , B y nA  A A   A (2.10) It says that a change is distribution neutral if incomes change in proportion to the proportional change in average income. This index thus gives the ratio of the observed change in poverty to the change that would have been observed under constant inequality. The Poverty Equivalent Growth Rate (PEGR) index, also called the Kakwani, Khandker and Son (2003) index, assesses the pro-poorness of growth by calculating “poverty equivalent growth rates”. PEGR is the growth rate that would have resulted in the same observed level of poverty change if the distribution of income shares had not changed. PEGR can be thought of as the counterfactual distribution of income   BN  1  PEGR  y 1A , 1  PEGR  y 2A ,..., 1  PEGR  y nA as giving the same final level of poverty as A the one that is actually observed. When the growth rate PEGR is applied to all of the initial income y , B poverty thus equals poverty with the distribution of y .We, therefore, have PB  PBN and thus that PA  PB  PA  PBN If PEGR is greater than 0, the distributive change is judged to be absolutely pro-poor by this approach. This is the case if and only if PA  PB  0 . 19 Stellenbosch University https://scholar.sun.ac.za Let g  B   A and it is thus the actual rate of growth in average income. If income shares remain A constant in the movement from A to B , then we must have that y iB  1  g  y iA for all i . Since PA  PBN , with constant income shares, it must be that PEGR  g . The poverty equivalent growth rate is, therefore, just the usual growth rate if inequality has remained unchanged. Movements in inequality will, however, create a divergence between the poverty equivalent growth rate and the usual growth rate. The greater the adverse effects of inequality on the poor, the greater the value of PB , and therefore the lower the value of PEGR . The difference, PEGR  g , can, therefore, help assess whether the distributive change has affected the income shares of the poor. If PEGR  g is negative, growth among the poor is lower than average growth, and the income shares of the poor have therefore been adversely affected by the change. The converse is true when PEGR  g is positive. Although pro-poor growth analysis based on non-monetary dimensions has been shown to yield important results, there exist a number of potential problems of extending pro-poor growth analysis to non-monetary dimensions of welfare. Grosse et al. (2008) provide a good discussion of some of these potential limitations. In the context of this study, the main challenge relates to comparing relative changes in the asset index score or HAZ score on a linear scale. For example, in our data sets, HAZ scores range from -6 to 6 which clearly includes negative, 0 and positive values. This raises the question of how to compare a relative change from a score of -6 in 1992 to -5 in 2010 with an improvement from 5 to 6 over the same period. We overcome this challenge by transforming the z-scores into percentiles in the distribution. Similarly, we convert the asset index scores so that they are all positive by adding to the asset index a value slightly higher than the absolute value of the most negative number. The conversions and transformations of the HAZ and asset index scores are further discussed in Section 2.7. 2.7 Data Malawi has participated in four nationally representative Demographic Health Surveys (DHSs). These surveys have provided up-to-date information on living conditions and health programmes in Malawi. The surveys were conducted by the National Statistical Office (NSO) and the Community Health Sciences Unit (CHSU) during different times of the calendar year. Traditionally, DHSs consist of data sets relating to the household, men, women and children. This study makes use of the household and children’s data sets only. Child nutritional status is particularly important because it affects their growth and development, which has a direct link to their future health status as adult men and women. The household data set contains information on assets which are found to be more reliable than consumption 20 Stellenbosch University https://scholar.sun.ac.za expenditure and income as a measure of the long-term living standards for households, as we explain shortly. Table 2.1 provides a summary of the DHS data sets used in the study. Detailed tables showing the distribution of households and children by the different subgroups are respectively presented in Tables A1 and A2 in the appendices. Table 2.1: Summary of Malawi DHS data sets used Year Survey period No. of households No. of children (0-59 months) 1992 September-November 1992 5,323 3,353 2000 July-November 2000 14,213 9,753 2004 October 2004 -January 2005 13,664 8,707 2010 June-November 2010 24,825 4,801 Source: Own computation from MDHS data From the data sets, we derive two non-monetary measures of welfare, namely household asset index and child nutritional status in terms of anthropometric indices of children. The number of children indicated in the table excludes children whose age, height and weight measurements are missing. Poverty and inequality measurement is based on the 2010 data sets only. On the other hand, pro-poor growth analysis is done using all the DHS survey periods from 1992 to 2010. As earlier discussed, in the literature the use of assets as a measure of economic status has been found to be advantageous over household income and consumption. Wealth6 is not only said to represent a more permanent status than income and expenditure but is also more easily measured with only a single respondent required in most cases (Rutstein & Johnson, 2004). Since wealth cannot be directly observed, asset variables have to be identified to proxy wealth. There exists no best approach for selecting which asset variables to use (Montgomery et al., 2000). Although the choice of asset variables has varied by author, generally all assets and utility services reflecting the economic status of a household need to be included. This broader criterion, as opposed to selecting a few assets, is preferred because using a large number of variables potentially prevents a situation where households are concentrated on certain index scores (Rutstein & Johnson, 2004). In adopting the broad criterion, the study uses 31 asset variables and access to utilities including household assets, means of transport, source of lighting, ownership of livestock, agricultural land, having separate room for kitchen, rooms for sleeping, source of drinking water, type of toilet facility, type of material for the main floor, main wall material, main roof material and type of cooking fuel. 6 In the literature, wealth and asset index are loosely used synonymously because the assets owned by a household represent its asset wealth. 21 Stellenbosch University https://scholar.sun.ac.za The derivation of the asset index requires that the indicator variables for asset ownership7 be captured or transformed into binary form, e.g. 1 for “yes” if a household owns a given asset and 0 for “no” if a household does not own the asset. It is, however, not necessary to transform variables that are already categorical such as the source of drinking water, type of toilet facility, floor, wall, roof material and source of cooking fuel. Stata 13.1, the software used, recognises the categorisation automatically even in cases where there are more than two categories. After categorisation in the manner explained, the mca command was applied to the asset variables. The index is weighted by household size and sample weight. Principal components analysis (PCA) and factor analysis (FA) were used as robustness checks. Results from MCA showed that assets and living conditions associated with good higher economic status contribute negatively to the index while those associated with low economic status contribute positively to the index. For example, ownership of a durable asset (e.g. radio, television, refrigerator, etc.) and having piped water, flush toilet, carpet floor and good source of lighting and cooking fuel, among others, contribute negatively to the asset index. On the other hand, not owning a durable asset and living in poor conditions contribute positively to the asset index. The negative signs imply that the index generated from MCA is not an asset index but rather a poverty index. This implies that if quintiles were to be created from the index, the poorest households would be in the fifth quintile and the richest households in the first quintile. This complicates the interpretation of the index and the suggested solution in the literature is to generate an asset index by multiplying the poverty index by (-1). When the transformation is done, the richest households would now be in the fifth quintile and the poorest households in the first quintile, giving the normal interpretation of an asset index. This approach is motivated by Greenacre (2007) and adopted in the other studies (e.g. Da Maia, 2012). In this case, the poverty and asset indices are negative opposites of each other. Asset and poverty indices typically contain negative values which are unsuitable for poverty and inequality measurement. Following the literature, we adjusted the asset index by adding to it a value slightly higher than the absolute value of the most negative number (e.g., Da Maia, 2012). In our case, since -1.056288 was the minimum value, we added 1.056289 to the asset index. This results in a shift of the distribution of the asset index to the right. The resultant asset index, which we call the adjusted asset index, is what is used for poverty and inequality analysis. Some selected summary statistics for the poverty and asset indices are provided in Table 2.2. 7 In this study, asset ownership is used to mean ownership of private assets as well as access to public services. 22 Stellenbosch University https://scholar.sun.ac.za Table 2.2: Descriptive statistics for the poverty and asset indices Description Poverty index Asset index Adjusted asset index Percentiles 1% -3.65 -0.92 0.14 5% -1.84 -0.85 0.20 10% -1.00 -0.80 0.26 25% -0.08 -0.67 0.38 50% 0.43 -0.43 0.63 75% 0.67 0.08 1.13 90% 0.80 1.00 2.06 95% 0.85 1.84 2.90 99% 0.92 3.65 4.71 Statistic Minimum -5.29 -1.06 0.00 Maximum 1.06 5.29 6.34 Mean 0.11 -0.11 0.94 Standard deviation 0.91 0.91 0.91 Variance 0.82 0.82 0.82 Skewness -2.35 2.35 2.35 Kurtosis 9.36 9.36 9.36 Observations 24,825 24,825 24,825 Source: Own computation from MDHS 2010 In Figure 2.1, we show the adjusted and unadjusted asset indices in addition to some kernel density plots across population sub-groups. The results show that urban areas have a higher average of asset index scores compared to rural areas. Amongst regions, the Northern region has the highest asset score values seconded by the Central region and finally the Southern region. A similar trend is observed amongst Rural north, Rural centre and Rural south. On the other hand, Urban centre and Urban south seem to be doing better than Urban north. 23 Stellenbosch University https://scholar.sun.ac.za 0 0 .5 .5 Density Density 1 1 1.5 1.5 Figure 2.1: Adjusted and unadjusted asset indices by population subgroups -2 0 2 MCA asset index 6 0 Unadjusted 2 MCA asset index 4 Urban areas 6 Rural areas 0 0 .5 .5 Density Density 1 1 1.5 1.5 Adjusted 4 0 2 4 6 Central region 0 Southern region 2 4 MCA asset index Rural north Rural centre 6 Rural south 0 0 .1 .5 Density Density .2 1 .3 1.5 .4 Northern region MCA asset index 0 2 Urban north MCA asset index Urban centre 4 6 0 Urban south 2 MCA asset index Male head 4 6 Female head Source: Own computation from MDHS 2010 With respect to our second poverty measure, i.e. child- nutritional status, we calculate three anthropometric indices or z-scores, namely height-for-age (HAZ), weight-for-age (WAZ) and weightfor-height (WHZ) for children aged between 0 and 59 months. We discuss each of these three measures separately in the next few paragraphs. Firstly, HAZ measures stunted growth and reflects cumulative linear growth and failure to receive adequate nutrition over a long period. HAZ, therefore, indicates the long-term effects of malnutrition in 24 Stellenbosch University https://scholar.sun.ac.za a population. Secondly, WHZ is a measure body mass in relation to body height or length and reflects the current nutritional status. It can be used to describe body wasting which represents the failure to receive adequate nutrition in the period immediately preceding the survey. Wasting may result from inadequate food intake or recent episodes of illness causing loss of weight and the onset of malnutrition. Finally, WAZ (a composite index of HAZ and WHZ) gives the overall malnutrition level and takes into account both acute and chronic malnutrition. It measures the state of body weight. The z-scores are calculated using the new 2006 World Health Organisation (WHO) child growth standards based on the Multicentre Growth Reference Study done on a ‘healthy’ sample size of 8,440 children drawn from six countries across the world. The analysis is done using the zscore 06 module8. The z-scores express normal and abnormal departures of an individual child's height or weight from the average height or weight of comparable children of the same sex and age in the standard reference population. According to WHO (2006), z-scores below -2 standard deviations (SD) and -3 SD from the median of the reference population indicate malnourishment and extreme malnourishment, respectively. Normal children have z-scores of greater than or equal to -2 SD (>=-2SD). Figure 2.2 provides kernel density estimates for HAZ, WAZ and WHZ. Summary descriptive statistics are provided in Table 2.3. .3 .2 .1 0 Cumulative proportion of children .4 Figure 2.2: Distribution of anthropometric Z-scores for HAZ, WAZ and WHZ -6 -4 -3 -2 0 2 Anthropometric Z-scores HAZ WAZ 4 6 WHZ Source: Own computation from MDHS 2010 Levels of child malnutrition seem to depend on the choice of measure. HAZ shows the highest levels of child malnutrition, whereas WAZ and especially WHZ appear to reflect lower levels. Child malnutrition rates for the sample (i.e. scores more than 2 standard deviations below the mean for the standard 8 zscore06: Stata command for the calculation of anthropometric z-scores using the 2006 WHO child growth standards; http://www.ifpri.org/staffprofile/jefleroy. 25 Stellenbosch University https://scholar.sun.ac.za reference population) stand at 46%, 14% and 4% as based on HAZ, WAZ and WHZ, respectively. Extreme child malnutrition (more than 3 standard deviations below the reference population mean) stands at 19%, 3% and 2% based on HAZ, WAZ and WHZ, respectively. Table 2.3: Child malnutrition rates by population groups Description HAZ <-2SD < -3SD Age 0-23 38.60% 18.20% 24-59 50.70% 19.80% Sex Male 49.00% 22.20% Female 42.20% 16.20% Areas Urban 39.40% 15.30% Rural 46.60% 19.80% Region Northern 42.50% 17.90% Central 45.40% 18.50% Southern 46.40% 20.00% Residence Rural North 42.50% 18.30% Rural Centre 46.30% 19.20% Rural South 48.10% 20.80% Urban North 41.70% 14.90% Urban Centre 40.50% 14.40% Urban South 38.10% 16.20% Total 45.50% 19.10% Source: Own computation from MDHS 2010 WAZ <-2SD < -3SD WHZ <-2SD < -3SD 13.50% 13.80% 3.70% 3.30% 6.40% 2.30% 2.60% 0.70% 14.80% 12.60% 3.10% 3.80% 4.40% 3.80% 1.80% 1.30% 11.40% 14.10% 3.20% 3.50% 2.40% 4.40% 0.60% 1.70% 12.70% 14.10% 13.40% 2.50% 4.00% 3.20% 2.70% 4.40% 4.10% 0.50% 1.80% 1.50% 13.80% 14.30% 13.90% 3.40% 13.20% 11.10% 13.70% 2.50% 4.00% 3.30% 2.50% 3.60% 2.80% 3.50% 3.00% 4.60% 4.50% 0.80% 2.80% 2.20% 4.10% 0.60% 2.00% 1.70% 0.00% 0.60% 0.60% 1.50% The level of malnutrition is higher amongst children aged above 24 months compared to those below 24 months. We follow the WHO age-group comparison although a more detailed analysis can be done across smaller age groups. Malnutrition levels are also lower amongst girls compared to boys. Urban residents have a lower incidence of child malnutrition compared to rural residents. Amongst regions, child malnutrition is the highest in the Southern region followed by the Central region. Similar observations are made across areas and regions combined. However, it is worth noting that the gap in the levels of child mal-nutrition across regions, areas and regions are smaller compared to those obtained by age, sex and area; we get very similar incidence levels of child mal-nutrition amongst the three regions, rural areas (Rural north, Rural Centre and Rural South) and urban areas (Urban north, Urban Centre and Urban South). This seems to suggest that Malawi is uniform in terms of the incidence of child malnutrition. On the other hand, we get a different picture 26 Stellenbosch University https://scholar.sun.ac.za with assets where the incidence levels seem much different not only between areas but also across regions and by sex of the household head. 2.8 Poverty lines Based on household expenditure data, 50.7% of households in Malawi live below the poverty line. The asset poverty line is set at an equivalent of 50.7% since it is assumed that the appropriate asset poverty line should place the same proportion of households in poverty. The absolute poverty line for the asset index is, therefore, the value at the 50.7th percentile and this turns out to be 0.63399. With respect to child nutritional status, we convert each of the three anthropometric z-scores into percentiles. This approach has also been previously used by Mussa (2010). The conversion involves calculating the area under the standard normal curve to the left of the z-score. The area under the curve adds up to unity and the mean (z-score of 0) splits the area into two halves of 0.5 each. The conversion is monotonic and does not affect the ranking of the children. Therefore, for each point of the SDs, there is a corresponding percentile or cumulative probability which is fixed along the x-axis. For example, if we look up in the standard normal distribution tables, a z-score of -2 gives 0.0228 as the area to the left of -2 or simply the 2.3rd percentile. Similarly, -3 corresponds to 0.0013 or the 0.13th percentile. In this study, we are interested in malnutrition (<-2 SD) as opposed to extreme malnutrition (<-3 SD) and, therefore, use 2.3 as our poverty line. A percentile gives the value of a variable below which a certain percentage of observations (or population) falls. In our case, as we can see from Figure 2.2, almost half of the data points for HAZ lie to the left of -2. We also note that all our three anthropometric measures (HAZ, WAZ and WHZ) follow the standard normal distribution pattern. 2.9 Cumulative density curves Cumulative density curves (CDCs) indicate how poverty incidence varies with the level of poverty lines but are also used to test for dominance between two distributions. We show the CDCs for the asset index (when   0 ) in Figure 2.3.A distribution whose curve lies above the other reflects a higher level of poverty. The figures generally show that there is poverty dominance in the poverty relevant range. The CDCs only cross each other at very high asset levels where it is difficult to conclude that poverty is higher in one population subgroup than the other. However, it is only the poverty relevant range that we are interested in for practical purposes. The figures also show that the gap between the CDCs is largest between urban and rural areas; small differences exist between regions and by sex of household head. 27 Stellenbosch University https://scholar.sun.ac.za 1 .6 .2 0 0 .2 .4 .4 .6 .8 .8 Cumulative proportion of households Cumulative proportion of households 1 Figure 2.3: MCA asset index cumulative density curves by population groups 0 .63399 1.4 2.8 4.2 Poverty line (MCA asset index) 7 0 .63399 Rural areas 1.4 2.8 4.2 Poverty line (MCA asset index) Northern region Central region 5.6 7 Southern region 0 .2 .2 .4 .4 .6 .6 .8 .8 C um u la tive p rop ortio n of h o use ho ld s C um u la tive p rop ortio n of h o use h o ld s 1 1 Urban areas 5.6 .63399 1.4 2.8 4.2 Poverty line (MCA asset index) Rural north Urban north Rural centre Urban centre 5.6 7 0 0 0 Rural south Urban south .63399 1.4 2.8 4.2 Poverty line (MCA asset index) Male head 5.6 7 Female head Source: Own computation from MDHS 2010 2.10 FGT poverty estimates Table 2.4 shows our poverty estimates for the three FGT classes, namely the headcount index (  0) , the average poverty gap index (  1) and the poverty severity index (  2) , respectively. Household observations are weighted by sampling weights and household size. Sampling weights are used so that the chosen households are representative of all households in Malawi. On the other hand, household size takes into account the effect of size on welfare. Thus, a poor household with more members is given a higher weight in the analysis than a similarly poor household with fewer members. For the children’s data set, we only apply sampling weights since household size is not applicable for individual level data. Without the use of weights, our results would be either overestimated or underestimated. The difference between the results in Table 2.4 and Figure 2.3 is that the latter is calculated for the whole range of the poverty lines while the former is calculated at a specific and chosen level of the poverty line. For example with the poverty line set at as asset index of 0.63399, the table shows that 46.0% of 28 Stellenbosch University https://scholar.sun.ac.za the Malawian population is asset poor for   0 . For the same measure (headcount index), asset poverty is higher in rural areas (53.3%) compared to urban areas (7.2%). The Central region has the highest levels of asset poverty amongst the three regions with the incidence of household poverty at 51.3%. A similar observation is made for rural centre and urban centre where 59.4% and 9.2% of the households are living below the poverty line, respectively. Asset poverty is also higher in female-headed households than male headed households. Child malnutrition estimates are dependent on the type of measure used except for rural-urban population group where rural areas are found to be the poorer than urban areas for all the three indicators, namely HAZ, WAZ and WHZ. At a national level (  0) , 47.1% of the children are malnourished with respect to HAZ compared to 13.2% and 4.2% for WAZ and WHZ, respectively. Table 2.4: Poverty headcount, average poverty gap and poverty severity estimates Description Area Urban Rural Region Northern Central Southern Residence Rural north Rural centre Rural south Urban north Urban centre Urban south Sex Male head Female head Malawi α=2 α=0 WHZ α=1 40.8% 31.0% 26.4% 48.3% 38.0% 33.2% 10.5% 6.7% 5.1% 13.7% 8.7% 7.0% 2.5% 4.6% 1.8% 1.5% 3.3% 2.8% 7.2% 2.4% 1.1% 53.3% 20.3% 10.1% 44.8% 34.6% 30.0% 47.2% 37.2% 32.3% 47.6% 37.3% 32.6% 11.8% 6.8% 5.2% 13.5% 9.0% 7.2% 13.2% 8.3% 6.5% 2.9% 4.5% 4.3% 1.5% 1.0% 3.4% 2.9% 3.0% 2.6% 32.4% 11.0% 5.1% 51.3% 20.6% 10.6% 44.5% 16.2% 7.7% 44.9% 48.1% 49.3% 43.8% 42.1% 39.2% 12.9% 13.8% 13.8% 2.9% 12.3% 10.0% 5.6% 7.3% 6.9% 2.0% 6.6% 4.2% 3.1% 4.8% 4.7% 0.9% 2.9% 2.3% 1.6% 3.6% 3.3% 0.6% 2.1% 1.8% 1.1% 3.2% 2.8% 0.5% 1.6% 1.6% 34.5% 11.8% 5.5% 59.4% 24.0% 12.3% 52.9% 19.3% 9.2% 8.6% 2.0% 0.7% 9.2% 3.1% 1.5% 5.3% 1.8% 0.8% 12.9% 8.4% 6.7% 17.0% 9.3% 6.8% 13.2% 8.4% 6.7% 4.3% 4.1% 4.2% 3.0% 2.6% 3.2% 2.7% 3.0% 2.6% 41.7% 14.8% 6.9% 59.1% 25.8% 13.9% 46.0% 17.5% 8.6% α=0 HAZ α=1 35.0% 38.0% 38.9% 32.0% 32.4% 29.5% α=2 30.5% 33.1% 34.1% 26.0% 27.7% 25.3% 47.1% 36.8% 32.1% 47.6% 38.5% 34.0% 47.1% 37.0% 32.2% α=0 WAZ α=1 7.3% 9.1% 8.7% 2.3% 8.4% 5.9% α=2 α=0 Asset index α=1 α=2 Source: Own computation from MDHS 2010 It is, however, important to state that the national poverty estimates based on HAZ and asset index are similar in magnitude suggesting a similar national poverty profile for Malawi. We also note that the differences in the incidence of poverty between groups (e.g. rural and urban areas) seem to be larger for the asset index than for the three anthropometric measures. Consequently, the results tell us that levels of malnutrition are similar in magnitude across many population subgroups in Malawi. On the other hand, the incidence of asset poverty varies much by population groups and a similar pattern is obtained based on household consumption expenditure studies such as Mussa (2013). Although we have not 29 Stellenbosch University https://scholar.sun.ac.za presented the CDCs of HAZ, WAZ and WHZ, we find them to be much closer to one another than is the case for the asset index in Figure 2.3. Poverty ranking by population group is the same for all the three indices,   0 ,   1 and   2 . However, this result only holds at the specifically chosen level of poverty line and not all poverty lines. As discussed, it is difficult to conclusively identify which population groups have higher poverty levels because the CDCs cross at different levels. The estimates depend on the choice of the poverty measure used as is the case with child nutritional status. This problem is resolved through stochastic dominance analysis dealt with in Section 2.11 which follows. 2.11 Poverty dominance analysis Our dominance tests are done up to the third order, as is the tradition in the literature (e.g., Mussa, 2013). Naturally, where first order dominance exists, there is no need to test for second or third order dominance, or where second order dominance exists, there is no need to test for third order dominance, as it would automatically hold. Table 2.5 shows the results. Poverty dominance is measured and interpreted in terms of welfare or standard of living. A distribution that dominates the other has lower levels of poverty than the one that is dominated. Table 2.5: Poverty stochastic dominance test results for population subgroups Population group pair HAZ Area Urban v. Rural U>R: order 2 Region Northern v. Central ND Northern v. Southern ND Central v. Southern ND Residence Rural north v. Rural centre ND Rural north v. Rural south ND Rural centre v. Rural south ND Urban north v. Urban centre UN>UC: order 2 Urban north v. Urban south ND Urban centre v. Urban south ND Sex Male head v. female head ND Source: Own computation from MDHS 2010 WAZ WHZ Asset ND U>R: order 2 U>R: order 1 ND ND ND N>C: order 2 N>S: order 1 ND ND N>S: order 3 ND ND ND ND ND ND ND RN>RC: order 3 RN>RS: order 1 ND UN>UC: order 2 UN>US: order 1 ND ND RN>RS: order 2 ND ND ND UC>US: order 2 ND ND MH>FH Notes: U=Urban areas, R=Rural areas, ND= no dominance up to third order, UN=Urban north, UC=Urban centre, US=Urban south, N=Northern region, C=Central region, S=Southern region, RN=Rural north, RC=Rural centre, RS=Rural south, MH=male-headed household, FH=femaleheaded household 30 Stellenbosch University https://scholar.sun.ac.za As with the FGT estimates, dominance test results for child nutritional status depend on the measure used. Where dominance is established, the order of dominance is not reversed. For example, all measures establish that urban areas are better in terms of welfare than rural areas and not the opposite. Using HAZ, urban areas dominate rural areas at second order dominance. Likewise, urban north dominates urban centre. No dominance is established for the rest of the subgroups. When WAZ is used, no dominance is established at all for all the pairs of subgroups. Finally, when WHZ is used, non-dominance is found for the Central and Southern regions, Rural centre and Rural south and Urban centre and Urban south. No dominance is established between male and female-headed households- the same finding for HAZ and WAZ. It is, however, important to note that in most of these cases, non-dominance occurs outside the poverty relevant ranges of the poverty line as illustrated in the CDCs. Using the asset index, urban areas dominate rural areas at first order. The Northern region dominates the Southern region. When areas and regions are further broken down, dominance is only established for two pairs, namely rural north dominates rural south and urban centre dominates urban south. Male headed households dominate those headed by females. 2.12 Gini and GE inequality estimates In Figure 2.4 are Lorenz curves for Malawi’s population groups based on the asset index. The Lorenz curve maps the cumulative share of the asset index, on the vertical axis against the population distribution on the horizontal axis. The 45-degree line indicates perfect equality, a case in which each household has the same share of assets. The numerical inequality estimates for both the asset index and child malnutrition are presented in Table 2.6 that follows. 0 0 .2 .2 .4 .4 .6 .6 .8 .8 Cumulative share of the (MCA) asset index Cumulative share of the (MCA) asset index 1 1 Figure 2.4: MCA asset index Lorenz curves by population subgroups 0 .2 .4 .6 Cumulative proportion of households .8 1 0 .2 .4 .6 Cumulative proportion of households 45° line Population 45° line Population Urban areas Rural areas Central region Southern region 31 .8 Northern region 1 0 .2 .2 .4 .4 .6 .6 .8 C um ulative share of the (M C A) asset index .8 1 1 C um ulative share of the (M C A) asset index Stellenbosch University https://scholar.sun.ac.za .2 .4 .6 Cumulative proportion of households .8 1 0 0 0 .2 .4 .6 .8 45° line Population Rural north Rural centre Rural south Urban north 45° line Population Urban centre Urban south Male head Female head Cumulative proportion of households 1 Source: Own computation from MDHS 2010 Table 2.6: Inequality estimates across population subgroups Description Gini Theil L (theta=0) HAZ WAZ WHZ Asset HAZ WAZ WHZ Asset Theil T (theta=1) HAZ WAZ WHZ Asset 2.165 0.756 0.279 0.205 2.563 0.889 0.429 0.261 0.852 0.364 0.149 0.170 1.058 0.459 0.186 0.267 2.416 0.787 0.292 0.244 2.547 0.918 0.430 0.353 2.481 0.840 0.408 0.362 1.018 0.424 0.159 0.230 1.021 0.447 0.181 0.356 1.029 0.445 0.186 0.361 2.487 2.608 2.532 1.811 2.189 2.182 1.040 1.051 1.070 0.845 0.850 0.842 Area Urban 0.683 0.461 0.281 0.326 Rural 0.744 0.521 0.313 0.397 Region Northern region 0.733 0.502 0.290 0.376 Central region 0.734 0.513 0.306 0.457 Southern region 0.736 0.513 0.314 0.462 Residence Rural north 0.739 0.512 0.300 0.361 Rural centre 0.742 0.519 0.309 0.394 Rural south 0.747 0.523 0.321 0.397 Urban north 0.680 0.413 0.197 0.337 Urban centre 0.681 0.475 0.292 0.338 Urban south 0.678 0.454 0.281 0.308 Sex Male head 0.734 0.512 0.308 0.441 Female head 0.741 0.519 0.313 0.478 Malawi 0.735 0.513 0.308 0.453 Source: Own computation from MDHS 2010 0.794 0.943 0.851 0.691 0.768 0.752 0.310 0.450 0.437 0.116 0.316 0.268 0.225 0.257 0.259 0.200 0.220 0.184 2.502 0.872 0.402 0.327 2.528 0.856 0.453 0.385 2.505 0.871 0.406 0.348 0.441 0.457 0.462 0.282 0.390 0.353 0.168 0.183 0.194 0.075 0.166 0.145 0.210 0.265 0.272 0.180 0.182 0.152 1.022 0.443 0.180 0.325 1.048 0.459 0.186 0.400 1.024 0.444 0.181 0.346 The results show that inequality is higher in rural areas compared to urban areas and this is consistent for all the three inequality measures used, namely Gini, Theil L and Theil T. Nevertheless, amongst the three measures of child nutrition, HAZ generally yields the highest levels of inequality. Just as with poverty measurement, levels of inequalities depend on the measure used. Consequently, it is difficult to conclude without ambiguity which population groups have higher levels of inequality. We, therefore, 32 Stellenbosch University https://scholar.sun.ac.za turn to inequality dominance testing as a solution. The inequality dominance test results are presented in Section 2.13 and act as robustness checks. 2.13 Inequality dominance analysis Results of the Lorenz dominance tests are presented in Table 2.7. Using HAZ, inequality dominance is only established for the urban and rural area pair. Since the Lorenz curve for urban areas lies above that of rural areas, inequality is said to be lower in urban areas compared to rural areas. This finding is confirmed with WHZ as well as the asset index. No dominance is established using WAZ. Table 2.7: Generalised Lorenz dominance test results across population subgroups Population group pair HAZ Area Urban v. Rural U>R: order 2 Region Northern v. Central ND Northern v. Southern ND Central v. Southern ND Residence Rural north v. Rural centre ND Rural north v. Rural south ND Rural centre v. Rural south ND Urban north v. Urban centre ND Urban north v. Urban south ND Urban centre v. Urban south ND Sex Male head v. female head ND Source: Own computation from MDHS 2010 WAZ WHZ Asset ND U>R: order 2 ND ND ND N<C N<S ND ND ND ND ND ND ND ND RN<RS: order 2 ND UN<UC: order 2 UN<US: order 2 ND ND RN<RS: order 2 ND ND ND UC<US: order 2 ND ND MH<FH: order 2 U>R: order 2 ND ND ND Notes: U=Urban areas, R=Rural areas, ND= no dominance, UN=Urban north, UC=Urban centre, US=Urban south, N=Northern region, C=Central region, S=Southern region, RN=Rural north, RC=Rural centre, RS=Rural south, MH=male-headed household, FH=female-headed household Amongst the three regions, no inequality dominance is established using HAZ, WAZ and asset index. For WHZ, inequality is found to be the highest in the Northern region compared to both the Central and Southern regions. With respect to the sex of the household head, dominance is established for the asset index only, in which case inequality is found to be higher among male-headed households. WHZ and asset index establish dominance between some areas such as Rural north, Rural south, Urban north, Urban centre and Urban south. 33 Stellenbosch University https://scholar.sun.ac.za 2.14 Poverty decomposition Table 2.8 shows results of the FGT decompositions across rural-urban areas, regions, areas and sex of the household head for the three anthropometric indicators and the asset index. We provide the estimated population share of each subgroup as well as the estimated relative contribution of each subgroup to total poverty. Three FGT measures are used, namely the poverty headcount, average poverty gap and poverty severity indices. The population shares and relative contributions to poverty sum up to unity. The objective here is to establish if the most populated areas are also the ones with the highest incidence of poverty. Our decomposition results for the asset index indicate that most poor people in Malawi are also in the rural areas-where the population is large. For example, the table shows that 84% of the households are based in rural areas which contribute 97.5% to the poverty headcount when the asset index is used as the measure of welfare. The contributions are slightly higher with the poverty gap and severity indices at 97.8% and 98%, respectively. This does not only indicate that the incidence of poverty is above the national average in rural areas but also that the poorer parts of the population are more concentrated in rural areas. Amongst the three regions, much of the poverty is contributed by the Central region followed by the Northern region. A similar observation is made within rural and urban areas, e.g. Rural north and Rural centre. Households headed by males contribute more to poverty than those headed by females. Using the nutritional measures, rural areas are also found to contribute more to poverty than urban areas but the difference between the rural population share and the contribution of rural areas to these measures of nutritional poverty is small. This is surprising since it appears as if nutritional status is not much better in urban than in rural areas, despite the fact that asset holdings in urban areas are definitely considerably greater and asset poverty less severe. With respect to the three regions of the country, it is the Central region that contributes the highest to poverty in both absolutes and relative terms. Rural centre and urban centre also contribute more to poverty than the other population subgroups. Finally, just as is the case with the asset index, male-headed households contribute more to poverty than those headed by females. 2.15 Subgroup inequality decomposition Table 2.9 presents generalised entropy inequality estimates decomposed to indicate within (vertical) and between (horizontal) subgroup inequalities. Irrespective of the measures used, inequalities are largely driven by within population subgroups as opposed to between population subgroups. Between sub-group inequality is only a greater magnitude for the urban-rural areas comparison using the Theil L or Theil T measure for asset inequality, when it contributes somewhere between one-quarter and one-third to overall asset inequality. All other between group contributions to inequality are extremely small, reflecting the fact that locational differences in nutritional status appear to be non-systematic. 34 Stellenbosch University https://scholar.sun.ac.za Table 2.8: FGT poverty sub-group decomposition Description Population share HAZ α=1 α=0 α=2 Area Urban 15.0% 13.0% 12.6% 12.4% Rural 85.0% 87.0% 87.4% 87.6% Region Northern 10.7% 10.0% 9.8% 9.8% Central 46.4% 46.3% 46.5% 46.4% Southern 42.9% 43.7% 43.7% 43.8% Residence Rural north 9.6% 9.0% 8.9% 8.9% Rural centre 39.6% 40.2% 40.5% 40.5% Rural south 35.8% 37.7% 38.0% 38.2% Urban north 1.1% 1.0% 1.0% 0.9% Urban centre 6.7% 6.0% 5.9% 5.8% Urban south 7.2% 6.0% 5.7% 5.6% Sex Male head 91.9% 91.9% 91.6% 91.5% Female head 8.1% 8.1% 8.4% 8.5% Malawi 100.0% 100.0% 100.0% 100.0% Source: Own computation from MDHS 2010 α=0 WAZ α=1 α=0 WHZ α=1 α=2 12.6% 87.4% 12.9% 87.1% 10.0% 47.8% 42.3% 9.7% 41.3% 36.4% 0.3% 6.4% 5.9% α=2 12.6% 87.4% 8.6% 91.4% 8.8% 91.2% 8.4% 91.6% 2.5% 97.5% 2.2% 97.8% 2.0% 98.0% 9.4% 48.9% 41.7% 9.4% 49.1% 41.5% 7.1% 49.6% 43.3% 5.1% 52.3% 42.7% 4.3% 53.0% 42.7% 8.4% 48.4% 43.3% 7.5% 51.1% 41.4% 7.0% 53.0% 40.0% 9.1% 42.2% 35.8% 0.4% 6.6% 5.9% 9.0% 42.7% 35.8% 0.4% 6.4% 5.7% 6.9% 45.1% 39.4% 0.2% 4.5% 3.9% 4.8% 47.8% 38.5% 0.2% 4.5% 4.1% 4.1% 49.1% 38.4% 0.2% 3.9% 4.3% 8.2% 47.0% 42.3% 0.2% 1.4% 0.9% 7.4% 49.9% 40.6% 0.1% 1.3% 0.8% 6.9% 51.8% 39.2% 0.1% 1.2% 0.7% 90.0% 91.4% 91.9% 10.0% 8.6% 8.1% 100.0% 100.0% 100.0% 35 91.6% 90.6% 90.4% 8.4% 9.4% 9.6% 100.0% 100.0% 100.0% Asset index α=0 α=1 α=2 68.5% 63.9% 60.6% 31.5% 36.1% 39.4% 100.0% 100.0% 100.0% Stellenbosch University https://scholar.sun.ac.za Table 2.9: GE inequality decomposition for asset index and child nutrition Description GE index Sub-group Type Areas Absolute Relative Within 0.254 Asset Theil L Between 0.098 Total 0.353 Within 0.236 Between 0.116 Asset Theil T Total 0.352 Within 2.503 HAZ Theil L Between 0.002 Total 2.505 Within 1.022 HAZ Theil T Between 0.002 Total 1.024 Within 0.869 WAZ Theil L Between 0.002 Total 0.871 Within 0.442 WAZ Theil T Between 0.002 Total 0.444 Within 0.406 WHZ Theil L Between 0.000 Total 0.406 Within 0.180 WHZ Theil T Between 0.000 Total 0.181 Source: Own computation from MDHS 2010 72.2% 27.8% 100.0% 66.9% 33.1% 100.0% 99.9% 0.1% 100.0% 99.8% 0.2% 100.0% 99.7% 0.3% 100.0% 99.5% 0.5% 100.0% 100.0% 0.1% 100.0% 99.9% 0.2% 100.0% Regions Absolute Relative 0.349 0.004 0.353 0.348 0.004 0.352 2.505 0.000 2.505 1.024 0.000 1.024 0.871 0.001 0.871 0.444 0.001 0.444 0.406 0.000 0.406 0.180 0.000 0.180 99.0% 1.0% 100.0% 99.0% 1.0% 100.0% 100.0% 0.0% 100.0% 100.0% 0.0% 100.0% 99.9% 0.1% 100.0% 99.9% 0.1% 100.0% 100.0% 0.0% 100.0% 99.9% 0.0% 100.0% 36 Within areas/regions Absolute Relative 0.248 0.105 0.353 0.230 0.122 0.352 2.501 0.003 2.505 1.020 0.003 1.024 0.868 0.004 0.871 0.441 0.004 0.444 0.406 0.000 0.406 0.180 0.000 0.180 70.3% 29.7% 100.0% 65.3% 34.7% 100.0% 99.9% 0.1% 100.0% 99.7% 0.3% 100.0% 99.6% 0.4% 100.0% 99.2% 0.8% 100.0% 99.9% 0.1% 100.0% 99.7% 0.2% 100.0% Household head Absolute Relative 0.344 0.009 0.353 0.344 0.008 0.352 2.505 0.000 2.505 1.024 0.000 1.024 0.871 0.000 0.871 0.444 0.000 0.444 0.406 0.000 0.406 0.180 0.000 0.180 97.5% 2.5% 100.0% 97.6% 2.4% 100.0% 100.0% 0.0% 100.0% 100.0% 0.0% 100.0% 100.0% 0.0% 100.0% 100.0% 0.0% 100.0% 100.0% 0.0% 100.0% 100.0% 0.0% 100.0% Stellenbosch University https://scholar.sun.ac.za 2.16 Spatial distribution of poverty and inequality The maps are drawn to provide a visual representation of the FGT headcount and Gini inequality estimates presented in Table 2.4 and Table 2.6. They are drawn for Malawi’s three regions (Northern, Central and Southern) using StatPlanet, an interactive data visualisation and mapping software9. The colours on the maps vary positively with the levels of poverty and inequality. Therefore, darker and lighter colours represent areas with higher and lower levels of poverty and inequality, respectively. Figure 2.5 and Figure 2.6 provide the spatial distribution of poverty and inequality using our four selected indicators, namely the asset index, HAZ, WAZ and WHZ. The mapping shows little variation in terms of both poverty and inequality estimates across Malawi’s regions. Figure 2.5: Spatial distribution of poverty using asset index and child-nutritional status Poverty headcount (asset index) 9 Poverty headcount (HAZ) We multiply our estimates by 100 for easy customisation of the map colours in StatPlanet. 37 Stellenbosch University https://scholar.sun.ac.za Poverty headcount (WAZ) Poverty headcount (WHZ) Source: Own computation from MDHS 2010 The maps also confirm that regardless of the indicator used, the poverty incidence is the lowest in the Northern region. The ranking between the Central and Southern regions is dependent on the indicator used. Based on the asset index, WAZ and WHZ, the Central region has the highest incidence of poverty. On the other hand, the Southern region has the highest incidence of poverty when HAZ is used as the indicator. With respect to Gini mapping, it is also shown that inequality is lowest in the Northern region for all the measures used. The Central and Southern regions fairly rank the same for all the measures used apart from HAZ where inequality is highest in the Southern region. This ranking is similar to the one found in the poverty headcount when HAZ is used. 38 Stellenbosch University https://scholar.sun.ac.za Figure 2.6: Spatial distribution of inequality for asset index and child-nutritional status Gini inequality (asset index) Gini inequality (HAZ) Gini inequality (WAZ) Gini inequality (WHZ) Source: Own computation from MDHS 2010 39 Stellenbosch University https://scholar.sun.ac.za 2.17 Factors affecting asset poverty So far, we have only looked at the poverty profile for Malawi without explaining the factors associated with poverty. As acknowledged in the literature, a satisfactory explanation of why households are poor is important if we are to be able to deal with the roots of poverty (Haughton & Khandker, 2009). This chapter, therefore, addresses the question of what factors are associated with asset poverty in Malawi using the 2010 MDHS data. Our dependent variable is the asset index, reflecting long-term economic status. The choice of the explanatory variables is informed by the literature. The correlates include household size, age dependency ratio, age of the household head, sex of the household head (male=1, female=2), incidence of sickness in the household (no=0, yes=1) and the levels of education in the household. We also include dummies for area (urban=1, rural=2) and region (1=Northern, 2=Central and 3=Southern region) to control for the location in which the households reside. The educational status of household members has four categories, namely 0=no education and preschool, 1=primary education, 2=secondary education and 3=post-secondary education. Shocks to the household are represented by the incidence of sickness in the household, captured as (no=0, yes=1).Table 2.10 shows descriptive statistics of the variables used in our analysis of asset poverty. Table 2.10: Summary descriptive statistics for the asset model Variables Obs Mean/Prop. Std. Dev. Asset index 24,825 0.95 0.903 Age of head 24,798 43.34 16.368 Household size 24,825 4.79 2.295 Dependency ratio 24,825 49.2% 0.245 Female head 24,825 28.4% 0.451 Sickness in the household 19,947 2.5% 0.157 Primary 24,724 61.0% 0.488 Secondary 24,724 17.3% 0.378 Higher 24,724 2.6% 0.160 24,825 88.3% 0.322 Central 24,825 33.7% 0.473 Southern 24,825 48.8% 0.500 Household's education Rural area Region Source: Own computation from MDHS 2010 The proportions relate to categorical variables. The asset index has a mean value of about 0.95 and standard deviation of 0.903. On average, the household head is aged about 43 years with standard 40 Stellenbosch University https://scholar.sun.ac.za deviation of 16.368. Based on the sample, the average household size in Malawi is about 4.79 persons. Based on an alternative data set, the Malawi IHS3 data, the average household size is 4.6 members. The age dependency ratio stands at 49% and is calculated as the number of children aged 0 to 14 years and the number of persons aged 65 years divided by the household size10. About 28% of the households are headed by females. With respect to the incidence of sickness, only about 2.5% of the households reported not to have been very sick for 3 or more months in the year prior to the survey. With regards to the levels of education, most households (about 61%) have primary education as the highest level of qualification. This is followed by no education or preschool education at 19%, secondary education at 17.3% and finally post-secondary education at 2.6%. With regard to area of residence, about 88% of the people reside in households which are rural compared to 12% which are urban. The Southern region has the highest number of households with 49%, followed by the Central region with 34% and finally the Northern region with 17%. Table 2.11 presents the OLS regression results of the asset index. The results show that about 55% of the variance in the asset index is explained by the explanatory variables. All the variables have the expected signs and statistically significant at conventional levels. There is also satisfactory performance for our control variables. We checked for the robustness of the model based on an alternative definition of the age dependency ratio and use of sickness of the father and mother in place of sickness for any of the household members. Our preferred model, which is presented, was chosen because it gives the largest size of R-squared and age of the household head had the expected positive sign. The coefficient for household size is positive and statistically significant. It indicates that a unit increase in household size is associated with an increase of 0.071 standard deviations in the asset index while holding all the other factors constant. Put differently, larger households are associated with more assets. The coefficient for age dependency ratio is also significant coefficient but negative indicating that the greater the proportion of economically inactive members in the household, the poorer is that household in terms of asset ownership. Households that are headed by older people are associated with better longterm economic status by about 0.005 standard deviations. The regression results indicate that the there is no statistically significant difference in the asset ownership between male and female headed households. 10 Strictly speaking, age dependency ratio is given as the ratio of dependents (people younger than 15 or older than 64) to the working-age population (those aged 15-64). However, this strict definition generates missing values since some of the households do not have people in either of the three age categories. 41 Stellenbosch University https://scholar.sun.ac.za With respect to education, it is found that higher levels of education are positively associated with asset ownership in Malawi when compared to the base category (no education). It is worth noting with great interest that the size of the education coefficient increases with the level of education. Urban areas are associated with higher levels of asset ownership than rural areas by about 1.051 standard deviations. Regional dummies have mixed performance with the Central region being statistically different from the Northern region which is the base. Controlling for the 27 districts in the data does not add much to the model so we do not report in the tables. Table 2.11: OLS regression results for asset poverty Description Household size Age dependency ratio Age of household head Sex of household head Household member is sick Household education Primary Secondary Higher Rural area Region Central Southern Constant R-squared Observations Prob >F F statistic Asset index 0.071*** -0.505*** 0.005*** 0.038** -0.053 SE (0.005) (0.047) (0.001) (0.015) (0.034) 0.236*** 0.864*** 2.390*** -1.051*** (0.015) (0.037) (0.093) (0.067) -0.141*** 0.003 2.283*** 0.552 19,900 0.000 190.350 (0.040) (0.039) (0.130) Notes: *, **, *** denote significance at 10%, 5% and 1% levels 2.18 Child nutritional status in Malawi We look at factors associated with child nutritional status in Malawi using the 2010 MDHS children data set. A previous study by Ngalawa & Chirwa (2008) was based on the 1997-1998 IHS data set. As earlier discussed, DHS surveys provide a better source of data on anthropometric measurements when compared to IHS data (see Verduzco-Gallo et al., 2014). Furthermore, the present analysis is based on the most recent data. Our dependent variables in the study are HAZ, WAZ and WHZ as indicators of stunting, underweight and body wasting, respectively. Table 2.12 shows descriptive statistics of the variables used in our three nutrition models. The proportions relate to all categorical variables just as before. 42 Stellenbosch University https://scholar.sun.ac.za Table 2.12: Descriptive statistics for the nutritional models Variable HAZ WAZ WHZ Age in months Square of age in moths Weight at birth Parental age difference Birth order number Female child Rural area Child is twin Female household head Asset index Square of asset index Mother's education Incomplete primary Complete primary Incomplete secondary Complete secondary Higher Father's education Incomplete primary Complete primary Incomplete secondary Higher Region Central Southern Obs 4,653 4,783 4,609 4,801 4,801 4,801 4,411 4,801 4,801 4,801 4,801 4,801 4,801 4,801 Mean Std. Dev. -1.76 1.662 -0.79 1.215 0.31 1.35 28.91 16.903 11.22 10.334 5.64 3.271 5.52 4.692 3.69 2.321 50.60% 0.50 90.00% 0.299 3.20% 0.176 7.90% 0.269 -0.097 0.899 0.817 3.235 4,801 4,801 4,801 4,801 4,801 60.60% 9.40% 9.30% 3.40% 0.50% 0.489 0.292 0.291 0.182 0.068 4,717 4,717 4,717 4,717 56.30% 8.30% 23.90% 1.60% 0.496 0.275 0.426 0.127 4,801 4,801 37.10% 45.50% 0.483 0.498 Source: Own computation from MDHS 2010 Explanatory variables include child characteristics such as sex (male=1, female=2), age in months, age squared included to account for the possible non-linearity between age and nutritional status, the weight at birth and the status of being a twin (no=0, yes=1) and absolute birth order. We control for area of residence (urban=1, rural=2), sex of the household head (male head=1, female head=2), mothers’ and fathers’ education (0/5). The levels of education for the father and mother are represented by dummies representing six categories, namely no education, incomplete primary, complete primary, incomplete secondary, complete secondary and higher education(post-secondary education). Father’s education has five categories only. There are no fathers with complete secondary education in the data set. 43 Stellenbosch University https://scholar.sun.ac.za The age difference between the father and mother to captures the bargaining position of the mother in the household. According to the bargaining literature on household decisions, bargaining status could influence the resources that the mother may receive for herself as well as for her child, possibly leading to adverse nutrition consequences (Linnemayr, Alderman & Ka, 2008). Finally, we include the asset index to capture the economic status of the household to which the child belongs. In the literature, economic status has been found to be a strong determinant of the nutritional status of the children (e.g., Dancer, Rammohan & Smith, 2008). The table reveals that amongst the three measures, WHZ has the highest mean followed by WAZ. The average age of the children is about 29 months. On average, the children were born with a weight of around 5.64kg which is actually high and in line with our statistic suggesting that being underweight is the least of the malnutrition problems in Malawi. The difference in age between fathers and mothers is about 5.52 years. The average birth order of the children in our study is 3.69. Rural areas constitute a very large proportion of children- about 90% of the total. The incidence of a twin is very low at 3.2% of the total number of children. Only about 8% of the children come from households which are headed by females. The majority of the mothers have incomplete primary education (60.6%) compared to 56.3% for fathers. About 46% and 37% of the children reside in the Southern region Central region, respectively. 2.19 Multivariate analysis of child nutrition Table 2.13 presents results from the OLS regression analysis of HAZ, WAZ and WHZ11. The model statistics show that the R-squared stands at 10%, 9% and 11% for HAZ, WAZ and WHZ, respectively. Though the F-statistics indicate the hypothesis that all slope coefficients are equal to zero, it is important to note that the WHZ model does not perform well. The results do not improve even if we regress by the recommended WHO age categories of <24 months and >=24 months (see Table A3 in the appendix). This finding is in line with an earlier study by Chirwa and Ngalawa (2008) which suggests that children in Malawi seem to be ‘fatter for their age’. There could be other factors such as poor feeding practices or genetic factors that affect their nutritional status or indeed a concern for consistent measurement error. Child characteristics such as age, sex and twin status are statistically significant for HAZ and WAZ but not WHZ. Birth order is significant for the WAZ and WHZ models only. The relationship between the age of the children and nutritional status is negative and statistically significant for HAZ and WAZ only. The square of age is included to account for possible non-linearity that exists between child nutrition and age. The coefficient of age-squared is positive for HAZ and WAZ and statistically significant at 11 Area of residence, the weight of the child at birth, parental age difference and sex of the household head are not significant and, therefore, not reported in the results. 44 Stellenbosch University https://scholar.sun.ac.za 1%. This implies that the relationship between age and nutritional status of a child is U-shaped (convexity) with respect to HAZ and WHZ. When WHZ is used, the relationship is inverted U-shaped (concavity) but insignificant. The convexity or concavity indicates that malnutrition in under-five children worsens or improves with age but this is only up to some critical age beyond which a child’s nutrition status improves or worsens with age. Table 2.13: OLS regression results for child nutritional status Variables Age in months Square of age Female child Child is twin Birth order number Mother's education Incomplete primary Complete primary Incomplete secondary Complete secondary Post-secondary Father's education Incomplete primary Complete primary Incomplete secondary Post-secondary Asset index Square of asset index Rural area Region Central Southern Constant R-squared Prob > F Observations HAZ Coeff SE -0.094*** (0.008) 0.128*** (0.012) 0.280*** (0.057) -0.930*** (0.169) 0.025 (0.016) WAZ Coeff SE -0.045*** (0.007) 0.052*** (0.010) 0.102* (0.048) -0.901*** (0.160) 0.029* (0.014) WHZ Coeff SE 0.002 (0.007) -0.001 (0.010) -0.072 (0.049) -0.286* (0.145) 0.038** (0.015) 0.082 0.110 0.186 0.060 0.439 (0.099) (0.138) (0.143) (0.192) (0.473) 0.154* 0.057 0.302** 0.243 0.236 (0.076) (0.100) (0.114) (0.172) (0.384) 0.124 0.112 0.165 0.369* -0.049 (0.083) (0.118) (0.116) (0.166) (0.428) -0.095 -0.178 -0.006 -0.055 0.183** -0.011 0.127 (0.124) (0.168) (0.139) (0.235) (0.063) (0.016) (0.108) -0.191* -0.208 -0.202* -0.169 0.195*** -0.017 0.052 (0.082) (0.111) (0.101) (0.159) (0.052) (0.011) (0.094) -0.048 0.083 -0.092 -0.007 -0.002 0.008 -0.105 (0.094) (0.130) (0.108) (0.211) (0.055) (0.014) (0.095) 0.009 -0.081 -0.259 0.088 0.000 4,700 (0.075) (0.072) (0.197) -0.102 -0.174* 0.368 0.011 0.010 4,531 (0.075) (0.072) (0.213) -0.031 (0.111) -0.006 (0.110) -1.031*** (0.261) 0.101 0.000 4,574 Notes: *, **, *** denote significance at 10%, 5% and 1% levels The critical ages are 37 months for HAZ, 44 months for WAZ and 54 months for WHZ. The significance and signs of coefficients for both of age and age squared are also confirmed in Chirwa and Ngalawa (2006) who find 30, 34 and 35 months as critical ages for HAZ, WAZ and WHZ. There exists literature which links recovery from stunting to cognitive outcomes. For example, a panel data study by Casale and Desmond (2015) in South Africa finds that children who were stunted at age 2 (in years) but recovered by age 5 performed better than children who remained stunted over the study period. They also find that children who recover from stunting by age 5 perform significantly worse on cognitive tests 45 Stellenbosch University https://scholar.sun.ac.za than children who do not experience early malnutrition. Moreover, they find that recovery from stunting is not uncommon among children- a similar finding in this study. However, the most important thing is the timing of recovery as well as the need to provide the required nutritional inputs at an early stage if the goal is to help improve children’s cognitive performance. Evidence in this research suggests it took longer by between 7 and 19 months for children to ‘recover’ in 2010 compared to what Chirwa and Ngalawa (2006) found using the 1997/8 data12. The coefficient of a variable capturing the status of being one of a twin is negative in all models but only statistically significant for HAZ and WAZ. Mussa (2014) uses HAZ as a measure of long-term nutritional status and finds opposite signs of coefficients for age, age-squared and twin status of a child. With respect to age and age-squared, Mussa (2014) finds concavity in which case the nutritional status of a child improves with age but begins to worsen after some time. This is the same finding in this study for the WHZ model although not statistically significant. We have found that the coefficient for birth order is also positive and statistically significant for WAZ and WHZ. Mussa (2014) finds a significant relationship for rural areas only which seems to confirm the empirical finding that the effect of childorder is cultural and must be interpreted within a given cultural context. With respect to parental agedifference, we find no significant relation for all the three models. On the other hand, Mussa (2014) finds a negative relationship implying that households with older fathers compared to mothers perform worse in terms of child nutritional outcomes. The level of education of the mother matters more than that of the father in our models for child nutritional status. For both the fathers and mothers, no education at all is used as the base category. All levels of mothers’ education except post-secondary education play a significant role in the WAZ model. In the HAZ model it is only incomplete secondary that matters while for WHZ, incomplete primary and complete secondary school levels play a role. The education level of the father is only significant for incomplete primary education and only for the WAZ model. However, this happens at the lowest levels of education and has a negative coefficient implying that lower levels of education are associated with poor nutritional status. Mussa (2014), who also uses “no education” as the base category, finds negative coefficients for all levels of education but only statistically significant for some levels of educations. Chirwa and Ngalawa (2006) find mixed results, a case also found in this study. Lower levels of education have negative coefficients while higher levels have positive coefficients. Chirwa and Ngalawa (2006) find that both the mothers’ and father’s matter. 12 Since we do not have panel data, we cannot conclude this is recovery but simply a finding that is consistent with possible patterns of recovery. 46 Stellenbosch University https://scholar.sun.ac.za Higher asset index scores are associated with better levels of child nutrition levels for two models only, HAZ and WAZ. Mussa (2014) uses five categories of the asset index of the households, namely poorest, poor, middle, richer and richest. Using the poorest as the base category, he finds that household wealth seems to matter more in improving height-for-age z-scores in rural areas than in urban areas. For example, a child born into the wealthiest quintile in rural areas has a height-for-age z-score that is 0.31 standard deviations better than that of a child from the poorest wealth quintile. We also used asset quintiles (poorest as the base category) and found significant results (HAZ and WAZ only) for all quintiles except the middle quintile. Of great similarity is the finding that a child from the richest quintile is better by 0.33 and 0.32 standard deviations for HAZ and WAZ, respectively. The square of the asset index displays convexity for HAZ and WAZ and concavity for WHZ. The turning points are asset index scores of 5.66 (richest quintile), 7.11 (richest quintile) and -0.45(middle quintile) for HAZ, WAZ and WHZ, respectively. 2.20 Asset index and pro-poor growth analysis Over time, DHS surveys have been upgraded to include more information and also adjusted for technological and demographic changes that have taken place. Our comparisons are, therefore, only based on 9 asset variables which are common in all data sets. Table 2.14 shows that the average MCA asset index scores have increased over time from 0.67 in 1992 to 1.04 in 2010, respectively. The spread as measured by standard deviation has declined suggesting a change in the distribution of asset index scores over time. Table 2.14: Descriptive statistics for asset index scores Survey year Observations Average Standard deviation 1992 5,323 0.67 1.14 2000 14,213 0.62 0.99 2004 13,664 0.76 0.86 2010 24,825 1.04 0.92 Total 58,025 0.84 0.97 Source: Own computation from MDHS 1992, 2000, 2004 and 2010 Figure 2.7 shows the poverty incidence curves drawn using the asset index scores for 1992, 2000, 2004 and 2010. Higher curves indicate higher levels of poverty. The curves indicate that at lower levels of the MCA asset index scores, asset poverty is lowest in 2010, followed by 2004, 2000 and finally 1992. For higher levels of the poverty line, the curves cross making it difficult for us to conclude in which year poverty is higher. 47 Stellenbosch University https://scholar.sun.ac.za .6 .4 0 .2 P overtyheadcount .8 1 Figure 2.7: Asset poverty incidence curves by survey year 0 2 4 6 MCA asset index scores 1992 2000 2004 8 10 2010 Source: Own computation from MDHS data We also calculate differences in the incidence of asset poverty since 1992 with the poverty line set at the 50.7th percentile (equal to an asset score of 0.807) as the cut-off point below which households are considered poor. The null difference is that there exists no difference in household asset ownership between any two chosen periods against the alternative hypothesis that there exist differences. Table 2.15 shows that there has been an improvement in living standards in Malawi as shown by the reduction in poverty incidence from 79.5% in 1992 to 50.0% in 2010. Poverty incidence increased by 1 percentage point from 79.5% in 1992 to 80.5% in 2000 but the increase was not statistically significant. During this period, Malawi generally experienced a down turn in economic activity although we do not place a causal link here. This worsening of long term economic conditions was, however, offset by the improvements made over the three survey period from 2000 and 2010. The greatest improvement was registered between 2004 and 2010 during which was a period of economic prosperity for Malawi.in the country driven by strong growth in the agricultural sector13. 13 As noted earlier, the agricultural sector is one of the most important economic sectors in Malawi and it contributes at least 30% to national GDP. 48 Stellenbosch University https://scholar.sun.ac.za Table 2.15: Differences in poverty headcount indices for household asset ownership Description Estimate Std. Err. T P>|t| [95% Conf. interval] Pov. Line 1992 0.795 0.006 143.76 0.000 0.784 0.806 0.807 2000 0.805 0.003 242.46 0.000 0.799 0.812 0.807 Difference 0.010 0.006 1.56 0.118 -0.003 0.023 --- 2000 0.805 0.003 242.46 0.000 0.799 0.812 0.807 2004 0.785 0.004 223.15 0.000 0.778 0.792 0.807 Difference -0.021 0.005 -4.26 0.000 -0.030 -0.011 --- 2004 0.785 0.004 223.15 0.000 0.778 0.792 0.807 2010 0.500 0.007 70.05 0.000 0.486 0.514 0.807 Difference -0.284 0.008 -35.73 0.000 -0.300 -0.269 --- 1992 0.795 0.006 143.76 0.000 0.784 0.806 0.807 2010 0.500 0.007 70.05 0.000 0.486 0.514 0.807 Difference -0.295 0.009 -32.65 0.000 -0.313 -0.277 --- Source: Own computation from MDHS 1992, 2000, 2004 and 2010 Table 2.16 provides the growth rate in the asset index and estimates from the five different pro-poor indices discussed in Section 2.6. The results indicate that there has been both absolute and relative propoor growth in asset ownership in Malawi over the entire period of study from 1992 and 2010. Table 2.16: Indices of pro-poorness in child nutritional status between 1992 and 2010 Pro-poor indices 1992-2000 2000-2004 2004-2010 1992-2010 Growth rate(g) -0.076 0.217 0.445 0.626 Ravallion and Chen (2003) index 0.017 0.654 0.545 1.217 Kakwani and Pernia (2000) index 0.565 0.788 1.098 2.240 PEGR index -0.043 0.171 0.489 1.402 Ravallion and Chen (2003) index – g 0.093 0.437 0.100 0.591 PEGR index - g 0.033 -0.046 0.044 0.776 Source: Own computation from MDHS 1992, 2000, 2004 and 2010 Malawi experienced a negative growth rate of 0.08% between 1992 and 2000. The negative growth suggests that not only did poverty increase but the poor households were also negatively affected in relative terms. Between 2000 and 2004, we find evidence of absolute but not relative pro-poor growth. However, for the period between 2004 and 2010, we are able to conclude absolute but not relative propoor growth because the PEGR index – g is negative. 49 Stellenbosch University https://scholar.sun.ac.za In Table 2.17 we show the mean access of assets by area of residence and region based on the pooled data set. The table helps us to determine the driving factors behind the observed rural-urban and regional differences in the levels of asset poverty in Malawi. Furthermore, this analysis is important because as pointed out in Section 2.7, adjusting the asset index shifts the distribution to the right which may be a limitation for inequality comparisons over time because the mean has changed (see, e.g., Wittenberg & Leibbrandt (2015). Therefore, we use this as robustness checks for our pro-poor growth analysis results. With respect to private assets, large differences in ownership are noted for “Bicycle”, “Electricity”, “Car/truck” and “Motorcycle” between urban and rural areas. Most of the households from the Northern region own “Paraffin lamps”. Ownership of toilet facilities is low in Malawi; even in urban areas, mean access is at a dismal 14%. Traditional pit latrines remain the dominant form of toilet facilities. Decent floor material (tile and cement) and excellent water sources are more accessible in urban areas. 50 Stellenbosch University https://scholar.sun.ac.za Table 2.17: Pooled mean access of assets by area and region, all periods Description Average household size Private assets Radio Bicycle Paraffin lamp Electricity Car/truck Motorcycle/scooter Toilet facility Own flush toilet Shared flush toilet Traditional pit toilet Ventilated improved pit latrine No facility/bush Other Floor material Mud/earth Cement Bricks Wood Tiles Other Water source Piped into residence Public tap Piped into yard/plot Well in residence Public well Protected well/borehole Spring River/stream Pond/lake Dam Rainwater Other Urban 4.64 Rural 4.63 Northern 5.01 Central 4.75 Southern 4.41 Total 4.63 73.40% 32.50% 61.70% 29.50% 6.60% 2.10% 51.00% 42.90% 52.10% 2.30% 0.70% 0.90% 55.20% 34.90% 65.60% 6.70% 1.50% 0.90% 52.50% 42.00% 52.40% 5.00% 1.60% 1.00% 55.30% 43.20% 50.20% 7.30% 1.60% 1.10% 54.30% 41.40% 53.60% 6.40% 1.60% 1.10% 14.00% 0.30% 54.20% 7.00% 3.40% 21.20% 0.70% 0.10% 44.30% 2.80% 17.40% 34.80% 2.40% 0.30% 44.70% 2.60% 14.90% 35.20% 2.50% 0.00% 45.20% 3.60% 17.20% 31.50% 2.90% 0.10% 46.50% 3.60% 14.10% 32.90% 2.70% 0.10% 45.70% 3.40% 15.30% 32.80% 37.50% 8.10% 0.10% 0.30% 52.70% 1.20% 85.40% 0.70% 0.00% 0.00% 11.60% 2.40% 77.10% 3.10% 0.20% 0.00% 17.70% 1.90% 79.80% 1.50% 0.00% 0.00% 14.80% 3.80% 77.50% 1.50% 0.00% 0.10% 19.70% 1.10% 78.20% 1.80% 0.10% 0.10% 17.70% 2.20% 13.40% 26.30% 37.40% 7.50% 1.60% 2.60% 1.90% 4.90% 3.00% 0.20% 0.10% 1.20% 0.60% 2.50% 9.90% 32.60% 3.80% 7.90% 3.70% 18.60% 10.90% 0.10% 1.20% 8.20% 2.70% 10.10% 15.70% 25.20% 5.00% 4.50% 2.70% 15.00% 7.00% 0.30% 1.10% 10.70% 1.90% 4.50% 8.70% 29.80% 4.00% 9.90% 5.00% 19.70% 9.30% 0.10% 1.00% 6.20% 2.90% 5.70% 17.10% 29.50% 2.50% 6.10% 2.60% 14.80% 10.90% 0.10% 1.10% 6.70% 2.50% 6.00% 14.00% 28.90% 3.40% 7.10% 3.40% 16.50% 9.70% 0.10% 1.10% 7.20% Source: Own computation from MDHS 1992, 2000, 2004 and 2010 We also show in Figure 2.8 access to assets by type and survey period. This analysis explains the sources of the movements in the asset index over time. The movements themselves could be due to so many factors including technological changes and changing tastes and preferences. Furthermore, showing access to assets is important because adjusting the asset index shifts the distribution to the right and this 51 Stellenbosch University https://scholar.sun.ac.za may be a limitation for poverty comparisons over time because the mean has changed. We use this as robustness checks for our pro-poor growth analysis results. Between 1992 and 2010, ownership of radio and bicycle has expanded by 14 and 24 percentage points, respectively. On the other hand, ownership of paraffin lamp has dropped by 50 percentage points from 84% in 1992 to 34% in 2010. Access to tile/cement floor and electricity has marginally increased by around 1 percentage point over the period from 1992 and 2010. Ownership of car/truck and motorcycle/scooter has stagnated at 2% and 1%, respectively. Ownership of flush toilet and access to piped water into the dwelling has dropped by 2 percentage points. Figure 2.8: Access to assets by type and survey period Source: Own computation from MDHS 1992, 2000, 2004 and 2010; data labels for 1992 and 2010 2.21 Pro-poor growth in child nutritional status Our analysis is based on HAZ owing to the fact that it is long term as already discussed. The HAZ scores are calculated using the new WHO (2006) child growth standards and, therefore, directly comparable over time. Table 2.18 provides summary statistics on HAZ in terms of the mean and standard deviation. Table 2.18: Descriptive statistics for HAZ Survey year Observations Average Standard deviation 1992 3,288 -2.01 1.57 2000 9,396 -1.93 1.79 2004 8,309 -1.94 1.80 2010 4,653 -1.76 1.66 Total 25,646 -1.91 1.75 Source: Own computation from MDHS 1992, 2000, 2004 and 2010 52 Stellenbosch University https://scholar.sun.ac.za Just as with the asset index, there has been an improvement in the average HAZ scores from -2.01 in 1992 to -1.76 in 2010. The deviation in the HAZ scores has slightly increased. The poverty incidence curves given in Figure 2.9 indicate that incidence of poverty headcount is unambiguously highest in 1992. For the rest of the years, the curves are compact except for specific lower ranges of the z-scores. .6 .4 0 .2 Poverty headcount .8 1 Figure 2.9: Poverty incidence curves for HAZ by DHS survey year 0 20 1992 40 60 Height for age z-scores(HAZ) 2000 2004 80 100 2010 Source: Own computation from MDHS 1992, 2000, 2004 and 2010 Results for the calculated differences in the FGT headcount index over time and for successive survey periods are given in Table 2.19. As before, the poverty line is set at 2.3rd percentile which corresponds to the international cut-off of -2SD proposed in the WHO (2006) standards below which children are considered malnourished. The null hypothesis is that there exists no statistical difference in the HAZ scores between any two chosen periods against an alternative that there exist differences. The results show that there has been an improvement in the nutritional status of children aged below 5 years in Malawi during the period between 1992 and 2010; the incidence of stunting levels amongst under-five aged children in Malawi has decreased from about 54% in 1992 and to about 46% in 2010.We also find significant improvements in the levels of child nutritional status for the successive survey periods apart from the 1992 and 2000 pair where the reduction in the incidence of malnutrition is not statistically different from zero. 53 Stellenbosch University https://scholar.sun.ac.za Using the asset index as a measure of welfare, results indicated that poverty increased between 1992 and 2000 but not statistically significant. Therefore, it is concluded that the results are similar and that both asset poverty and child nutritional status remained the same over the period. Table 2.19: Differences in the FGT poverty headcount index for HAZ Description Estimate Std. Err. 1992 0.539 0.011 2000 0.519 0.006 Difference -0.020 0.013 2000 0.519 0.006 2004 0.496 0.007 Difference -0.023 0.009 2004 0.496 0.007 2010 0.457 0.000 Difference -0.039 0.007 1992 0.539 0.011 2010 0.457 0.000 Difference -0.082 0.011 T 49.174 82.173 -1.601 82.173 70.046 -2.430 70.046 . -5.484 49.174 . -7.491 P>|t| 0.000 0.000 0.111 0.000 0.000 0.016 0.000 . 0.000 0.000 . 0.000 [95% Conf. interval] Pov. line 0.517 0.561 2.300 0.506 0.531 2.300 -0.045 0.005 --0.506 0.531 2.300 0.482 0.510 2.300 -0.042 -0.004 --0.482 0.510 2.300 0.457 0.457 2.300 -0.053 -0.025 --0.517 0.561 2.300 0.457 0.457 2.300 -0.104 -0.061 --- Source: Own computation from MDHS 1992, 2000, 2004 and 2010 We also find evidence of both absolute and relative pro-poor growth in child nutritional status for 20002004, 2004-2010 and 1992-2010 as shown in Table 2.20. Not only did absolute poverty decline but the poor also benefited more. For 1992 and 2000, the Ravallion and Chen (2003) index gives a negative estimate of 0.203 thereby making us unable to conclude absolute pro-poorness despite the fact the Kakwani and Pernia (2000) index and PEGR index indicate otherwise. We are also unable to conclude relative pro-poorness using all the three relative measures, namely the Ravallion and Chen (2003) index – g, PEGR index- g and the Kakwani and Pernia (2000)'s index. Table 2.20: Indices of pro-poorness for HAZ Pro-poor indices 1992-2000 2000-2004 2004-2010 1992-2010 Growth rate(g) 0.223 0.028 0.044 0.313 Ravallion and Chen (2003) -0.203 0.360 0.946 1.042 Kakwani and Pernia (2000) 0.624 8.547 7.702 2.965 PEGR index 0.139 0.241 7.702 0.928 Ravallion and Chen (2003) – g -0.426 0.332 0.902 0.729 PEGR - g -0.084 0.213 0.295 0.615 Source: Own computation from MDHS 1992, 2000, 2004 and 2010 54 Stellenbosch University https://scholar.sun.ac.za 2.22 Conclusions In this study, we have measured poverty and inequality in Malawi both spatially and temporally. Over the period between 1992 and 2010, we find evidence of pro-poor growth in both absolute and relative terms. We have also identified factors associated with asset poverty in Malawi. Results from the study indicate that while household size is positively correlated with asset accumulation, age dependency ratio has a negative association. There is no statistically significance difference in asset ownership between male and female headed households. Households with older household heads do better in terms of asset ownership than those headed by younger people. Shock to sickness in the household has a negative association with asset ownership. Education attainment by the members of the household has been found to have a positive association with the level of assets in the household. Rural areas have higher levels of asset poverty compared to urban areas. Significant regional differences between the northern and central regions have also been found in terms of asset poverty with the latter lagging behind. We have also identified factors that are associated with the nutritional status of under-five children in Malawi. Using three anthropometric measures of nutrition, namely HAZ for stunting, WAZ for underweight and WHZ for wasting, it is shown that the incidence of stunting is the highest of the nutritional problems amongst under 5 children in Malawi followed by underweight. On the one hand, there no large differences between regions and areas in terms of child nutritional status. On the other hand, when assets are used, the welfare gap between regions and areas is bigger. 2.23 Policy discussion Non-monetary pro-poor growth analysis is important for policy because it allows us to identify the progress made in poverty reduction over time as well as analyse the redistribution effects that arise with growth in living standards. Specifically, as shown in the paper, the poor people in Malawi have benefited the most from the changes in assets and child-nutritional status that have occurred over time. This means that policy makers should not only focus on income growth when analysing progress towards goals set in the national development agenda such as those in the Malawi Growth and Development Strategy (MGDS) and the Sustainable Development Goals (SDGs). Subgroup poverty comparisons are important for targeting development policies towards specific groups that have been identified as the poorest. For example, our study has shown that poverty is higher in rural areas than urban areas. As a response to these findings, deliberate policy measures might be put in place to specifically target the identified poorest groups of Malawi. Similarly, policies can be developed to address the problem of inequality as informed by the results. A number of policy conclusions can be also be drawn from the regression analysis. Firstly, the negative association between asset ownership and age dependency ratio suggests that increasing the income 55 Stellenbosch University https://scholar.sun.ac.za generating opportunities for the economically active population could help in reducing asset poverty in Malawi. Younger household members could, for example, be equipped with the necessary skills as they wait to enter the working age group. Secondly, the significance of education attainment and the fact that the sizes of coefficients increase with the level of education imply that post-primary education should be emphasised in Malawi. Currently, the focus of education policy is on basic education (completion of primary education). Thirdly, since shocks to illnesses are negatively associated with asset ownership, it may be important to consider looking into policies that improve access to health and assist households to withstand shocks thereby preventing them from being pushed into poverty traps. The multivariate analysis of child malnutrition reveals that characteristics such as age, sex and twin status matter in the outcomes of nutritional status at the child level. Child birth order, mother education and economic status have a positive association with child nutritional status. These findings have policy implications too. Firstly, based on the literature linking recovery from stunting to cognitive outcomes, the study suggests that timing of nutritional inputs is critical in the cognitive development of children. Secondly, since male children tend to have weaker nutritional status compared to their female counterparts, nutritional feeding programmes might consider addressing this gender imbalance. Thirdly, the importance of female education for the nutritional status of children suggests that training women in child nutrition may require emphasis. Fourthly, related to the second implication, the significance of higher levels of education suggests that post-primary education should be emphasised in general. Currently, the focus of education policy is to achieve basic or primary education. Finally, since higher levels of economic status are associated with better nutritional status, policy should continue to be geared towards improving households’ living conditions. 56 Stellenbosch University https://scholar.sun.ac.za Chapter 3 Externalities and returns to education in Malawi: Panel data evidence 3.1 Introduction Malawi remains a poor country despite registering gains in poverty reduction over the past two decades. Estimates based on Malawi’s first, second and third integrated surveys (IHS1, IHS2 and IHS3) show that the incidence of poverty based on household percapita consumption has marginally fallen from 52.4% in 2005 to 50.7% in 2011. In 1998, the poverty rate was 65.3%. Rural areas, which make up about 85% of the population, have significantly higher poverty rates than urban areas, although the gap between the two is closing over time. The Gini coefficient shows that inequality increased from 0.401 in 1998 to 0.390 in 2005 to 0.452 in 2011 (National Statistical Office, 2012). On the one hand, the labour participation rate for Malawi, defined as the share of the population aged 15 and above working or seeking employment, is very high and stands at 88%. On the other hand, education levels are low. About 74% of the population aged 15 years and above do not have any qualification at all and 21% reported never having attended education. Therefore, literacy remains a challenge in Malawi given the high rate of labour force participation. As at 2011, the literacy rate (defined as the ability to read and write with understanding in any language) amongst people aged 15 years and above stood at 65%, which is an insignificant improvement from 64% reported in 2005 (National Statistical Office, 2012). In the Malawi Growth and Development Strategy (MGDS), education is identified as one of the themes necessary for growth and socio-economic development. Malawi’s formal education system consists of primary, secondary and tertiary or post-secondary education. The country’s education policy has been largely focussed towards increasing access to primary education. In recognition of this, primary school education was universally made free in 1994 for all government schools. According to National Statistical Office (2012), as a response to these policy changes, the net primary enrolment rate has increased to 86% in 2011 from 80% in 2004 while the primary dropout rate has dropped from 5% to 1% over the same period. In addition to universal free primary education, tertiary education has been subsidised in order to make it more affordable, although the country is yet to realise the benefits in terms of expansion in tertiary education. The Malawian labour market can be categorised into the formal and informal sectors. As is the case in many other developing countries, the formal sector in Malawi only consists of a small percentage of the labour force. Consequently, most people are involved in either self-employment activities or paid employment in the informal sector (Chirwa & Matita, 2009). The informal sector is, therefore, important to Malawi’s economy; and it accounts for 78% of total employment (National Statistical Office, 2014). 57 Stellenbosch University https://scholar.sun.ac.za Education has been identified as a tool for poverty reduction in Malawi (National Statistical Office, 2012). Through education attainment, the poor are said to be empowered and equipped for better opportunities in national development. It is against this background that the study seeks to look at the role of education in poverty reduction in Malawi. The link between poverty and education is identified through the labour market. A number of studies that link education to poverty reduction have been conducted. Da Maia (2012) looks at the link between education and poverty reduction in Mozambique. The study estimates the probability of an individual getting employment in any of the given sectors conditioned on education and models the relationship between education and earnings. Several researchers have looked at the role played by education in earnings (e.g., Mincer, 1974; Psacharopoulos, 1994 & 2002; and in Malawi, Chirwa & Matita, 2009; Chirwa & Zgovu, 2002). Clearly, the relationship between education and earnings is not only of great interest to many scholars but also important for a policy perspective. However, previous studies on Malawian labour markets (see Chirwa & Matita, 2009; Matita & Chirwa, 2009; Chirwa & Zgovu, 2002) have explored the link between education and earnings using crosssectional data sets due to lack of panel data. This study expands on the available literature by taking advantage of the newly released panel data set. Use of panel data has many advantages and this is well acknowledged in the literature. Firstly, we can control for unobservable individual heterogeneity. As shown in the literature, failure to control for individual specific effects leads to bias in results. Secondly, panel data contains rich information about cross-sectional variations and dynamic behaviour of the subjects of interest. Thirdly, with panel data, we are able to identify time effects which cannot be estimated with cross-sectional data. Baltagi (2013) provides a detailed discussion on the advantages and limitations of panel data. In addition to using panel data, the study distinguishes between the formal and informal sectors of the economy to see if the role played by education in earnings differs by sector. This decomposition is important considering the large size of the informal sector. Furthermore, the study corrects for sample selection bias which arises as a result of selection into economic sectors as well as working with only selected individuals who end up entering employment instead of the full working age-population. With regard to distinguishing between sectors, we improve on previous studies (e.g. Chirwa & Matita, 2009) by conducting coefficient comparison tests between groups by gender and sector. The test is routinely implementable in Stata 13.1 via the suest command (see Appendix A1). The need for conducting coefficient comparison and balancing tests has been motivated in Pischke and Schwandt (2014). The study also explores whether earnings from non-farm household enterprises are affected by the educational attainment of other members of the household. Basu and Foster (1998) argue that the 58 Stellenbosch University https://scholar.sun.ac.za presence of a literate person in a given household may generate positive intra-household education externalities. Unlike the isolated illiterate persons, illiterate individuals living in a household where at least one member is literate enjoy some of the literate person’s capabilities. Therefore, education is said to have some positive spill-over effects in that the literate members of the household may help in the running of enterprises in the same manner agricultural extension workers or medical personnel help in their communities. We explore whether there are positive education externalities in household enterprises by the use of maximum number of years of schooling in a household. This specification is also motivated by Jolliffe (2002) who finds that in household level income functions such as non-farm enterprises, the maximum or average level of education in the household is a better explanatory variable of household income. Furthermore, it is argued that in most developing countries income is largely earned at the household rather than the level of the individual, hence our focus on household enterprises. Moreover, according to data from IHS3, the enterprise sector is large in Malawi and affects a significant percentage of households in Malawi. Specifically, about 20% of households are involved in non-farm employment activities in Malawi (National Statistical Office, 2012). The chapter addresses three main specific issues (i) to investigate the factors influencing labour force participation and employment likelihood in Malawi; (ii) to estimate returns and externalities to education in Malawi; and (iii) to identify and deal with inconsistencies in the IHPS data. This includes conducting comparisons of earnings and consumption data in addition to robust treatment of outliers. 3.2 Methodology 3.2.1 Theoretical framework There are two main competing groups of theories for explaining labour market outcomes. First is the traditional neoclassical model of labour supply which argues for a simple competitive labour market whereby workers are paid their marginal product. In this competitive model of the labour market, individuals’ wages are largely determined by their productivity (Kerr & Teal, 2015). According to this model, wage differences across sectors or occupations are competitively eliminated by workers moving away from underpaid sectors to highly paid sectors. One of the early works explaining occupational choice is by Roy (1951). Using a two-sector model, Roy (1951) argued that individuals choose sectors in which they have a comparative advantage such that wage differences between sectors are simply a reflection of the differences in unobserved ability between employees in the two sectors. However, investments in education and training were not modelled in Roy’s model. The simplest form of the competitive model does not distinguish between degrees of job informality or differences in individual investments that would enhance productivity. However, by allowing for investment in human capital, individuals are assumed to make investments in education and other productivity enhancing abilities in such a way as to maximise their expected lifetime utility. Despite 59 Stellenbosch University https://scholar.sun.ac.za contributions by other researchers, much of the human capital theory is based on the work of Becker (1962) and Mincer (1974) after which the human capital theory became the standard for empirical work in labour economics (Card, 1999). In this strand of literature, earnings are assumed to be dependent on education or training and experience in the labour market The alternative is the segmented labour market hypothesis which observes labour market outcomes and wage differentials that are inconsistent with the competitive model. Specifically, the latter argues that people’s choices of where to work and the wage received are determined by institutional factors such as government legislation (e.g. minimum wage legislation) and other institutional factors (see (Leontaridi, 1998; Dunlop, 1957; Kerr, 1954). It is, for example, argued that labour market institutional constraints prevent the bargaining down of wages in high wage sectors by workers in low-wage sectors. Furthermore, it was observed that workers with similar jobs were paid different wages as a result of such institutional constraints. A number of explanations have been given for the existence of institutional constraints and wage setting mechanisms between sectors. One of these explanations in the developing country context is the Harris-Todaro model of migration (Harris & Todaro, 1970). According to this model, while the urban sector is insulated from the forces of supply and demand as a result of minimum wage legislation, the rural sector was assumed competitive. The existence of a minimum wage in the urban sector meant that rural workers or the unemployed were unable to bid the urban wage down to an equilibrating level. Under these conditions, this ensured that workers who find employment in the urban sector would therefore be paid more than if they were employed in the rural agricultural sector. Despite these criticisms, the human capital theory remains the leading theoretical model for explaining labour market outcomes and is used in this study. In addition to these two competing theoretical frameworks, recently, there have also been efforts towards improved understanding and definitions of informal markets as well as the linkages between the informal and formal economy. This has given rise to different theories or explanations as to what constitutes or gives rise to informal labour markets (Chen, 2012). This debate is relevant in the Malawi context because of the huge size of the informal employment, which makes up of 78% of total employment, as earlier stated. The recognition of the existence of an informal market in the analysis of labour markets in developing countries largely came as a result of the 1972 International Labour Organization (ILO) study in Kenya (Singer & Jolly, 1972) where it was found that the traditional and informal sectors of the economy included profitable and efficient enterprises as well as marginal activities. In addition to this, work by British anthropologist Hart (1973) found that there was no wage employment among unskilled migrants who had migrated from Northern Ghana into the capital city. Instead, these were involved in low-income activities in the informal sector not related to the formal sector. Fields (1975) made another contribution to the empirical and theoretical research on the informal sector by extending the Harris-Todaro 60 Stellenbosch University https://scholar.sun.ac.za framework to include an urban informal sector. In this framework, three sectors are recognised, namely the urban formal sector, the urban informal sector and the traditional informal sector. The urban informal sector was made up of casual employment and was viewed as a temporary sector in which people freely enter as they look for higher paid jobs in the formal sector. However, the presence of union activity and minimum wage legislation prevented people from entering the formal sector and therefore kept workers in the informal sector (Kerr & Teal, 2005). The definition of the informal sector has changed over the years. Previously, following the ILO Kenya report, the characteristics of informal activities included ease of entry, family ownership of enterprises and unregulated and competitive markets (Singer & Jolly, 1972). ILO now uses the expanded statistical criteria for distinguishing between formal and informal employment which was endorsed by the 2000 International Labour Conference and the 2003 International Conference on Labour Statisticians (Hussmanns, 2004). The formal sector primarily includes salaried employment in the private and government sectors as well as non-governmental organisations (NGOs). In this sector, the relationship between the employer and employees is governed by formal labour laws including employee benefits and income taxation. In contrast, the informal sector has two components, namely self-employment activities and informal wage employment. Self-employment mainly consists of employers in informal enterprises, own account workers in informal enterprises, contributing family workers (in informal and formal enterprises) and members of informal producers’ cooperatives. Informal self-employment includes enterprises that are not registered under any national legislative authority and not engaged in agricultural activities. Informal wage employment includes employees of informal enterprises, casual or day labourers, temporary or part-time workers, paid domestic workers, contract workers, unregistered or undeclared workers and industrial outworkers (also called homeworkers). Of great importance to the Malawian economy is casual employment (locally known as ganyu14). Despite being largely seasonal, it is very important in both urban and rural areas. 14 Ganyu is the dominant form of employment in the informal sector. 61 Stellenbosch University https://scholar.sun.ac.za 3.2.2 Estimating returns to education Mincerian earning functions are the standard approach for estimating returns to education in labour markets. The methodology started with the work of Mincer (1974) and has been widely used in the literature (Psacharopoulos, 1994). It is based on the human capital theory which argues that investment in education improves workers’ skills resulting in high productivity and, therefore, higher earnings (Mincer, 1974). The basic model is summarised below:- ln(Yit )   1   2 S it   3 E it   4 E it2   it (3.1) where, for any individual i  , at time t  , Y is the earnings of individual, S is the number of years of 2 schooling, E is the experience, E is experience squared and  is the error term. The coefficient  2 is interpreted as the private rate of return to education (RORE) and  2 * 100  gives the percentage return to one additional year of schooling. The above specification can be extended to include other control variables such as gender, location, etc. As is the practice in the literature, we improve on the classic model presented in equation (3.1) in two main ways. Firstly, we account for the fact that returns to education may be heterogeneous rather than homogeneous and secondly, we address the problem of selection bias which arises as a result of using non-random data for analysis. Due to its importance, we will deal with sample selection in the section that follows. As stated, the basic model disregards the differences in the level of educational achieved by looking at a single overall education level or years of schooling. This approach is called the one factor or homogeneous model since it assumes that there are no differential trends in the returns to education for different levels of education. There is little statistical evidence and causal empiricism for the homogenous model. The alternative approach, called the multiple factor approach or heterogeneous model, looks at the different levels of education as having separate effects on earnings. This involves replacing S with an educational dummy variable to represent the different educational categories. Seven educational qualifications are captured in the data, namely “None”, “Primary School Leaving Certificate of Education (PSLCE)”, “Junior Certificate of Education (JCE)”, “Malawi School Certificate of Education (MSCE)”, “Non-University Diploma”, “University Diploma” and “Post-Graduate degree”. In some cases of our analysis, we combine the last three categories into tertiary education 15. 15 In Malawi, PSLCE, JCE and MSCE are equivalent to 8, 10 and 12 years of schooling. For tertiary education, diplomas are usually completed within two or three years while undergraduate degrees take four years except for 62 Stellenbosch University https://scholar.sun.ac.za Our data set does not have information on the number of years of experience. Following the standard practice used in the literature (e.g., Matita & Chirwa, 2009; Kahyarara & Teal, 2008; Appleton, Bigsten, & Manda, 1999), we estimate the number of potential years of experience as age less years of schooling less preschool age. This assumes that once people complete their education, they immediately enter the labour market. Those without education are assumed to enter the labour market at the lowest labour market entry age of 15 years. 3.2.3 Sample selection Sample selection16 bias arises from a number of sources. Firstly, it could arise as a result of self-selection into different employment categories or sectors. Secondly, it could be due to non-random attrition in panel surveys when subjects drop out of the sample for some reason. However, in our data, attrition is not a major problem, as we will discuss in Section 3.3 on data. Consequently, we do not deal with sample selection caused by attrition. Thirdly, it may be as a result of working with a truncated sample. This raises the problem of sample selection because ideally we are interested in studying all individuals in the working age population (15-64) but end up with only those that actually entered employment. Fourth and finally, since earnings are only observed for individuals who are working at the time of the survey, our analysis is made on the sample selected on this basis. Failure to correct for sample selection would bias our results. The Heckman (1979) two-step procedure, following Wooldridge (2002), has been used to correct for selection bias. In the first step, the probability of an individual selecting into an economic sector or attriting through a probit model is estimated as: cit  xit  vit (3.2) where: i represents an individual, c denotes the binary response variable equal to either 1 or 0 and x is a set of regressors such as education level, age, sex, marital status, etc., including dummies for location. The error term is given by v . From equation (3.2), we obtain the inverse of the Mills ratio and use it as an explanatory variable in the estimation of either the wage equation (3.1) or the likelihood of employment conditional on an engineering, law and medicine which are completed in five years. A post-graduate degree usually takes two years to complete. 16 Some may refer to this as self-selection. Whether we define it as sample selection or self- selection is trivial in our study as long as we achieve our goal of addressing the non-random nature of the sample when estimating our employment and earnings functions. 63 Stellenbosch University https://scholar.sun.ac.za individual’s participation in the labour market (see Section 3.6.2). If the coefficient of the inverse Mills ratio is statistically significant, then we are justified in correcting for selection bias. We then proceed to correct for and report heteroskedasticity consistent standard errors in the first stage. 3.2.4 Modelling unobserved heterogeneity Let us consider the following model when time T  1,2 : yit  xit  uit (3.3) Where y it is the log of earnings for individual i at time t . Suppose that the error term is made up of two components as follows: u it  ni  vit (3.4) where ni is time invariant and correlated with xit , and vit is time varying and uncorrelated with xit . If the exogeneity assumption is violated, i.e. when E ( xit ni )  0 , the OLS estimator will be biased in cross-section. On the other hand, panel estimators can be used to control for unobserved individual timeinvariant heterogeneity and this allows us to obtain unbiased estimates of  . Even when the unobserved correlated effect is not time invariant, using panel data techniques reduces the magnitude of the bias. The most commonly estimated models with panel data are the fixed effects and random effects models and several considerations will affect the choice between the two. The first consideration is the nature of omitted variables. If we think that there are no omitted variables from the model or that the omitted variables are uncorrelated with the explanatory variables in the model, then a random effects model is the best (Williams, 2015). A random effects model under these assumptions will produce unbiased estimates of the coefficients, use all the data available, and yield the smallest standard errors. However, it is more likely that omitted variables will produce at least some bias in the estimates. If there are omitted variables and these variables are correlated with the variables in the model, then a fixed effects model provides a means for controlling for omitted variable bias. In a fixed-effects model, subjects serve as their own controls. For this to work, the omitted variables must have time-invariant values with timeinvariant effects. For example, gender does not change over time and its effect on the outcome in wave 1 is the same as the effect of gender in wave 2. Secondly, researchers consider the amount of variability within subjects: If subjects do not change much, or not at all, across time, then fixed effects models may not work very well or even at all. There is need to have within-subject variability in the variables to justify the use of fixed effects. When there is little 64 Stellenbosch University https://scholar.sun.ac.za variability within subjects, the standard errors from a fixed effects model may be too large to tolerate. Conversely, random effects models will often have smaller standard errors (Clark & Linzer, 2015). Third are the effects are we interested in studying. In fixed effects models, we are not interested in estimating the effects of variables that do not change or change very little over time. Rather, we control for them or “partial them out”. On the other hand, with random effects models, we are able to estimate the effects of time-invariant variables such as gender, although the method is no longer controlling for omitted variables (Williams, 2015). Given the above considerations, we choose to use the random effects model for three main reasons. First, education, whose effect we are interested in measuring, is generally a slow changing variable especially for a three year period over which we have data. Secondly, given that our panel is short (only two periods) there is not much within-subject variability in most of our variables. Third, a random effects model allows us to estimate the effects of time-invariant variables such as gender which is an important aspect in Malawi. With fixed effects, this is not possible since the variable gets dropped off after demeaning. Therefore, using a random effects model naturally comes at a cost and the trade-off is that their coefficients are more likely to be biased than the fixed effects estimates. Nevertheless, according to Wooldridge (2002), panel data techniques reduce the magnitude of bias compared to OLS. 3.3 Description of the data This section provides a brief description of the Malawi Integrated Household Panel Survey (IHPS) data, a two-wave panel conducted in 2010 and 2013. The survey was implemented by the National Statistical Office (NSO) of Malawi. The 2010 wave was part of the third nationally representative Integrated Household Survey (IHS3)17 conducted between March 2010 and March 2011 during which 3247 households were selected as a panel subcomponent to be resurveyed in 2013. The second wave, carried out between April and December 2013, saw the panel sample increase to 4,000 households because split-off members who formed new households were also included. 17 IHS3 consists of four questionnaire types, namely the household, agricultural, fisheries and community questionnaires. In the household questionnaire, individuals are asked if they were involved in agricultural and fishing activities over the past 12 months. Those who answered “Yes” are then further administered the agriculture and fisheries questionnaire where more details on crop, livestock and fishing practices are collected. Our analysis is based on the household questionnaire only where the scope of our study lies. 65 Stellenbosch University https://scholar.sun.ac.za The household formed the primary unit of analysis in the IHPS surveys. An attempt was, therefore, made to track all baseline households as well as members that moved away from the baseline dwellings between 2010 and 2013. Servants and guests at the time of the IHS3 were excluded and only individuals who were expected to be at least 12 years of age and known to be residents in mainland Malawi 18 were tracked. Of the 3,247 households initially chosen in 2010, some could not be located while others split into new households. Our analysis is based on the 3,104 households that are available in both waves. The rate of attrition at the household level was only 3.78 %. The 2010 baseline had 15,597 individuals of all ages, of which 14,232 are available in both waves, representing an overall attrition rate of 7.42 % at the individual level. Given these low rates of attrition, which also seem random, we pursue this issue no further because we believe the representativeness of the sample has not been affected19. For purposes of this study, our sample is restricted to economically active available in both waves. All calculations include survey weights. There are two sources of earnings as captured in the household questionnaire, namely wage employment and self-employment activities. Earnings from self-employment activities are provided as profit from non-farm enterprises over a period of 30 days while earnings from wage employment are given with an indication of the period over which earnings are earned, i.e. day, week, two weeks or month. Ganyu wages are given as daily earnings with an indication of the number of days worked in a week. All earnings are converted into real monthly figures in 2013 constant prices. 3.3.1 Work and non-work activities of the employed The surveys collected information on both work and non-work activities of individuals over the seven days prior to the administration of the questionnaire. Non-work activities or domestic tasks are given in (a) and (b) while the rest constitute work activities. a. How many hours did you spend yesterday collecting water? b. How many hours did you spend yesterday collecting firewood (or other fuel materials)? c. How many hours in the last seven days did you spend on household agricultural activities (including livestock and fishing-related activities) whether for sale or for household food? d. How many hours in the last seven days did you run or do any kind of non-agricultural or nonfishing household business, big or small, for yourself? 18 Excluding Likoma district, which is an Island on Lake Malawi. 19 In the literature, the most common ways of addressing attrition are Inverse Probability Weighting (IPW) and Heckman selection correction (e.g., Wooldridge, 2002). 66 Stellenbosch University https://scholar.sun.ac.za e. How many hours in the last seven days did you help in any of the household's non-agricultural or non-fishing household businesses, if any? f. How many hours in the last seven days did you engage in casual, part-time or ganyu labour? g. How many hours in the last seven days did you do any work for a wage, salary, commission, or any payment in kind, excluding ganyu? h. How many hours in the last seven days did you engage in an unpaid apprenticeship? All individuals and households involved in household agricultural activities identified from (c) are administered the household questionnaire where more information related to agriculture is collected. Questions (d) and (e) are further explored in the enterprise module where information on enterprise owners, customers and profits is collected. Questions (f) and (g) are further explored in the module for time use and labour where information on wages and occupations is collected. 3.3.2 Describing employment structure and hours worked There are about nine mutually exclusive employment types that can be identified in the data set as shown in Table 3.1. Public works programme (PWP) and church/religious organisations can be jointly referred to as non-governmental organisations (NGO), considering the nature of their activities. Table 3.1: Employment and occupation structures 2010 Occupations 2013 Frequency Percent Cumulative Frequency Percent Cumulative Private Company 270 8.9% 8.9% 303 8.0% 8.0% Private Individual 261 8.6% 17.4% 225 6.0% 14.0% Government 170 5.6% 23.0% 173 4.6% 18.6% Parastatal 14 0.5% 23.5% 15 0.4% 19.0% PWP 12 0.4% 23.9% 16 0.4% 19.4% NGO 45 1.5% 25.3% 63 1.7% 21.1% Other 6 0.2% 25.5% 18 0.5% 21.6% 1,620 53.2% 78.7% 2,056 54.5% 76.0% 650 21.3% 100.0% 904 24.0% 100.0% 3,048 100.0% 3,773 100.0% Casual/Ganyu Enterprises Total Source: Own computation from IPHS data On the one hand, the structure of employment observed in the table reflects a major similarity that Malawi shares with other less developed African countries, i.e. that casual employment and selfemployment activities (household enterprises) make up the largest share of total employment both in 67 Stellenbosch University https://scholar.sun.ac.za and outside of agriculture. This observation holds in both 2010 and 2013. Moreover, attrition levels are very low such that the structure observed in the panel is largely a reflection of the national figures. On the other hand, the observed structure is a contrast to what is seen in countries like South Africa where there is a very small informal sector (e.g., Kingdon & Knight, 2004; Kerr & Teal, 2015). The dominance of the informal sector is reflected in the number of observations and accompanying percentages in each of the employment types. Regular formal employment (private and government) have registered slight declines and possibly these partly account for the growth in the informal employment numbers. Total informal employment has increased as a result of growth in both casual employment and self-employment activities. The IHPS report states that the percentage of households in Malawi that operate non-agricultural enterprises has grown by 9 percentage points from 21% in 2010 to 30% in 2013. The survey collected information on the number of hours worked20 during the 7 days prior to the survey. We provide a breakdown of the average number of hours by occupation in Table 3.2. The table reveals that, on average, individuals in the panel worked for about 25 hours in a week in 2013. This is an insignificant increase from the 24 hours registered in 2010. The maximum working hours regulated by the government is 48 per week in all jobs. Most of the occupations registered increases in the number of hours worked. PWP and ganyu employment registered significant declines. The former is a safety net scheme targeting the poor who mainly utilise it when there is a need. The latter employees (ganyu) usually work for food and basic needs and their labour supply is largely dependent on meeting daily needs. Once their daily needs are met, there is little motivation to work longer. Overall, the private sector (companies and individuals) worked the longest number of hours in both 2010 and 2013. The government and parastatal worked for a similar number of hours. Table 3.2 also shows the average years of schooling by occupation. On average, number of years of schooling is about 7.5 and 7.4 years for 2010 and 2013, respectively. This is just below the official 8 years of primary education. What we find here is reflective of what the focus of government education policy has been in Malawi over the past two decades- the provision of free primary education. Government employees have the highest number of years of schooling across the two years while ganyu and public programme workers have the least. The levels of education partly explain the large earning differentials observed earlier, consistent with the human capital theory which states that education is positively correlated with earnings. 20 Includes agricultural work but excludes unpaid domestic activities: water and firewood collection. 68 Stellenbosch University https://scholar.sun.ac.za Table 3.2: Changes in the average weekly hours worked and years of education by year 2010 Average weekly Years of hours worked education Private Company 40.20 8.80 (25.50) (4.50) Private Individual 32.80 7.50 (24.60) (3.10) Government 32.90 13.40 (23.10) (5.10) Parastatal 36.30 11.20 (21.0) (4.80) Public Works Programme 26.20 7.70 (27.70) (3.50) Church/Religious Organisation 30.90 10.50 (23.80) (5.30) Other 47.30 10.90 (24.40) (3.60) Casual/Ganyu 16.40 6.50 (17.60) (2.60) Non-agricultural enterprises 29.61 7.43 (21.40) (3.40) Total 24.10 7.50 (22.10) (3.70) Occupations 2013 Average weekly Years of hours worked education 44.10 9.00 (23.60) (04.50) 41.30 7.70 (28.0) (3.30) 34.10 14.50 (23.20) (5.60) 35.20 10.60 (29.30) (5.20) 13.60 5.80 (15.10) (3.30) 40.30 10.80 (27.0) (5.50) 30.00 9.40 (17.80) (4.20) 15.10 6.40 (18.10) (2.70) 33.28 7.42 (22.10) (3.40) 24.80 7.40 (23.50) (3.80) Source: Own computations from IHPS data; standard deviations in parenthesis. 3.3.3 Treatment of outliers, missing data and zero earnings Missing earnings, for example, because individuals refused to answer or the respondent did not know, are not imputed. Negative and zero earnings were dropped since their natural logs are undefined21. As is the practice in the literature, we only look at economically active individuals aged between 15 and 64 years. We also restrict our sample to observations with non-missing values in any of the variables reported in our tables. The resulting samples are 5,377 individuals and 1,702 households with positive real earnings. Next, we explain how outliers have been treated, using wage earners as an example. Observations that are substantially different from the rest can make a difference to the regression results obtained. It is, therefore, important to not only investigate these unusual observations but also find ways of dealing with them. Wittenberg (2014) and Burger and Yu (2007) provide a good discussion on dealing with outliers. We discuss four approaches in the ensuing paragraphs. 21 Researchers sometimes consider an alternative of adding a very small number to earnings before taking the log, which would allow considering zero incomes. 69 Stellenbosch University https://scholar.sun.ac.za The first approach is to take out millionaires which are clear outliers. In our data set, there were 24 millionaires with average monthly earnings of MWK2,269 702 compared to MWK 38,471 for the rest of the 5353 individuals. However, the choice of millionaires is arbitrary and has a potential of removing genuine earners, especially considering that the minimum education level of these millionaires is MSCE- above the typical education level of those in the labour force. The second approach is to remove outliers by identifying observations with extreme regression residuals. In linear regression, an outlier is an observation with a large residual. This is achieved by estimating a simple Mincerian type wage regression of the log of real monthly wages on education, age, age squared, gender and occupation. After running this regression, studentised or standardised residuals were created. In this approach, studentised residuals with absolute values greater than five are flagged as extreme and corresponding observations dropped. Using this approach, only five observations were flagged as extreme. We reduced the cut-off to 4 resulting in 14 individuals being flagged as having earnings that were too high or low for their characteristics. The third approach is robust regression. When data is contaminated with outliers, using studentised residuals has been found to be insufficient in identifying the ‘bad’ observations. Robust regression is an alternative to least squares regression when this is the case, i.e. when data is contaminated with outliers or influential observations (Wittenberg, 2014). This approach is easily handled in Stata 13.1 and observations are given weights depending on whether they are outliers or not. Outliers are assigned zero weights and consequently identified as not belonging in the regression. In total, robust regression identified 35 observations as being extreme and this included all the outliers also identified through the studentised residuals. The final approach is to remove observations in the 100th percentile. We generated a new variable containing percentiles of real monthly earnings. This was used to identify and then drop 56 observations in the 100th percentile with average real monthly earnings of MWK1,394,771 compared to MWK355,924 in the 99th percentile. However, this results in a loss of 56 observations, which we deemed excessive. One can equally consider dropping observations in the bottom percentile but we are more concerned with outliers in the higher percentiles. The median of earnings is low (about MWK8,400) while the mean is high (about MWK26,253), suggesting the data distribution is positively skewed by the presence of large outliers. Considering sample size issues, we used the second approach with results from the other three approaches presented as robustness checks (see Section 3.8.1.3) . 70 Stellenbosch University https://scholar.sun.ac.za 3.3.4 Dealing with inconsistencies We noted some inconsistencies in terms of the period over which salaries were paid. About 6% of the total sample in the formal sector reported a payment period different from that reported in 2010. Similarly, about 24% of those in ganyu employment reported a different number of days spent on ganyu per week in 2013 than they previously reported in 2010. One may reason that some of these changes may be genuine considering that people switch jobs and occupations. However, before applying the changes, we first checked for unusually large swings in wages and earnings. Moreover, the inconsistencies were not unique to a specific occupation but rather spread across all occupationsconfirming our argument. These inconsistencies were dealt with by creating a period that is consistent in both waves22. In order to reduce noise introduced by individual heterogeneity and arrive at a fairly consistent data set, we only worked with a balanced panel and also dealt with outliers which were partly responsible for unrealistically large increases in earnings. Some of the outliers could have been as a result of the manner in which we generate our monthly earnings, i.e. (ganyu days in a week*daily wage*4). In some cases, we found some few ganyu millionaires. Our analysis showed that of the 56 total outliers, 7 were ganyu earners. We had to deal with the outliers because we believe that a ganyu earner who had a large windfall payment on a single day will have their earnings overstated after converting the daily earnings into monthly wages. Nevertheless, our ganyu earnings compare very well with data from FISP as explained in Section 3.7.3. 3.4 Labour force participation The IHS questionnaire contains four questions which allow us to compute labour force participation and unemployment rates for Malawi. The questions are: a. Did you do work for any number of hours in the past 7 days? b. Even though you did not do any activities in the last seven days, do you have a job or business or any economic activity to return to? c. If you were offered a job, would you be willing to accept the job? d. In the past four weeks, have you taken any action to look for any kind of work or start any kind of business / income generating activity? The labour force is the sum of the employed and unemployed. Two standard definitions of unemployment exist in the literature, namely broad and narrow definitions. Broad unemployment is where a person is without work and available for work during the reference period. In this case, it 22 See Appendix A2 for description of the Stata code. 71 Stellenbosch University https://scholar.sun.ac.za includes responses in (a), (b) and (c). Narrow unemployment is where a person is without work and available for work during the reference period and is seeking work. This would be captured in (a), (b), (c) and (d). The broad definition counts all the jobless people who want jobs even though they did not search for one. The labour force participation comprises those employed and seeking work. 3.4.1 Size of labour force and labour force participation rates Table 3.3 shows labour force participation for the balanced panel according to some selected characteristics. One key finding is that, regardless of the measure used, the labour force participation for the balanced panel has increased by almost 10 percentage points over the three-year period. For the broad measure, the labour force participation rate has increased from 81% in 2010 to 91% in 2013. The results indicate very high labour force participation rates - a key characteristic of low-income agricultural economies23. We also note some gender gaps in favour of males in terms of both labour force participation and employment rates. The gap has, however, marginally declined over three years. Based on human capital theory, education does not only improve an individual’s chances of getting employment but also positively impacts on earnings. Based on the table results, this theory seems to hold amongst those with some level of education, i.e. if we exclude those without formal education. The analysis shows that the labour force participation rate increases (due to higher employment rates) with the level of education and is highest amongst those with university education, reaching around 90% in both 2010 and 2013. It is, however, worth noting that those without formal education have equally high levels of labour force participation - registering rates of as high as 81% in 2010 and 91% in 2013 when measured in broad terms. Nonetheless, most of these are probably those who immediately enter the informal sector of the labour market because they do not have formal education. With respect to age-groups, we note that compared to any other age groups, the (15-19) 24 age group has the lowest participation rate, at 58% in 2010 and about 77% in 2013. Consequently, as we will see, this age-group has the largest rates of unemployment although the situation is improving. The rise in the labour force participation rate between 2010 and 2013 is partly due to the fact that we are working with a balanced panel such that no new young entrants are captured in the second wave, i.e. wave 2 would only cover the 18 and 19-year olds. 23 In Malawi as well as the rest of Sub-Saharan Africa, the traditionally high rates of labour force participation can be linked to the existence of very large agricultural sectors. 24 Based on the IHS3 figures, about 57% of the Malawian population is aged between 0 and 19 years. In our data set, almost 22% is in the 15-19 age-group (National Statistical Office, 2012). 72 Stellenbosch University https://scholar.sun.ac.za Table 3.3: Labour force participation rates according to characteristics Description Malawi Sex Male Female Education None PSLCE JCE MSCE Tertiary Age groups 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Regions Northern Central Southern Working age population 2010 Share 2013 Share 7,086 100% 8,106 100% Broad labour force 2010 Share 2013 Share 5,737 100% 7,336 100% Narrow labour force 2010 Share 2013 Share 5,033 100% 6,578 100% 3,331 3,755 47% 53% 3,869 4,237 48% 52% 2,791 2,946 49% 51% 3,526 3,810 48% 52% 2,537 2,496 50% 50% 3,241 3,337 49% 51% 5,275 754 576 386 96 74% 11% 8% 5% 1% 5,841 958 719 463 125 72% 12% 9% 6% 2% 4,278 589 449 335 86 75% 10% 8% 6% 2% 5,326 832 636 430 112 73% 11% 9% 6% 2% 3,790 498 376 288 81 75% 10% 7% 6% 2% 4,787 744 560 385 102 73% 11% 9% 6% 2% 1,487 1,209 1,102 868 688 460 466 311 255 239 21% 17% 16% 12% 10% 6% 7% 4% 4% 3% 1,796 1,311 1,203 1,023 762 604 442 407 288 269 22% 16% 15% 13% 9% 7% 5% 5% 4% 3% 869 988 954 774 625 415 427 272 220 194 15% 17% 17% 13% 11% 7% 7% 5% 4% 3% 1,387 1,179 1,142 986 734 583 420 391 271 242 19% 16% 16% 13% 10% 8% 6% 5% 4% 3% 691 829 833 681 578 374 402 259 203 182 14% 16% 17% 14% 11% 7% 8% 5% 4% 4% 1,195 1,000 1,026 894 685 541 399 362 250 224 18% 15% 16% 14% 10% 8% 6% 6% 4% 3% 924 3,054 3,109 13% 43% 44% 1,055 3,541 3,510 13% 44% 43% 740 2,467 2,530 13% 43% 44% 964 3,174 3,198 13% 43% 44% 665 2,212 2,156 13% 44% 43% 861 2,875 2,842 13% 44% 43% Source: Own calculations from IHPS data 3.4.2 Changes in the labour force according to background characteristics In the next few paragraphs, we examine the patterns in the working age-population and labour force over the three years under study for individuals in the panel. The results are presented in Table 3.4 and show that there has been an increase of 14% in the number of people making up the working age population between 2010 and 2013 in the balanced sample. This increase is higher than the average GDP growth rate of around 3% experienced during the same period, raising fears about the economy’s capacity to absorb the increasing labour supply. The labour force has grown both in broad and narrow terms by about 28% and 31%, respectively. These observed increases in the labour force could either be genuine or simply as a result of better capturing of the labour force in the administration of the questionnaire. With respect to gender, there was a larger increase in the working age population for males compared to females. However, this pattern is reversed when we look at the labour force where females now have a higher growth rate than males for both broad and narrow definitions. 73 Stellenbosch University https://scholar.sun.ac.za Amongst the education categories, the largest growth rate for the working age population was largest amongst those with tertiary education although the growth is from a very small number. However, the growth in the labour force has been the largest amongst those with PSCLE and JCE. This observation consistently holds for both the broad and narrow definitions of the labour force. Table 3.4: Changes in the labour force according to characteristics Description Malawi Sex Male Female Education None PSLCE JCE MSCE Tertiary Age groups 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Regions Northern Central Southern Working age population 2010 2013 % change 7,086 8,106 14% Broad labour force 2010 2013 % change 5,737 7,336 28% Narrow labour force 2010 2013 % change 5,033 6,578 31% 3,331 3,755 3,869 4,237 16% 13% 2,791 2,946 3,526 3,810 26% 29% 2,537 2,496 3,241 3,337 28% 34% 5,275 754 576 386 96 5,841 958 719 463 125 11% 27% 25% 20% 30% 4,278 589 449 335 86 5,326 832 636 430 112 25% 41% 42% 28% 30% 3,790 498 376 288 81 4,787 744 560 385 102 26% 49% 49% 34% 26% 1,487 1,209 1,102 868 688 460 466 311 255 239 1,796 1,311 1,203 1,023 762 604 442 407 288 269 21% 8% 9% 18% 11% 31% -5% 31% 13% 12% 869 988 954 774 625 415 427 272 220 194 1,387 1,179 1,142 986 734 583 420 391 271 242 60% 19% 20% 27% 18% 41% -1% 44% 23% 25% 691 829 833 681 578 374 402 259 203 182 1,195 1,000 1,026 894 685 541 399 362 250 224 73% 21% 23% 31% 19% 44% -1% 40% 23% 23% 924 3,054 3,109 1,055 3,541 3,510 14% 16% 13% 740 2,467 2,530 964 3,174 3,198 30% 29% 26% 665 2,212 2,156 861 2,875 2,842 29% 30% 32% Source: Own computations from IHPS data. There have been positive growth rates across all age-groups except the (45-49) group which has registered a decline of about 5% in the working age population and a corresponding drop of 1% for both the broad and narrow definitions of the labour force. All the three regions have registered increases in the working- age population numbers and this is also reflected in the broad and narrow labour force numbers. Using the broad definition, the Northern region has registered the largest growth rate in proportion to the rest. On the other hand, when the narrow definition is used, the largest increase is observed in the Southern region. 74 Stellenbosch University https://scholar.sun.ac.za 3.4.3 Shares in the labour force According to results presented in Table 3.5, females make up the majority of the labour force regardless of whether the broad or narrow definition is used. The proportion of females in the labour force ranges between 50% and 53% and this reflects well with the population shares observed in the 2008 census data. However, proportionately fewer females enter the labour market compared to males. We will see this clearly when we discuss unemployment rates in Section 3.5. Table 3.5: Shares of working-age population and labour force according to characteristics 2010 7,086 Broad labour force participation Narrow labour force participation 2010 2013 2010 2013 2013 Number Rate Number Rate Number Rate Number Rate 8,106 5,737 81% 7,336 91% 5,033 71% 6,578 81% 3,331 3,755 3,869 4,237 2,791 84% 2,946 78% 3,526 3,810 91% 90% 2,537 76% 2,496 66% 3,241 3,337 84% 79% 5,275 754 576 386 96 5,841 958 719 463 125 4,278 589 449 335 86 81% 78% 78% 87% 90% 5,326 832 636 430 112 91% 87% 88% 93% 90% 3,790 498 376 288 81 72% 66% 65% 75% 84% 4,787 744 560 385 102 82% 78% 78% 83% 82% 1,487 1,209 1,102 868 688 460 466 311 255 239 1,796 1,311 1,203 1,023 762 604 442 407 288 269 869 988 954 774 625 415 427 272 220 194 58% 82% 87% 89% 91% 90% 92% 88% 86% 81% 1,387 1,179 1,142 986 734 583 420 391 271 242 77% 90% 95% 96% 96% 97% 95% 96% 94% 90% 691 829 833 681 578 374 402 259 203 182 46% 69% 76% 78% 84% 81% 86% 84% 80% 76% 1,195 1,000 1,026 894 685 541 399 362 250 224 67% 76% 85% 87% 90% 90% 90% 89% 87% 84% 924 3,054 3,109 1,055 3,541 3,510 740 80% 2,467 81% 2,530 81% 964 3,174 3,198 91% 90% 91% 665 72% 2,212 72% 2,156 69% 861 2,875 2,842 82% 81% 81% Working age population Description Malawi Sex Male Female Education None PSLCE JCE MSCE Tertiary Age groups 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Regions Northern Central Southern Source: Own computations from IHPS data. As far as education is concerned, we make two important observations. First, is that the labour force in Malawi is highly uneducated with percentages as high as between 72% and 75%. This is despite two decades of free primary education in Malawi. The second observation is that despite remaining highly uneducated, the labour force has become slightly more educated on average. As the table shows, the shares of those without formal education (None) have dropped by two percentage points from 74% to 72%. A similar percentage point decrease has been registered in the broad and narrow labour force figures. We also note that the shares of individuals with some education have increased over the three75 Stellenbosch University https://scholar.sun.ac.za year survey period. However, much of the increase in the levels of education is accounted for by PSLCE and JCE. MSCE and Tertiary have remained constant at 8% and 2%, respectively. The table also shows that the majority of the work force in Malawi is made up of the youth and this is reflective of Malawi’s population pyramid. Using the International Labour Organisation (ILO) definition, the youth (15-24) make up 38% of the total working-age-population. This proportion has been stable for both 2010 and 2013. On the other hand, using the Southern African Development Community (SADC) definition, the youth (15-34) consist of 66% in both 2010 and 2013. Regardless of the definition used, the youth make up the majority of the labour force. We observe consistent shares in the working age population among regions for both the broad and narrow labour force definitions. The Northern region has the lowest shares across the survey periods partly because the population is also low there. 3.4.4 Multivariate analysis of labour force participation We analyse the likelihood of labour force participation by the use of a probit regression. The results are presented in Table 3.6. Both models generate correctly predicted probabilities of at least 78% and this is quite high and good. Our dependent variable is a binary variable equal to ‘1’ if an individual is in the labour force and ‘0’, if not. For both the broad and narrow definitions, the explanatory variables are: sex of individuals (reference group: male), five year age categories (reference group: (15-19), education levels (reference group: none), marital status (reference group: married or non-formal union), age dependency ratio25, region (reference group: Northern), year (reference group: 2010). Under both the broad and narrow definitions, females are less likely to enter the labour market compared to males. Compared to the reference category, older individuals are more likely to participate in the labour market. We also note that labour force participation is generally concave with respect to age first increasing before reaching a turning point. After controlling for other factors and regardless of whether we use the broad or narrow definition, individuals with primary and junior secondary education are significantly less likely to participate in the labour market than the reference group who has no education. This is perhaps because they choose to continue with their education unlike those without education who might as well join the market straight away. In the narrow definition, we pick up an interesting and expected result where those with tertiary 25 Age dependency ratio is given as the ratio of dependents (people younger than 15 or older than 64) to the working-age population (those aged 15-64). 76 Stellenbosch University https://scholar.sun.ac.za education are more likely to enter the labour market compared to those without education, but this is not the case for the broad definition. As the lower labour force participation rate amongst those with some education could be the result of longer time spent studying, it is instructive to also see whether those without education also have a higher rate of participation if the sample is limited to those above 25 years. Table 3.6: Probit regressions on labour force participation Description Female Age categories 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Education level Primary Junior secondary Senior secondary Tertiary Not married Age dependency ratio Region Central Southern Year=2013 Constant Percent correctly predicted probability R-squared Observations Broad -0.215*** (0.029) Narrow -0.365*** (0.025) 0.496*** 0.743*** 0.833*** 0.852*** 0.793*** 0.870*** 0.782*** 0.609*** 0.459*** (0.040) (0.050) (0.058) (0.067) (0.075) (0.080) (0.086) (0.091) (0.084) 0.317*** 0.571*** 0.686*** 0.806*** 0.719*** 0.906*** 0.786*** 0.692*** 0.598*** (0.037) (0.043) (0.049) (0.056) (0.062) (0.069) (0.072) (0.080) (0.077) -0.076* -0.130*** 0.058 0.078 -0.521*** 0.051** (0.042) (0.044) (0.057) (0.098) (0.033) (0.021) -0.099*** -0.198*** -0.043 0.209** -0.398*** 0.099*** (0.037) (0.039) (0.046) (0.085) (0.030) (0.019) -0.110*** 0.005 0.532*** 0.760*** 86.13% 0.162 15,192 (0.037) (0.037) (0.027) (0.052) -0.026 -0.053* 0.377*** 0.457*** 78.08% 0.123 15,192 (0.033) (0.032) (0.022) (0.047) Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis Individuals who are unmarried (separated, divorced, widowed or never married) are less likely to enter the labour force compared to those that are married or living with a partner. Those who come from households with more children and elderly (high age dependency ratio) are more likely to participate in the labour market. In the broad and narrow definitions, individuals residing in the Central region and Southern region have a significantly lower likelihood of labour force participation compared to those in the Northern region. The conditional likelihood of labour force participation is greater in 2013, either 77 Stellenbosch University https://scholar.sun.ac.za indicating the presence of more job opportunities in 2013 than in 2010 or simply the fact that there are now more people looking for jobs. 3.5 Unemployment From Table 3.4, we noted that there was a growth in the labour force regardless of whether the broad or narrow definition is used. However, the numbers of unemployed and unemployment rates shown in Table 3.7 have been relatively stable over the three years. Broad unemployment rates have remained constant at 13% while narrow unemployment has only marginally increased from 2% in 2010 to 3% in 2013. This means that much of the labour force growth was absorbed, probably on account of existing vacancies or new jobs created. Table 3.7: Unemployment shares and rates by year Description Malawi Sex Male Female Education None PSLCE JCE MSCE Tertiary Age groups 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Regions Northern Central Southern Broadly unemployed 2010 2013 Number Share Rate Number Share Rate 743 100% 13% 938 100% 13% Narrowly unemployed 2010 2013 Number Share Rate Number Share Rate 118 100% 2% 222 100% 3% 273 470 37% 10% 63% 16% 371 566 40% 11% 60% 15% 59 59 50% 50% 2% 2% 104 118 47% 53% 3% 4% 500 95 79 60 8 67% 13% 11% 8% 1% 12% 16% 18% 18% 10% 666 104 93 59 15 71% 11% 10% 6% 2% 13% 12% 15% 14% 14% 73 12 11 17 4 62% 10% 10% 14% 4% 2% 2% 3% 6% 5% 160 18 20 18 6 72% 8% 9% 8% 3% 3% 2% 4% 5% 6% 175 183 144 97 47 37 23 10 17 10 24% 25% 19% 13% 6% 5% 3% 1% 2% 1% 20% 18% 15% 13% 8% 9% 5% 4% 8% 5% 208 218 154 124 61 52 30 43 24 23 22% 23% 16% 13% 7% 6% 3% 5% 3% 2% 15% 19% 13% 13% 8% 9% 7% 11% 9% 9% 8 36 35 15 9 3 5 2 3 2 7% 31% 29% 13% 8% 3% 5% 1% 2% 1% 1% 4% 4% 2% 2% 1% 1% 1% 1% 1% 26 43 43 38 17 15 11 18 5 7 12% 20% 20% 17% 8% 7% 5% 8% 2% 3% 2% 4% 4% 4% 3% 3% 3% 5% 2% 3% 72 268 403 10% 10% 36% 11% 54% 16% 136 355 446 15% 14% 38% 11% 48% 14% 8 47 63 7% 40% 53% 1% 2% 3% 39 70 113 17% 32% 51% 4% 2% 4% Source: Own computations from IHPS data. 78 Stellenbosch University https://scholar.sun.ac.za As also earlier discussed, there is an evident gender gap in favour of males as reflected in higher rates of unemployment for females. The gap is partly due to the fact that there are more females in the population but could also be as a result of gender discrimination. Nevertheless, the gap seems to be closing as shown by the decline in the share of females in broad unemployment from 63% in 2010 to 60% in 2013. With respect to education, those without education make up the largest share of unemployment numbers and the proportions decline as the level of education increases, consistent with the human capital theory. However, there seems to be no clear pattern between unemployment rates and levels of education since unemployment is prevalent at all levels of education. While the unemployment rates are lower for those without education (because they can enter the ‘labour’ market earlier), they increase for individuals with PSLCE, JCE and MSCE, probably because these people are still in school. The level and rates of unemployment are the lowest amongst those with tertiary education. The younger age groups not only make up the largest shares of unemployment figures but also experience the highest rates of unemployment, ranging between 13% and 20% when broadly measured and between 1% and 4% using narrow measurement. 3.6 Employment trends and characteristics This section looks at employment trends between 2010 and 2013 as well as the characteristics of those in employment in Malawi. We also examine the work activities of the employed. 3.6.1 Employment shares and growth rates Table 3.8 shows that the shares of employment have fairly remained stable by gender although females dominate in 2013 on account of growth. We observe a drop in the share of those without education from 76% in 2010 to a still high 73% in 2013. The youth, regardless of how we define them, dominate the shares of employment numbers. We note, however, that in 2013, the (15-19) age group dominates the share of employment at 18%, overtaking the (20-24) and (25-29) age groups which previously dominated in 2010. On average, the number employed has grown by 28% and 29% for broad and narrow measures, respectively. A breakdown by gender shows that there was a larger increase for females than males regardless of the measure used. The largest growth in the number employed occurred amongst those with either PSLCE (49%) or JCE (48%) as their highest level of education. All age-groups, except those in the (45-49) group, have registered growth between 2010 and 2013. The largest increase (at least 70%) has been registered in the (15-19) age group. These are likely to be individuals who drop out of the education system in search of employment. 79 Stellenbosch University https://scholar.sun.ac.za Table 3.8: Broad and narrow employment shares and growth by year Description Malawi Sex Male Female Education None PSLCE JCE MSCE Tertiary Age groups 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Regions Northern Central Southern Broadly employed 2010 2013 Growth Number Share Number Share (2010-2013) 4,994 100% 6,398 100% 28% 2,518 2,476 50% 50% 3,155 3,243 49% 51% 25% 31% 2,478 2,437 50% 50% 3,137 3,219 49% 51% 27% 32% 3,777 494 370 275 78 76% 10% 7% 6% 2% 4,660 728 543 371 97 73% 11% 8% 6% 2% 23% 47% 47% 35% 24% 3,717 486 364 271 77 76% 10% 7% 6% 2% 4,627 725 540 368 96 73% 11% 8% 6% 2% 24% 49% 48% 36% 25% 694 805 811 677 578 377 403 262 203 184 14% 16% 16% 14% 12% 8% 8% 5% 4% 4% 1,179 961 989 862 673 531 390 348 247 219 18% 15% 15% 13% 11% 8% 6% 5% 4% 3% 70% 19% 22% 27% 16% 41% -3% 33% 22% 19% 683 793 798 666 569 371 397 258 200 181 14% 16% 16% 14% 12% 8% 8% 5% 4% 4% 1,170 957 983 856 668 526 389 345 245 218 18% 15% 15% 13% 11% 8% 6% 5% 4% 3% 71% 21% 23% 29% 17% 42% -2% 34% 23% 20% 668 2,200 2,127 13% 44% 43% 828 2,819 2,751 13% 44% 43% 24% 28% 29% 657 2,165 2,093 13% 44% 43% 823 2,805 2,728 13% 44% 43% 25% 30% 30% Source: Own computations from IHPS data 3.6.2 Narrowly employed 2010 2013 Growth Number Share Number Share (2010-2013) 4,915 100% 6,356 100% 29% Multivariate analysis of employment likelihood Not all people who are in the labour force end up being employed. This raises the problem of sample selection, which we address using the Heckman (1979) two-step procedure as discussed earlier in Section 3.2.3. In the first step, we estimate the probability of an individual either participating in the labour force or not. From the first step in Section 3.4.4, we obtained the inverse of the Mills ratio and used it as an additional explanatory variable in the second step, the probit on the likelihood of employment. Our results are presented in Table 3.9. 80 Stellenbosch University https://scholar.sun.ac.za Table 3.9: Two-step Heckman probit results on employment likelihood Description Female Age categories 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 Education level PSLCE JCE MSCE Tertiary Not married Region Central Southern Year=2013 Inverse mills ratio Constant Percent correctly predicted probability R-squared Observations Broad -0.220*** (0.029) Narrow -0.076* (0.045) -0.171** -0.038 0.073 0.204** 0.144 0.243** 0.156 0.204** 0.200** (0.067) (0.083) (0.089) (0.092) (0.092) (0.10) (0.098) (0.098) (0.091) -0.070 -0.019 0.040 0.118 0.088 0.140 0.071 0.108 0.075 (0.058) (0.086) (0.099) (0.109) (0.104) (0.118) (0.112) (0.112) (0.106) -0.051 -0.132*** -0.164*** -0.018 -0.118** (0.037) (0.040) (0.044) (0.078) (0.048) -0.006 -0.035 -0.085* -0.080 -0.108** (0.039) (0.047) (0.045) (0.078) (0.051) 0.063* -0.072** 0.002 -1.710*** 1.284*** 75.63% 0.106 15,192 (0.033) (0.031) (0.046) (0.235) (0.113) 0.022 -0.034 0.026 -1.654*** 1.428*** 75.65% 0.106 15,192 (0.032) (0.032) (0.047) (0.244) (0.141) Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis First, we note that the coefficient of the inverse Mills ratio is statistically significant, indicating the presence of sample selection. It is, therefore, important to correct for selection bias. We then proceed to correct for selection bias and report heteroskedasticity consistent standard errors in the first stage. Under the broad definition, the results show also that the following groups are less likely to enter employment: females, those aged (20-24), those with JCE and MSCE, individuals who are not married, and people living in the Southern region, while those aged 35 and above as well as those from the Central region are more likely to be employed. Fewer variables are significant in the narrow definition, where we find that the following groups are significantly less likely to be employed: females, individuals with MSCE and those that are not married. 3.7 Earnings and changes in employment status Table 3.10 shows that overall, the total average real monthly wages have increased by 37% from MWK 20,186 in 2010 to MWK27,704 in 2013. In the ensuing paragraphs, we attempt to breakdown and explain the sources of the increase. The first step is to examine how the employment status of individuals has 81 Stellenbosch University https://scholar.sun.ac.za changed between the two waves. Conditional on missing earnings, we came up with four employment statuses, unemployed in both waves (not shown in table), employed in either 2010 or 2013 only, and employed in both periods. Table 3.10: Mean monthly total wages by employment status and survey period 2010 SD 2013 SD Mean N Percent Mean Employment status Wave 1 only 13,123 914 38% . . (56,494) Wave 2 only . . . 18,552 (49,050) . Both waves 24,696 1,520 62% 36,119 (96,957) (75,160) Total 20,186 ( 68,713) 2,434 100% 27,704 (78,263) Source: Own computation from IHPS data, earnings expressed in constant 2013 prices. 3.7.1 N Percent . 1,411 1,532 2,943 . 43% 57% 100% Employed in either wave The table shows that part of the increase in earnings is explained by the new entrants into the labour market (n=1,411) with average real monthly earnings of MWK18,552 compared to the MWK13,123 in 2010 of those who have now exited (n=914) the labour market. Between these two groups, average earnings have increased by 41%. Consequently, those that have exited the market have been replaced by higher earning individuals. 3.7.2 Employed in both waves We observe that earnings are higher amongst individuals employed in both years compared to those only employed in either period. The gap in earnings between these two groups is stable, i.e. MWK13,123 versus MWK24,696 (1.88 times higher) in 2010 and MWK18,552 versus MWK36,119 (1.95 times more) in 2013. Moreover, those employed in both periods also experienced an increase in earnings; their earnings increased from MWK24,696 in 2010 to MWK36,119 in 2013, representing an increase of 46% over three years. Considering the importance of ganyu employment in Malawi, the analysis in Table 3.10 is repeated for ganyu earners. The results are given in Table 3.11 where a similar pattern is observed. First, the largest increase in earnings is observed for individuals employed in both waves. Second, those employed in wave 2 only (new entrants) earn more compared to individuals in wave 1 only. 82 Stellenbosch University https://scholar.sun.ac.za Table 3.11: Mean monthly ganyu wages by employment status and survey period 2010 2013 Mean SD N Percent Mean SD N Percent Wave 1 only 9,234 (32,415) 711 47% . . . . Wave 2 only . . . . 15,733 (42,866) 1,136 58% Both waves 10,013 (16,498) 792 53% 20,231 (54,093) 835 42% Total 9,649 (25,222) 1,503 100% 17,718 (48,183) 1,971 100% Employment status Source: Own computation from IHPS data, earnings expressed in constant 2013 prices. We further examine if there is any relationship between employment status and education attainment. Table 3.12 shows that of those without education qualification (“None”), 47% were unemployed in both waves and this forms the majority. On the other hand, 61% and 60% of those with university diploma and post-graduate degree, respectively, were employed in both periods. New entrants into the labour market (wave 2 only) have more education compared to those that have dropped out of the labour market (wave 1 only). Table 3.12: Employment status and education attainment of individuals Description None Unemployed in both waves 5,253 960 756 350 47% 56% 58% 41% 1,616 173 122 74 15% 10% 9% 9% 2,106 268 160 126 19% 16% 12% 15% 2,143 311 256 300 19% 18% 20% 35% 11,118 1,712 1,295 849 100% 100% 100% 100% Employed in wave 1 only Employed in wave 2 only Employed in both waves Total PSCLE JCE MSCE Non-University University Post-graduate Total Diploma Diploma degree 26 11 3 7,358 20% 16% 13% 48% 18 6 4 2,013 14% 10% 19% 13% 17 9 2 2,687 13% 13% 8% 18% 70 41 14 3,135 54% 61% 60% 21% 130 67 23 15,194 100% 100% 100% 100% Source: Own computation from IHPS data, percentages in italics. 3.7.3 Identifying sources of increases in earnings Figure 3.1 shows real monthly wages by occupation and survey year. Although we do not fully know the reasons behind the changes, it is important to point out two observations. Firstly, as we will note in the figure, the average earnings in the NGO sector (PWP and churches) have dropped between 2010 and 2013. These might have been negatively affected by donor aid withdrawals that Malawi faced recently.26 26 The “Other” category is made up of 7 and 19 individuals in 2010 and 2013, respectively. Perhaps we need to add them up to the dominant category or a category with similar characteristics. 83 Stellenbosch University https://scholar.sun.ac.za Secondly, our results compare well with Farm Input Subsidy Programme (FISP) evaluation panel data currently being analysed by other researchers in Malawi where large increases have been observed in real ganyu wages (nominal ganyu daily wage rates divided by maize prices) for some districts between 2012 and 2015 (see Figure A1 in the appendix). Figure 3.1: Histogram of real monthly earnings by survey year and occupation Source: Own computation from IHPS data, bars represent standard errors. Given this comparison, one can argue that the increases in the earnings may be genuine although it may be difficult to isolate the main factors from the many drivers behind this rise. Gross Domestic Product (GDP) growth may be a contributor but the economy only grew at an average of about 3% per year. As for ganyu wages, it is possible that with the availability of food in many parts of the country, the ganyu workers tended to have higher bargaining power and asked for more wages. It is also worth noting that in real terms, minimum wages were adjusted upwards twice between 2010 and 2013; first by 50% effective 1st January 2011 and second by 34% effective 1st July 2013. These could explain the increases in earnings in the private and government sectors, although this largely depends on effective implementation and monitoring and would only affect low-income workers. Despite working the same number of hours, those in government received significantly more per month than their counterparts in parastatal companies. Nevertheless, parastatal organisations more than doubled their earnings between 2010 and 2013. There is some catch up between the two groups. 84 Stellenbosch University https://scholar.sun.ac.za 3.8 Econometric analysis of returns to education As explained in Section 3.3, the IHS questionnaire collects information on wage employment and household non-farm enterprises. The former is analysed at the individual and the latter at the household level. It is, however, possible to link household to individual data by enterprise owner. Earnings from household non-farm enterprises are analysed at a household rather than the individual level for three reasons. Firstly, we reason that wages and enterprise earnings (profit) may not necessarily be explained by the same factors. This is confirmed because the R-squares for the wage regressions improve after splitting the analysis (for wages and household earnings separately). Secondly, the IHS data have a separate module for household enterprises with background characteristics such as industry, customers and age of enterprises which allows us to achieve this purpose. Thirdly, looking at enterprises separately enables us to explore the existence of education externalities within the household. We analyse externalities by using maximum education in the household. Average household education is used for robustness checks and also works well. We begin our analysis by looking at wage employment before moving on to enterprise earnings in Section 3.8.2. 3.8.1 Wage employment Recall that according to the human capital theory, investment in education improves workers’ skills resulting in high productivity and, therefore, higher earnings (Mincer, 1974). We begin our analysis with ordinary least squares estimation (OLS) for 2010, 2013 and the pooled sample. The results are given in Table 3.13 where the dependent variable is the log of real monthly earnings 27. The explanatory variables are years of schooling, years of experience, square of experience, sex (base category: male), region (base: Northern) and occupation (base: private individual). Across the three models, the results show that there are large returns to education of between 6.8% and 7.7%. The strongly positive returns to education are consistent with other findings in Malawi (Chirwa & Matita 2009; Chirwa & Zgovu 2002). Similar results have been found in other African countries (e.g., in Cameroon by Ewoudou & Vencatachellum, 2006; in Rwanda by Lassibille & Tan, 2005); Bennell, 1996, for Sub-Saharan Africa). The negative and significant gender dummy is consistent with the general finding that females earn less than their male counterparts (Chirwa & Matita, 2009). We also find that average earnings for private individuals are significantly lower than those in private companies. As expected, ganyu earnings are consistently significantly lower in all the models. It is only in the 2013 27 The interpretation of the coefficients is the percentage change in the monthly earnings given a unit change in the explanatory variable. For dummy variables the percentage effect of a change from the base category. 85 Stellenbosch University https://scholar.sun.ac.za model that workers in public works programme earn significantly less than those in private companies, albeit only at the 10% level of significance. Table 3.13: OLS results for log of real monthly wages OLS 2013 0.068*** (0.009) 0.055*** (0.006) -0.001*** (0.000) -0.276*** (0.042) -0.019 (0.098) -0.144 (0.096) -0.555*** (0.126) 0.232 (0.162) 0.386* (0.197) -0.262* (0.157) 0.171 (0.234) 0.036 (0.283) -0.805*** (0.105) 9.080*** (0.168) 0.282 2,943 Description 2010 Years of schooling 0.077*** (0.008) Experience 0.037*** (0.006) Experience squared -0.001*** (0.000) Female -0.260*** (0.045) Central -0.101 (0.081) Southern -0.257*** (0.074) Private Individual -0.210* (0.108) Government 0.206 (0.130) Parastatal 0.179 (0.196) Public Works Program 0.555 (0.376) Church/Religious Organisation -0.024 (0.335) Other 0.135 (0.490) Casual/Ganyu -0.803*** (0.094) Constant 8.962*** (0.153) R-squared 0.340 Observations 2,434 Pooled 0.072*** (0.008) 0.047*** (0.005) -0.001*** (0.000) -0.267*** (0.032) -0.054 (0.075) -0.202*** (0.066) -0.392*** (0.094) 0.206* (0.122) 0.305** (0.145) 0.094 (0.193) 0.107 (0.231) 0.128 (0.284) -0.795*** (0.083) 9.015*** (0.137) 0.295 5,377 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis 86 Stellenbosch University https://scholar.sun.ac.za 3.8.1.1 Homogeneous returns to education Next, we discuss findings from three models, namely OLS, fixed effects and random effects beginning with estimation results presented in Table 3.14. In each of these models, we assume that returns to education are homogeneous and also ignore selectivity bias28. Table 3.14: OLS, Fixed effects and random effects results for log of real monthly wages Pooled OLS 1 2 3 Years of schooling 0.109*** 0.072*** 0.072*** (0.007) (0.008) (0.007) Experience 0.047*** 0.047*** (0.005) (0.005) Experience squared -0.001*** -0.001*** (0.000) (0.000) Female -0.267*** -0.268*** (0.032) (0.032) Central -0.054 -0.05 (0.075) (0.074) Southern -0.202*** -0.194*** (0.066) (0.066) Private Individual -0.392*** -0.369*** (0.094) (0.096) Government 0.206* 0.216* (0.122) (0.122) Parastatal 0.305** 0.298** (0.145) (0.139) Public Works Program 0.094 0.061 (0.193) (0.210) Church/Religious Organisation 0.107 0.091 (0.231) (0.225) Other 0.128 0.064 (0.284) (0.280) Casual/Ganyu -0.795*** -0.805*** (0.083) (0.085) Year=2013 0.297*** (0.039) Constant 8.418*** 9.015*** 8.868*** (0.057) (0.137) (0.133) R-squared 0.137 0.295 0.313 Observations 5,377 5,377 5,377 Description 4 0.009 (0.008) Fixed effects 5 0.055*** (0.010) 0.068*** (0.011) -0.000* (0.000) 0.567*** (0.208) 0.365 (0.265) -0.113 (0.081) 0.085 (0.145) 0.290** (0.126) -0.247 (0.251) 0.172 (0.169) 0.352** (0.164) -0.487*** (0.089) 6 0.015 (0.015) 0.018 (0.017) 0.000 (0.000) 0.418** (0.203) 0.281 (0.256) -0.111 (0.079) 0.070 (0.151) 0.275** (0.127) -0.305 (0.241) 0.114 (0.166) 0.308** (0.151) -0.495*** (0.088) 0.263*** (0.053) 9.306*** 7.816*** 8.918*** (0.066) (0.253) (0.374) 0.638 0.672 0.685 5,377 5,377 5,377 Random effects 7 8 9 0.115*** 0.086*** 0.083*** (0.004) (0.005) (0.005) 0.043*** 0.041*** (0.004) (0.004) -0.001*** -0.001*** (0.000) (0.000) -0.277*** -0.280*** (0.030) (0.030) -0.030 -0.036 (0.037) (0.037) -0.179*** -0.183*** (0.037) (0.037) -0.302*** -0.292*** (0.056) (0.056) 0.246*** 0.253*** (0.070) (0.070) 0.355*** 0.371*** (0.111) (0.109) 0.177 0.143 (0.180) (0.186) 0.155 0.144 (0.124) (0.124) 0.522*** 0.443** (0.178) (0.175) -0.731*** -0.754*** (0.049) (0.048) 0.240*** (0.021) 8.382*** 8.846*** 8.791*** (0.034) (0.085) (0.084) 0.233 0.374 0.383 5,377 5,377 5,377 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis The fixed effects model does not work well, indicating very low within variation29 in our variables especially after the inclusion of the time dummy. The time dummy further reduces the variation in the data and this somehow causes the results to change dramatically30. As the results show, the coefficient for education in the OLS regression is almost 5 times as bigger as that of fixed effects- a small coefficient 28 Selection bias is explained in Section 3.2.3. 29 Low within variation refers to a situation where independent variables change very gradually over time. 30 We tested as to whether we need to be using a time dummy and the results support its inclusion. 87 Stellenbosch University https://scholar.sun.ac.za for the fixed effect compared to OLS is a sign that there is little within variation. Moreover, in the fixed effects model, the coefficient for education is insignificant. We, therefore, concentrate on OLS and random effects which yield consistent results except for differences in the magnitudes in selected cases. The Breusch Pagan LM test31 yields significant results, indicating that the random effects model is more appropriate compared to OLS. The time dummy shows that average monthly earnings are higher in 2013 compared to 2010. 3.8.1.2 Heterogeneous returns to education The results from the basic models discussed above disregard the differences in the level of educational attainment by looking at a single overall education level- years of schooling. This homogeneous model assumes that there are no differential trends in the returns to education for different levels of education. As earlier discussed, there is little statistical evidence and causal empiricism for the homogenous model. The heterogeneous model provides the alternative and looks at the different levels of education as having separate effects on earnings. Using this model specification, we replace years of schooling ( S ) with an educational dummy variable to represent the different educational categories discussed in Section 3.2.2. The results are presented in Table 3.15 and we provide a graphical illustration in Figure 3.2 but based on the random effects only. Regardless of gender, the returns to education increase with the level of education supporting a convex relationship between education and earnings. Our results confirm the finding that the returns increase with the level of schooling in Sub-Saharan Africa but are also contrary to the literature supporting concave rates of returns such as Psacharopoulos (1994). Also consistent with the international literature is the finding that female workers tend to have much higher returns on education than male workers, particularly at higher levels of education. Similar findings have been established in Malawi by Chirwa and Matita (2009). Although more men enter the labour market, they tend to have lower returns than their female counterparts with similar education levels suggesting that female education is more effective in generating returns in Malawi. The high rates of return to higher education for females and tertiary education may mean that gender discrimination favouring men is reduced at higher levels of education. It is worth noting that the education system in Malawi has greatly emphasised on primary education especially with the introduction of free primary education in 1994. There has been a lot of expansion in primary education attainment and this possibly explains the low returns. 31 The test helps a researcher decide between a random effects regression and a simple OLS regression (e.g., Baltagi, B.H. & Li, Q., 1990). The null hypothesis is that variances across individuals are zero. Put differently, that there is no significant difference across units (i.e. no panel effect). 88 Stellenbosch University https://scholar.sun.ac.za Table 3.15: OLS and random effects results for log of monthly wages using education categories Pooled OLS Male Female All PSLCE 0.168** -0.022 0.120* (0.076) (0.117) (0.067) JCE 0.341*** 0.428** 0.356*** (0.088) (0.208) (0.087) MSCE 0.670*** 1.184*** 0.753*** (0.106) (0.166) (0.094) Non-University Diploma 1.430*** 1.617*** 1.498*** (0.149) (0.206) (0.135) University Diploma 1.891*** 2.085*** 1.995*** (0.173) (0.330) (0.160) Post-Graduate Diploma 2.773*** 3.220*** 2.819*** (0.299) (0.295) (0.314) Experience 0.056*** 0.034*** 0.045*** (0.005) (0.006) (0.004) Experience squared -0.001*** -0.001*** -0.001*** (0.000) (0.000) (0.000) Female -0.273*** (0.032) Central -0.101 -0.07 -0.09 (0.081) (0.087) (0.074) Southern -0.214*** -0.238*** -0.227*** (0.068) (0.083) (0.063) Private Individual -0.268*** -0.339* -0.292*** (0.096) (0.193) (0.090) Government 0.119 -0.004 0.113 (0.129) (0.179) (0.110) Parastatal 0.379*** -0.188 0.247** (0.139) (0.186) (0.122) Public Works Program 0.052 0.162 0.139 (0.275) (0.229) (0.204) Church/Religious Organisation -0.233 0.53 0.009 (0.165) (0.379) (0.210) Other 0.066 0.01 0.044 (0.293) (0.316) (0.238) Casual/Ganyu -0.637*** -0.784*** -0.698*** (0.094) (0.160) (0.086) Year=2013 0.282*** 0.298*** 0.289*** (0.038) (0.054) (0.038) Constant 9.100*** 9.124*** 9.234*** (0.119) (0.183) (0.111) R-squared 0.308 0.316 0.337 Observations 3,244 2,133 5,377 Description Random effects Male Female All 0.162*** 0.052 0.141*** (0.051) (0.082) (0.043) 0.411*** 0.351*** 0.401*** (0.057) (0.103) (0.050) 0.726*** 1.071*** 0.800*** (0.063) (0.132) (0.057) 1.435*** 1.612*** 1.503*** (0.093) (0.162) (0.082) 1.932*** 1.999*** 1.968*** (0.132) (0.20) (0.109) 2.590*** 3.069*** 2.673*** (0.206) (0.290) (0.186) 0.046*** 0.027*** 0.037*** (0.005) (0.005) (0.004) -0.001*** -0.000*** -0.001*** (0.000) (0.000) (0.000) -0.282*** (0.029) -0.088* -0.085 -0.087** (0.046) (0.057) (0.036) -0.194*** -0.234*** -0.214*** (0.045) (0.058) (0.036) -0.198*** -0.254* -0.208*** (0.060) (0.134) (0.055) 0.154** 0.061 0.149** (0.075) (0.135) (0.066) 0.279*** 0.307 0.311*** (0.102) (0.252) (0.102) 0.136 0.211 0.204 (0.266) (0.236) (0.170) 0.015 0.263 0.1 (0.143) (0.221) (0.119) 0.349* 0.281 0.324** (0.203) (0.218) (0.145) -0.576*** -0.744*** -0.632*** (0.054) (0.122) (0.049) 0.237*** 0.220*** 0.234*** (0.027) (0.035) (0.021) 9.103*** 9.155*** 9.232*** (0.083) (0.145) (0.069) 0.385 0.424 0.423 3,244 2,133 5,377 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis 89 Stellenbosch University https://scholar.sun.ac.za Figure 3.2: Rates of return on education by gender Source: Own computation from IHPS data. 3.8.1.3 Robustness checks We conduct five main robustness checks to see if our main results are preserved under different conditions and assumptions. These include decomposition of results by gender, analysis of returns to education only based on individuals employed in both waves, alternative treatment of outliers, sample selection and distinguishing the results by sector. We discuss these in the paragraphs that follow. Firstly, we decompose our results by gender. We report OLS and random effects regression results in Table 3.16 having split the sample by the gender of earners. A version of this segregation is already discussed in Section 3.8.1.2 but assuming heterogeneous returns to education. The table shows that the key results after dividing the sample into male and female sub-samples are preserved. The difference for the coefficient comparison test (male minus female) reveals that the coefficients for females are smaller compared to men in most cases as shown by positive and significant differences for years of schooling, experience, government and ganyu. The difference is the largest for ganyu employment. The only case where females have a larger coefficient is within church/religious organisations where the magnitude is very surprisingly large. Secondly, we consider individuals employed in both waves only. As we earlier noted in Section 3.7, these individuals experienced the largest jump in earnings over three years and also have the highest levels of education. The results from Table 3.16 (last two columns) do not differ much compared to those based on the full sample (which includes those only employed in either waves 1 or 2). 90 Stellenbosch University https://scholar.sun.ac.za Table 3.16: Regressions of log monthly wages by gender and employees in both waves Pooled OLS Random effects Employed in both waves Male Female Difference Male Female OLS Random Years of schooling 0.078*** 0.056*** 0.022** 0.089*** 0.068*** 0.076*** 0.083*** (0.008) (0.011) (0.006) (0.008) (0.008) (0.006) Experience 0.056*** 0.035*** 0.021** 0.049*** 0.030*** 0.039*** 0.031*** (0.006) (0.007) (0.005) (0.006) (0.006) (0.006) Experience squared -0.001*** -0.001*** 0.000** -0.001*** -0.000*** -0.001*** -0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Female -0.174*** -0.194*** (0.048) (0.043) Central -0.059 -0.034 -0.025 -0.036 -0.034 -0.059 -0.023 (0.080) (0.090) (0.047) (0.059) (0.084) (0.053) Southern -0.173** -0.227*** 0.054 -0.158*** -0.216*** -0.166** -0.139*** (0.070) (0.085) (0.046) (0.060) (0.076) (0.051) Private Individual -0.303*** -0.618*** 0.315 -0.237*** -0.508*** -0.374*** -0.263*** (0.097) (0.20) (0.060) (0.139) (0.105) (0.065) Government 0.209 0.163 0.046** 0.246*** 0.196 0.207 0.254*** (0.138) (0.201) (0.078) (0.152) (0.137) (0.080) Parastatal 0.430*** -0.182 0.612 0.324*** 0.414* 0.376** 0.412*** (0.155) (0.253) (0.117) (0.234) (0.153) (0.117) Public Works Program 0.024 -0.117 0.141 0.12 -0.056 -0.204 -0.245* (0.280) (0.237) (0.294) (0.243) (0.186) (0.147) Church/Religious Organisation -0.174 0.591 -0.765** 0.026 0.372 0.122 0.208 (0.179) (0.392) (0.144) (0.245) (0.248) (0.139) Other 0.029 0.036 -0.007 0.421* 0.43 -0.179 0.264 (0.337) (0.431) (0.242) (0.287) (0.270) (0.180) Casual/Ganyu -0.692*** -1.108*** 0.416** -0.646*** -1.057*** -0.742*** -0.678*** (0.088) (0.167) (0.053) (0.115) (0.086) (0.057) Year=2013 0.286*** 0.297*** -0.011 0.240*** 0.221*** 0.328*** 0.256*** (0.039) (0.056) (0.027) (0.035) (0.042) (0.026) Constant 8.636*** 9.101*** -0.465** 8.566*** 9.026*** 8.923*** 8.922*** (0.135) (0.227) (0.103) (0.161) (0.145) (0.111) R-squared 0.287 0.278 0.348 0.376 0.348 0.411 Observations 3,244 2,133 3,244 2,133 3,052 3,052 Description Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis Thirdly, we compare results based on alternative treatment of outliers. Table 3.17 gives results based on four alternative ways of dealing with outliers discussed in Section 3.3.3. Overall, the results do not change much considering both magnitude and direction, implying that the method of outlier treatment does not really matter. However, failure to account for outliers yields larger returns to education and this would be more evident if we decompose by groups such as sector. 91 Stellenbosch University https://scholar.sun.ac.za Table 3.17: Random effect results based on alternative treatment of outliers Description Outliers included Years of schooling 0.087*** (0.005) Experience 0.042*** (0.004) Experience squared -0.001*** (0.000) Female -0.275*** (0.030) Central -0.04 (0.038) Southern -0.181*** (0.038) Private Individual -0.300*** (0.057) Government 0.228*** (0.072) Parastatal 0.298** (0.125) Public Works Program 0.123 (0.186) Church/Religious Organisation 0.114 (0.125) Other 0.410** (0.174) Casual/Ganyu -0.760*** (0.049) Year=2013 0.246*** (0.022) Constant 8.759*** (0.085) R-squared 0.380 Observations 5,391 Random effects Millionaires 100th percentile Robust regression Extreme residuals 0.081*** 0.077*** 0.085*** 0.083*** (0.005) (0.004) (0.005) (0.005) 0.041*** 0.040*** 0.040*** 0.041*** (0.004) (0.004) (0.004) (0.004) -0.001*** -0.001*** -0.001*** -0.001*** (0.000) (0.000) (0.000) (0.000) -0.279*** -0.274*** -0.273*** -0.280*** (0.030) (0.029) (0.029) (0.030) -0.024 -0.043 -0.042 -0.036 (0.036) (0.036) (0.036) (0.037) -0.168*** -0.189*** -0.189*** -0.183*** (0.036) (0.035) (0.036) (0.037) -0.285*** -0.278*** -0.296*** -0.292*** (0.055) (0.054) (0.055) (0.056) 0.180*** 0.162*** 0.235*** 0.253*** (0.064) (0.061) (0.069) (0.070) 0.362*** 0.344*** 0.362*** 0.371*** (0.107) (0.103) (0.108) (0.109) 0.134 0.055 0.139 0.143 (0.186) (0.165) (0.185) (0.186) 0.095 0.045 0.066 0.144 (0.115) (0.110) (0.113) (0.124) 0.435** 0.417** 0.431** 0.443** (0.174) (0.176) (0.173) (0.175) -0.748*** -0.746*** -0.762*** -0.754*** (0.048) (0.047) (0.048) (0.048) 0.242*** 0.243*** 0.233*** 0.240*** (0.021) (0.021) (0.021) (0.021) 8.786*** 8.826*** 8.786*** 8.791*** (0.083) (0.081) (0.083) (0.084) 0.370 0.370 0.397 0.383 5,367 5,335 5,356 5,377 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis Fourthly, we address sample selection which as indicated can bias results if not addressed in the earnings regression. We, therefore, correct for sample selection using the Heckman selection model as earlier explained. In the first step, we model the probability of labour force participation. In the second step, we model earnings as explained in equation 2 where the inverse mills ratio is now added as an additional explanatory variable. The sample size is reduced by two observations due to the omission of missing values in the process of generating the mills ratio from the first stage. The sample selection corrected results presented in Table 3.18 and do not differ much from the uncorrected findings in Table 3.13 and Table 3.14. A simple interpretation of this is that sample selection is not a very serious issue in our data. 92 Stellenbosch University https://scholar.sun.ac.za Table 3.18: Wage functions corrected for sample selection Description Years of schooling OLS (2010) OLS (2013) 0.067*** 0.061*** (0.009) (0.008) Experience 0.014* 0.030*** (0.008) (0.008) Experience squared -0.000* -0.001*** (0.000) (0.000) Female -0.183*** -0.257*** (0.048) (0.042) Central -0.091 0.018 (0.081) (0.099) Southern -0.266*** -0.139 (0.075) (0.096) Private Individual -0.200* -0.563*** (0.106) (0.125) Government 0.218* 0.231 (0.131) (0.162) Parastatal 0.184 0.377* (0.187) (0.201) Public Works Programme 0.528 -0.251 (0.371) (0.157) Church/Religious Organisation -0.033 0.167 (0.337) (0.229) Other 0.156 0.016 (0.479) (0.277) Casual/Ganyu -0.788*** -0.804*** (0.095) (0.105) Inverse mills ratio_2010 -0.713*** (0.195) Inverse mills ratio_2013 -1.031*** (0.330) Inverse mills ratio Year=2013 Constant R-squared Observations 9.401*** (0.187) 0.346 2,434 9.503*** (0.204) 0.287 2,941 Pooled OLS 0.065*** (0.008) 0.027*** (0.006) -0.000*** (0.000) -0.224*** (0.031) -0.029 (0.075) -0.195*** (0.066) -0.365*** (0.094) 0.220* (0.122) 0.295** (0.138) 0.054 (0.205) 0.089 (0.225) 0.059 (0.275) -0.796*** (0.084) Random 0.077*** (0.005) 0.024*** (0.005) -0.000*** (0.000) -0.242*** (0.030) -0.018 (0.037) -0.186*** (0.037) -0.287*** (0.056) 0.256*** (0.070) 0.377*** (0.110) 0.129 (0.181) 0.137 (0.124) 0.443** (0.175) -0.743*** (0.048) -0.722*** (0.182) 0.208*** (0.044) 9.274*** (0.163) 0.317 5,375 -0.675*** (0.132) 0.162*** (0.025) 9.155*** (0.107) 0.387 5,375 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis However, it is important to correct for sample selection as we have done given that the inverse mills ratio is significant in each of the four models. Without sample correction, the returns to education tend to be larger by between 0.7 and 0.8 percentage points than with sample selection. Our estimates are, 93 Stellenbosch University https://scholar.sun.ac.za therefore, upwardly biased when sample selection is not addressed. Self-selection between sectors is addressed in the section that follows. The final robustness check is to distinguish between formal and informal sectors. Just like in many other developing countries, the formal sector in Malawi only absorbs a small percentage of the labour force (Chirwa & Matita, 2009). In our data set, about 77.54% of individuals aged between 15 and 64 years with positive earnings are employed in the informal sector. Due to the large size of the informal sector, studying earning differentials between economic sectors becomes important for policy. The process of accounting for self-selection is the same as before; we first run a probit on the choice of employment sector (see Table 3.19) from which we obtain the inverse mills ratio used in the second stage. The first stage results show that the following are less likely to enter the formal sector: females, those not married and individuals from households with high age dependency ratio. The rest of the explanatory variables are associated with a higher probability of entering the formal sector. Table 3.19: Probit on choice of employment sector Explanatory variables Coefficient Standard error Female -0.439*** (0.045) Age groups 20-24 0.278*** (0.098) 25-29 0.607*** (0.096) 30-34 0.911*** (0.099) 35-39 0.913*** (0.103) 40-44 1.079*** (0.110) 45-49 1.040*** (0.113) 50-54 1.093*** (0.124) 55-59 0.906*** (0.140) 60-64 0.737*** (0.141) PSLCE 0.512*** (0.062) JCE 0.990*** (0.067) MSCE 1.867*** (0.077) Non-University Diploma 2.343*** (0.166) University Diploma 2.691*** (0.267) Post Graduate degree 2.420*** (0.412) Not married -0.096* (0.051) Age dependency ratio -0.212*** (0.033) Central 0.160*** (0.056) Southern 0.296*** (0.055) Year=2013 -0.169*** (0.041) Constant -1.276*** (0.104) Percent correctly predicted 80.10% Pseudo R-squared 0.315 Observations 5,416 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis 94 Stellenbosch University https://scholar.sun.ac.za For each of the two sectors, we present two sets of results, namely without and with self-selection correction. To achieve this, we distinguish between the formal and informal sectors. Sector decomposition analysis of earnings sheds some light on the importance of distinguishing between the different types of employment sector in estimating returns to education in developing countries. Studies that fail to take this into account tend to overstate the returns to education by assuming that returns are the same in both the formal and informal sectors. We with uncorrected results (without sample selection) in Table 3.20 where we also conduct the coefficient comparison test. Table 3.20: Regression results on log of monthly wages by sector without sample selection Description Formal Years of schooling 0.099*** (0.011) Experience 0.041*** (0.008) Experience squared -0.001*** (0.000) Female -0.079 (0.076) Central 0.030 (0.136) Southern -0.072 (0.132) Private Individual -0.328*** (0.092) Government 0.082 (0.128) Parastatal 0.262** (0.130) Public Works Programme 0.054 (0.216) Church/Religious Organisation 0.031 (0.215) Other 0.025 (0.276) Year=2013 0.204*** (0.060) Constant 8.576*** (0.221) R-squared 0.238 Observations 1,909 Pooled OLS Random effects Informal Difference Formal Informal 0.037*** 0.062*** 0.086*** 0.047*** (0.009) (0.006) (0.007) 0.048*** -0.007 0.027*** 0.048*** (0.006) (0.006) (0.004) -0.001*** 0.000 -0.000** -0.001*** (0.000) (0.000) (0.000) -0.345*** 0.266*** -0.117* -0.368*** (0.033) (0.066) (0.032) -0.113 0.143 0.085 -0.122*** (0.079) (0.073) (0.040) -0.264*** 0.192 -0.052 -0.263*** (0.070) (0.071) (0.040) -0.243*** (0.056) 0.171** (0.068) 0.347*** (0.098) 0.113 (0.182) 0.158 (0.126) 0.353** (0.139) 0.322*** -0.118 0.164*** 0.280*** (0.047) (0.030) (0.028) 8.348*** 0.228 8.767*** 8.308*** (0.115) (0.135) (0.078) 0.120 0.297 0.132 3,468 1,909 3,468 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis The returns to education are positive in both sectors but with larger magnitudes in the formal sector for both OLS and random effects. It is not surprising for the informal sector to have positive returns to 95 Stellenbosch University https://scholar.sun.ac.za education. Previous research in Malawi by Chirwa and Zgovu (2002) has also shown that casual employment (informal sector wage employment) on peak labour tasks may be well paid, above the daily minimum wage, but short-lived. The convex relationship between earnings and experience is also maintained in both sectors except that the coefficient is bigger in the informal sector than the formal sector. This implies that experience matters more in the informal sector. The time dummy coefficient is also positive in both models but larger in the informal sector, an indication that earnings grew much stronger within the informal sector. Next, we look at results corrected for sample selection in Table 3.21. Table 3.21: Regression results on log of monthly wages by sector with sample selection Pooled OLS Formal Informal Years of schooling 0.058*** 0.012 (0.014) (0.011) Experience 0.029*** 0.030*** (0.010) (0.008) Experience squared -0.001*** -0.001*** (0.000) (0.000) Female 0.070 -0.207*** (0.087) (0.048) Central -0.005 -0.154* (0.137) (0.082) Southern -0.153 -0.329*** (0.130) (0.071) Private Individual -0.275*** (0.092) Government 0.060 (0.126) Parastatal 0.266* (0.137) Public Works Programme 0.116 (0.218) Church/Religious Organisation 0.015 (0.215) Other 0.062 (0.269) Inverse mills ratio -0.459*** -0.323*** (0.146) (0.082) Year=2013 0.249*** 0.371*** (0.064) (0.045) Constant 9.472*** 9.144*** (0.330) (0.256) R-squared 0.251 0.127 Observations 1,909 3,466 Description Random effects Formal Informal 0.044*** 0.015* (0.007) (0.008) 0.012* 0.027*** (0.006) (0.005) 0.000 -0.000*** (0.000) (0.000) 0.041 -0.210*** (0.065) (0.041) 0.023 -0.169*** (0.073) (0.040) -0.145** -0.337*** (0.072) (0.041) -0.178*** (0.057) 0.111* (0.066) 0.324*** (0.099) 0.209 (0.187) 0.148 (0.126) 0.366*** (0.131) -0.570*** -0.363*** (0.071) (0.059) 0.215*** 0.331*** (0.029) (0.029) 9.799*** 9.224*** (0.174) (0.163) 0.311 0.142 1,909 3,466 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis 96 Stellenbosch University https://scholar.sun.ac.za We note a number of things that we did not observe without sample selection. Although the key results are preserved in terms of direction and significance after correcting for self-selection, the magnitudes of the coefficients significantly become smaller. This suggests that failure to account for sample selection upwardly biases the results reported in studies. For example, after controlling for sample selection, we find a smaller statistically significant coefficient for gender in the formal sector compared to the formal sector. In addition, while the coefficients for regions are insignificant in the formal sector, the differences are not only significant but also larger in the informal sector. 3.8.2 Household non-farm enterprise earnings In this section, we explore two main ideas. Firstly, we would like to establish if there are positive ‘externalities’ in education in the running of household enterprises. Secondly, we establish if the observed externalities have a turning point beyond which they either increase or decrease. Table 3.22 presents the results where we establish that strong education externalities exist within the households in non-farm enterprises. After controlling for other factors, we find returns to education of up to 12.8% and 15.8% for OLS and random effects, respectively. The control factors are experience, the gender of the owner, location of the enterprise, enterprise industry, the age of the enterprise, type of enterprise customers, enterprise registration status with an association and time dummy. We add a square term of maximum education to establish if education externalities have a turning point beyond which they either increase or decrease. Some studies have established that education externalities have been found to be monotonic. For example, Mussa (2014) uses quartiles of average household education and finds that the marginal effects do not switch signs with respect to efficiency and production uncertainty of maize in Malawi. This finding is confirmed in this study; the coefficient of the square of maximum education is not significant, implying that education externalities are monotonic, i.e. they do not increase with levels of education. Our study finds contrary evidence to that by Matita and Chirwa (2009), who find that enterprises owned by females tend to be less profitable by about 73% compared to those owned by males. We find no statistically significant earning differences between males and females. Enterprises in rural areas earn almost 60% lower than their counterparts operating in urban areas. This is probably due to the fact that the demand for goods and services is higher in urban areas than in rural areas. Matita and Chirwa (2009) find similar results. With respect to industry, the following enterprises are more profitable when compared to agricultural enterprises: mining and quarrying (OLS only), construction and transport, storage and communication sector. Older enterprises tend to generate more profit probably because they are more likely to have a 97 Stellenbosch University https://scholar.sun.ac.za loyal customer base accumulated over time and have accumulated the necessary experience. As expected, enterprises in the informal sector are less profitable compared to those in the formal sector. Informal sector enterprises are typically micro and small enterprises with limited access to capital and entrepreneurial skills. Table 3.22: Regressions for monthly household non-farm enterprise earnings Explanatory variables Maximum years of schooling Maximum years of schooling squared Owner experience Owner experience squared Enterprise owner is female Rural Enterprise industry Mining and Quarrying Manufacturing Construction Wholesale, Retail and Trade Transport, Storage and Communication Financing, Insurance and Business Community, Social and Personnel Informal sector Age of household enterprise Main enterprise customers Traders Other small businesses Large established businesses Marketing board (ADMARC) Other Not registered with enterprise association Year=2013 Constant R-squared Observations OLS 1 2 3 4 0.144*** 0.189*** 0.128** 0.128** (0.018) (0.059) (0.053) (0.053) -0.002 -0.002 -0.002 (0.002) (0.002) (0.002) 0.008** 0.008** (0.003) (0.003) -0.000*** -0.000*** (0.000) (0.000) 0.055 0.053 (0.069) (0.068) -0.567*** -0.567*** (0.117) (0.118) Random effects 5 6 7 0.143*** 0.229*** 0.163*** (0.012) (0.046) (0.045) -0.003* -0.003* (0.002) (0.002) 0.008*** (0.003) -0.000*** (0.000) -0.008 (0.064) -0.526*** (0.078) 8 0.158*** (0.045) -0.003 (0.002) 0.008** (0.003) -0.000*** (0.000) -0.016 (0.063) -0.531*** (0.078) 0.806* 0.804* (0.454) (0.463) -0.299* -0.279 (0.171) (0.179) 1.316*** 1.338*** (0.402) (0.40) -0.078 -0.061 (0.184) (0.193) 0.334 0.346 (0.274) (0.279) 0.32 0.374 (0.917) (0.912) 0.029 0.049 (0.236) (0.242) -1.182*** -1.183*** (0.123) (0.124) 0.018*** 0.017*** (0.005) (0.005) 0.896 (0.565) 0.052 (0.284) 1.675*** (0.476) 0.269 (0.284) 0.808** (0.347) 0.753 (0.825) 0.173 (0.316) -1.097*** (0.113) 0.020*** (0.004) 0.908 (0.565) 0.078 (0.285) 1.699*** (0.474) 0.291 (0.286) 0.821** (0.347) 0.793 (0.817) 0.201 (0.317) -1.096*** (0.113) 0.019*** (0.004) 0.228 (0.172) 0.645*** (0.216) 1.486*** (0.355) 4.467*** (0.240) 0.102 (0.177) -0.376* (0.202) 0.301** (0.152) 0.840*** (0.237) 0.905* (0.547) 4.031*** (0.152) 0.197 (0.210) -0.23 (0.215) 0.230 (0.173) 0.661*** (0.218) 1.502*** (0.354) 4.550*** (0.263) 0.138 (0.177) -0.377* (0.203) 0.108 (0.079) 7.312*** 7.059*** 9.455*** 9.382*** (0.189) (0.336) (0.470) (0.476) 0.101 0.101 0.256 0.257 1,702 1,702 1,702 1,702 0.304** (0.153) 0.855*** (0.238) 0.939* (0.539) 4.120*** (0.160) 0.231 (0.206) -0.234 (0.215) 0.119** (0.060) 7.446*** 6.943*** 8.709*** 8.664*** (0.120) (0.276) (0.465) (0.467) 0.130 0.131 0.288 0.290 1,702 1,702 1,702 1,702 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; robust standard errors in parenthesis 98 Stellenbosch University https://scholar.sun.ac.za The data also allows us to control for the type of enterprise customers. We use final consumers as the base category since we expect enterprises selling to these customers to be less profitable compared to institutional customers such as other small businesses, large established businesses and the national marketing board (ADMARC). This is confirmed in our study for both OLS and random effects models. Enterprises that are not registered with any enterprise association make less profit compared to those that belong to an association according to OLS but in the random effects model, the result is not significant. 3.9 Measurement error using panel data In panel data, measurement error, just like non-random attrition bias, is of concern and an attempt is usually made in the literature to arrive at results that are robust to these concerns (Deaton, 1997). We discuss two types of measurement error as discussed in Wooldridge (2002), namely measurement in the dependent variable and measurement in the independent variable. The assumptions we make about the measurement error are important. First is the usual assumption that the measurement error has zero mean. However, if this is not the case, then the estimation of the intercept is affected. The second assumption relates to the relationship between the measurement error and the explanatory variables included in the model. If the measurement error in the earnings is statistically independent or uncorrelated with each explanatory variable, then the OLS estimators from equations are consistent (and possibly unbiased as well). Consequently, measurement error does not bias the coefficients but only leads to larger standard errors than when the dependent variable is not measured with error, i.e. it leads to loss of efficiency. In our model, one can reasonably argue that measurement error is correlated with education since people with more education tend to report their earnings more accurately. However, in the absence of additional information, it is difficult to establish if measurement error in earnings is related to any of the explanatory variables. One solution to measurement error in earnings is to collect more data because more observations imply a better estimator of variance and consequently reduces the errors in inferences. This solution is beyond the researcher’s control considering that the data is secondary data32. Measurement error in the independent variables is considered a more serious problem than measurement error in the dependent variable. In panel data, the most common method of dealing with measurement error is first differencing (short and longer differencing). However, our data comes from a panel of two 32 Alternatively, we can use the Malawi Labour Force (2013) cross-sectional data section for comparison purposes. Nevertheless, there is no guarantee that this is ‘better’ quality data. 99 Stellenbosch University https://scholar.sun.ac.za periods such that longer differencing is not possible. Moreover, first differencing when T=2 yields the same results as fixed effects presented in Table 3.14. We are thus unable to correct for the possibility of such measurement error. 3.10 Comparing income and consumption There is a good discussion in the literature as to which is a better measure between income and consumption in the money-metric approach to the measurement of living standards. In this section, we want to find out if our consumption and income data tell a consistent story as pointed out in Section 3.1. In general, consumption is considered a better measure of long-term living standards than income because it is less volatile on an annual basis for most households. Unlike income, consumption is said to fluctuate much less dramatically. This argument particularly holds for low-income countries (including Malawi) where there is a large informal sector. Generally, it is difficult and less straightforward to estimate income from self-employment activities and casual employment in the informal sector (Haughton & Khandker, 2009). Furthermore, Malawi is largely agricultural based and incomes are largely vulnerable to seasonal and weather patterns. On the other hand, income is relatively easy to measure in many high-income countries where salaries and wages form the largest sources of income. Another advantage of using consumption expenditure over income is that while income usually fluctuates over an individual’s lifetime, consumption remains relatively stable as a result of smoothing by saving and borrowing (Blundell & Preston, 1998); McKay, 2000; Duclos & Araar, 2006; Haughton & Khandker, 2009) as explained in the permanent income hypothesis. In the IHPS survey, consumption seems to have been collected in a greater detail than income; only two modules of the household questionnaire are dedicated to collecting information on earnings (Modules E and N) compared to five modules aimed at collecting information about consumption (Modules G through K). In order to reduce recall associated with consumption and aspects of agricultural activities, it was planned that households be visited twice in 201333. The visits were only approximately three months apart which we believe does not adversely affect recall bias. The reference period for consumption depends on the items (e.g., the reference period is one week for food consumption and either one week, one month, three months and 12 months for non-food expenditures). However, the reference period for earnings is over the past twelve months. According to Deaton (1997), large increases in prices (inflation) will tend to overstate consumption relative to income due to the differences in the periods of reporting as noted in the questionnaire. Following this line of 33 About 92% of the households in the survey were visited twice in 2013. 100 Stellenbosch University https://scholar.sun.ac.za argument, consumption is more likely to be reported in more recent and now higher prices than is income. However, inflation has not increased a lot over the period under study. Moreover, this inflation bias (if any) is partly reduced due to the fact that our consumption estimates are both spatially and temporally deflated. We compare income and consumption using kernel densities as presented in Figure 3.3. Our analysis excludes households that split to form new households although the picture does not change even with the inclusion of the split-off households. Based on the figure, we can see that both consumption and earnings data seem to tell a consistent story. The data shows that, on average, both percapita income and percapita consumption have remained the same amongst households that were available in both waves. Nevertheless, we note that on the lower tail, consumption is generally larger than income meaning that some households to consume more than their income. This could be due to dissaving or receipts of grants and grants. However, using household total income and consumption and, therefore, ignoring household size, the data shows that income and consumption have increased by 6.3% and 3.0%, respectively over three years. This shows that adjusting for household size is important especially when there are fewer earners in the household. Household percapita income (consumption) is calculated by diving total annual household income (consumption) by household size. We loosely define income as the sum of wages and earnings from self-employment activities. Household income is the sum of the incomes of all the earners in the household. The income was then converted into annual figures and adjusted into percapita figures by dividing by household size. Figure 3.3: Kernel densities for annual household percapita consumption and income by year Source: Own computation from IHPS data; the vertical line is the annual poverty line 101 Stellenbosch University https://scholar.sun.ac.za 3.11 Dynamics of household percapita consumption Table 3.23 gives a breakdown of household percapita consumption by survey year. Despite consumption remaining unchanged, we note that some consumption components such as housing have declined by almost 4 percentage points. An analysis of housing expenditure by urban-rural areas shows that the largest decline came from urban areas (34%) compared to (12%) in rural areas. This disparity is perhaps due to the fact that most households in rural areas live in their own houses. Surprisingly, households spend equal proportions on alcohol and education. Table 3.23: Average household percapita consumption by components and year 2010 Description Food Amount 2013 Percent of total Amount Percent of total 84,962 54% 88,939 56% Alcohol 2,949 2% 4,508 3% Clothing 4,340 3% 4,978 3% Housing 34,001 22% 27,853 18% Furnishings 5,493 3% 5,400 3% Health 2,420 2% 1,958 1% Transport 7,730 5% 9,865 6% Communication 5,890 4% 3,972 3% Recreation 1,484 1% 1,070 1% Education 2,459 2% 2,707 2% Hotels and restaurants 1,706 1% 2,478 2% Miscellaneous 3,855 2% 4,180 3% Total 157,287 100% 157,907 100% Source: Own computation from IHPS data In order to analyse the household consumption poverty dynamics, we categorise households into three groups with reference to the poverty line34. The first group identifies households below the food poverty line to capture the extent of extreme poverty (0- K53,262). The second group gives households with percapita consumption between food and absolute poverty line (K53,262-K85,852). The second group captures movements with poverty. Finally, the third group consists of non-poor households with consumption above absolute poverty (>K85,852). 34 Households whose total percapita household consumption is below K53,262 (food poverty line)are considered ultra-poor while those with consumption below K85,852 are poor. The absolute poverty line is the sum of the food and non-food poverty lines. 102 Stellenbosch University https://scholar.sun.ac.za The resulting consumption transition matrix presented in Table 3.24 shows that in 2010, the proportion below the food poverty line was about 12%. This proportion has declined to about 9% in 2013. Conversely, the proportion above the food poverty line has increased from about 88% in 2010 to about 91% in 2013. This change is reflected in two parts. First, there is a slight increase in the proportion above the absolute poverty from 66% in 2010 to about 67% in 2013. Second, there has also been movement with poverty as reflected in the increase in the proportion between the food and absolute poverty line from 22% in 2010 to 24% in 2013. Despite the improvement in household welfare between 2010 and 2013, there are some considerable transitions, into, within and out of poverty. Out of the 12% that was below the food poverty line, about 4.7% moved upwards but still within absolute poverty while 4.4% moved above the absolute poverty line. About 2.7% households have been stuck in extreme poverty over the three years. Table 3.24: Consumption transition matrix 2013 Below food poverty line (K53,262) Between food and 2010 absolute poverty line Above absolute poverty line (K85,852) Total Above Below food Between food poverty line and absolute (K53,262) poverty line 2.7% 4.7% 4.4% 11.8% (0.004) (0.006) (0.005) (0.011) 3.6% 8.0% 10.6% 22.2% (0.004) (0.008) (0.007) (0.011) 3.0% 11.3% 51.8% 66.0% (0.004) (0.008) (0.017) (0.016) 9.3% 24.0% 66.8% 100.0% (0.008) (0.013) (0.016) absolute poverty line Total (K85,852) Source: Own computations from IHPS data; figures in parenthesis are standard errors Amongst those within poverty (22.2%), about 8% remained within poverty while 10.6% moved out of poverty and a further 3.6% slipped below the food poverty line into extreme poverty. Of those that were above absolute poverty line (66.0%) in 2010, 3% ended up in extreme poverty, 11.3% became absolutely poor while 51.8% of the households remained out of poverty. 103 Stellenbosch University https://scholar.sun.ac.za In the IHS panel survey, respondents were asked about the shocks that affected their households and how they responded to the shocks. According to the results, the major shocks were increases in the prices of food and agricultural inputs. In order to smooth their consumption, households embarked on a number of mitigation measures against the shocks. In 2013, nearly more than one-third (about 35%) of households reported having used their own savings to cope. This is an increase compared to only 21% of the households in 2010. There has also been an increase in the number of households using other mitigation measures between 2010 and 2013. These measures include seeking help from relatives and friends, help from the government, changes in dietary patterns, selling of assets, use of credit and spiritual help. 3.12 Conclusion and policy implications The study sought to examine the returns and externalities to education in Malawi using the IHS3 panel data set. The random effects results based on the standard Mincerian earnings functions show that the average rate of return to years of schooling in Malawi is around 8.3% for the full sample. Consistent with the existing literature, we obtain returns to education that increase with the level of education and are also higher for females than males. Decomposition of the sample by economic sector reveals that the rate of return on education is lower in the informal sector at 8.6% in the formal sector and 4.7% in the informal sector. The rates of return found in this study compare favourably with those observed in other studies in Malawi and other Sub-Saharan African countries including Ghana, Cameroon and Rwanda. Sectoral analysis of earnings gives us some more interesting results different than when the sample is not split. First, we observe that there is a smaller statistically significantly difference in earnings between males and females in the formal sector than in the informal sector. Second, we find that returns to education are positive in both sectors but lower in the informal sector. These two findings highlight the importance of distinguishing between the different types of employment sectors in estimating rates of return on education in developing countries. The assumption that returns are the same in both sectors of the economy is not realistic and suggests that studies that fail to take this into account tend to overstate the returns to education. The coefficient comparison tests help us arrive at a better way of comparing estimates across models. The results are robust to different model specifications and sample selection. After accounting for sample selection, the returns to education drop by between 0.7 and 0.8 percentage points. Our results show that education externalities play a significant role in nonfarm enterprises. Education in Malawi has wider social benefits which should not be underestimated. We also conduct data consistency checks to ensure data quality. This is important if we are to construct stable and meaningful comparable data for further analysis. Specifically, the chapter highlights issues related to data quality and inconsistencies in measuring returns to education. Unless the sources of inconsistencies are explained, it is difficult to conduct further meaningful analysis with the data. 104 Stellenbosch University https://scholar.sun.ac.za The results have a number of implications with respect to policy. First, the positive and large returns from schooling suggest that education is a good investment. This is particularly supported by the presence of education externalities. Access to education, therefore, is important for all. Second, since returns increase with the level of education, it may be important to invest more resources into higher level education while not neglecting primary education. The current policy makes primary education universally free. This is good given the positive social returns of education established in the operation of non-farm enterprises. We, however, note with concern that the government has recently reduced its subsidy contribution being offered for tertiary education yet the findings from this study show that returns to education are highest at the tertiary level. Perhaps, the government should simply redirect these resources to other areas within tertiary education rather than reduce the contribution. Third, education policy should encourage female education considering that females with similar skills to males tend to have higher returns. This may be both economically efficient and equitable although currently there are fewer females entering the labour market in Malawi. Therefore, the focus of policy should not only be primary education but also tertiary education where the returns are the highest. 105 Stellenbosch University https://scholar.sun.ac.za Chapter 4 Patterns of migration and employment in Malawi: Spatial data analysis 4.1 Introduction There is growing interest in spatial analysis in the literature and it is now widely recognised that the standard non-spatial econometric techniques produce biased results in the presence of spatial autocorrelation and heterogeneity. Therefore, spatial analysis is at the research frontier and has recently become more central in both theoretical and applied econometrics including the fields of labour and agricultural economics (Anselin, 2003). This growing interest has allowed the development of econometric techniques that can handle spatial data. There has also been a rapid spread of geographic information systems (GIS) and the associated availability of data sets containing the location of observations. Taking advantage of these developments as well specific literature gaps in Malawi, this study pursues three main aims: (i) to understand the spatial and temporal patterns of employment and migration in Malawi through the application of spatial data econometric techniques; (ii) to analyse how long-term changes in age structure affect labour force participation; and; (iii) to explore the effects of land reform policy on migration and employment through the use of a difference-in-difference estimation strategy. The study contributes to the literature in three main ways. First, we add a gender dimension to our analysis and this has implications for development policy in a poor country such as Malawi. In the literature, it is argued that economic growth and structural change help remove barriers faced by women in development and, therefore, improve their labour force participation. However, little attention has been given to the empirical analysis of such assertions especially in the context of developing countries (Gaddis & Klasen, 2014). The study contributes to this strand of literature. The second contribution of this study relates to the application of spatial panel data econometric methods to the study of employment and migration in Malawi. This is important because the economic phenomena we are studying are essentially spatial in nature in the sense that we expect their values in a given location to be determined by the values in neighbouring locations. To our knowledge, such a study has not yet been done in Malawi. With respect to this objective, we compare results obtained with the application of spatial econometric models with estimation results obtained from non-spatial panel data models. Finally, by matching census data GIS codes that are consistent over time at the level of small geographical places, it is now possible to integrate census data with other data for similar spatial analysis. This is relevant because all nationally representative data sets collected by Malawi’s National Statistical Office (NSO) use censuses as the sampling frame. For example, one can now merge census data with Demographic Health Survey (DHS) data aggregated at the district or small geographical area. This kind of data integration could be an area for further study. 106 Stellenbosch University https://scholar.sun.ac.za 4.2 Data and methods 4.2.1 Data Using data from the Integrated Public Use Microdata series (IPUMS)-International, Minnesota Population Centre, we matched district and small geographical area level GIS data codes to create boundaries that are consistent over time, namely for 1987, 1998 and 2008. This matching gives us a panel of districts and small geographical areas (also called traditional authorities) over which we conduct our spatial analysis while also exploring the advantages of working with panel data. Based on previous work, most of the indicators are heterogeneous at small geographical areas rather than regions where there seems to be uniformity. At this stage, it is important to provide a brief background on Malawi’s administrative units. The country has three regions, namely Northern, Central and Southern regions. The regions are divided into districts which are further subdivided into small geographical areas called traditional areas. Figure 4.1 shows the administrative boundaries for Malawi. Malawi has four cities: Lilongwe, located in the Central region, Mzuzu in the Northern region, and Zomba and Blantyre in the Southern region. Since Lilongwe, Zomba and Blantyre are named after districts, the word “city” is added to the district name to signify the city. Consequently, “Lilongwe” refers to Lilongwe district while “Lilongwe City” to the city (National Statistical Office, 2008). While the regions have remained stable over time, the number of districts and small geographical places in Malawi has been changing over time35. Through the changes, Machinga district was split into Machinga and Balaka, Mulanje was split into Mulanje and Phalombe, and Mwanza was split into Mwanza and Neno. Nkhata-Bay and Likoma are combined in the data (National Statistical Office, 2008). As expected, these changes create a problem of inconsistent comparability of the spatial units over time because we may not be working with the same observations as desired. In order to allow for consistent comparison over time, we dissolved the new districts or traditional authorities and merged them into what they were as at 1987. The end result is data for a total of 24 districts and 178 traditional authorities instead of the 28 districts and 224 traditional authorities as at 2008. These form our units of analysis. Consequently, our variables of interest are collapsed by traditional authority as proportions by year and gender, depending on the level of analysis. A detailed list consisting of regions, districts and traditional areas is provided in Table A4 in the appendix. 35 According to National Statistical Office (2008), there were 23 districts in 1966; 24 districts in 1977; 24 districts in 1987; 28 districts and 4 cities in 1998 and 27 districts and 4 cities in 2008. 107 Stellenbosch University https://scholar.sun.ac.za Figure 4.1: Map of Malawi showing administrative boundaries Source: Own computation from census data 4.2.2 Theoretical framework Migration involves the movement of people from their usual places of residence to another area over a given period of time. Theories of migration try to explain why people migrate and where they come from and where they are going. All models of migration assume that migration is voluntary, although some forms of migrations are involuntary. The theoretical perspectives of migration from the field of economics and other disciplines can be broadly grouped into either of two disciplines, namely the disequilibrium and equilibrium perspectives (Greenwood, 2005). We now discuss these two alternative perspectives. The disequilibrium framework has been the underlying model for understanding and the study of migration among economists dating as far as before the late 1970s. In this framework, migration was thought to be driven by differences in wages across regions or sectors. Consequently, people would migrate from one area to another in search of market opportunities in the form of wages and earnings. This framework has been criticised by the advocates of the equilibrium hypothesis who argue that the geographical differences in wages or incomes are compensating and do not reflect opportunities from 108 Stellenbosch University https://scholar.sun.ac.za which people can benefit. Instead, the equilibrium models argue that migration depends on imbalances in amenities. According to this theory, the patterns of migration are such that people move from amenitypoor areas into amenity-rich areas. However, as people migrate into amenity-rich areas, wages tend to decline and the prices of locally produced goods and services are driven up. Consequently, as wages and prices continue to diverge across regions until they compensate households for the differences in the amenity bundles supplied by the various regions. The common feature in both the disequilibrium and equilibrium models is that migration is motivated by regional imbalances, with the former dependent on wages while the latter on amenities. Furthermore, regardless of the model used, migration is essentially spatial (Cooke, 2013; Greenwood, 2005). Within the equilibrium approach are the gravity and modified gravity models, which have foundations in the gravity law of spatial interaction. In the basic gravity model, migration is a function of the distance between any two places as well as the population sizes of the origin and destination. It has been argued that the basic gravity model does not fully capture all the factors behind migration. The modified gravity models, therefore, include several additional factors including income, unemployment rates, the degree of urbanisation, amenities and taxes, among other factors. Since the 1960s, the modified gravity models hold a prominent place in the literature and combine elements of both the disequilibrium and equilibrium hypotheses of migration (Greenwood, 2005). Human capital theory is another theory used to explain migration. This model rests on the disequilibrium approach in which individuals are assumed to be economic agents seeking to maximise utility from leisure and income. Implicitly, this assumes that an individual's supply of labour is dependent on the wage rate. Individuals will, therefore, migrate only when the expected return exceeds the costs of migration among other costs. The human capital theory has been central to the understanding of migration and faced no competing theory for almost 20 years. Extensions of the human capital model are the ‘spatial job-search models’. Just like the human capital model, individuals migrate to other locations in search of opportunities but have reservation wages and would reject opportunities that give wages below the reservation wage. The reservation wage is itself dependent on several factors, including the rate of unemployment. Presumably, for unemployed people the reservation wage may fall over time (Willekens, 2008; Greenwood, 2005). 4.2.3 Spatial autocorrelation The concept of spatial dependence36 is based on the first law of geography by Waldo Tobler which states that “Everything is related to everything else, but near things are more related than distant things" 36 We use spatial dependence and spatial autocorrelation interchangeably. 109 Stellenbosch University https://scholar.sun.ac.za (Tobler, 1970, p. 236). Based on this law, two or more objects that are spatially close tend to be more similar to each other with respect to a given attribute Y than are spatially distant objects. In this case, we expect positive spatial autocorrelation among geographically close regions. Measures of spatial autocorrelation can be grouped into two broad categories, namely global and local indices. Global indices express the overall level of similarity between spatially close locations in a given study area with respect to a variable Y. A global measure summarises the variable of interest in a single value and, therefore, only shows the average degree of the spatial distribution of the phenomena of interest. Local spatial autocorrelation measures overcome this limitation by detecting spatial clusters which provide greater detail than the global measures. Thus, local measures may help identify the locations that contribute most to the overall pattern of spatial clustering which some have referred to as hotspots (Sokal, Oden, and Thomson, 1998). The most commonly used tests for spatial autocorrelation are the Moran’s and Geary’s c statistics respectively named after their developers, Moran (1948) and Geary (1954). It is the practice in the literature to report both statistics as a means of robustness checks for each other in the detection of spatial dependence. A detailed discussion of these indices is provided in Pisati (2001) and Sokal et al. (1998). We begin our discussion with the global measures. Moran’s is given by I= Where: w ∑ ∑ (4.1) denotes the elements of the spatial weights matrix W corresponding to the location pair (i, j), Z = Y − , Y represents the value taken by a variable of interest Y at location i; mean variable of ; =∑ ∑ and =∑ denotes the ⁄ . Under the null hypothesis of no global spatial autocorrelation, the expected value of I is given by (4.2) E( ) = −1/( − 1) On the one hand, if I is larger than its expected value, then the overall distribution of variable Y can be seen as characterised by positive spatial autocorrelation, meaning that the value taken on by Y at each location i tends to be similar to the values taken on by Y at spatially contiguous locations. On the other hand, if I is smaller than its expected value, then the overall distribution of variable Y can be seen as characterised by negative spatial autocorrelation, meaning that the value taken on by Y at each location i tends to be different from the values taken on by Y at spatially contiguous locations. 110 Stellenbosch University https://scholar.sun.ac.za Inference is based on z-values, computed by subtracting E( ) from I and dividing the result by the standard deviation of I as follows: z = ( ) ( ) (4.3) Geary’s c is calculated as c = ( − 1) ∑ ∑ (4.4) Under the null hypothesis of no global spatial autocorrelation, the expected value of c equals 1. If c is larger than 1, then the overall distribution of variable Y can be seen as characterized by negative spatial autocorrelation; on the other hand, if c is smaller than 1, then the overall distribution of variable Y can be seen as characterized by positive spatial autocorrelation. As in the case of Moran’s I, inference is based on z-values, computed by subtracting 1 from c and dividing the result by the standard deviation of c: z = (4.5) ( ) Both z and z are assumed to follow a normal distribution (asymptotically) such that their significance can be evaluated through the use of a standard normal table (Pisati, 2001). So far we have looked at global measures. We denote the local measures of autocorrelation by subscript to denote location. Therefore, Moran’s I and Geary’s c detect positive and negative spatial autocorrelation around a given location. Positive I z-values and negative c z-values indicate clustering of similar values of Y around location i, that is, positive local spatial autocorrelation, while negative I z-values and positive c z-values indicate clustering of dissimilar values of Y around location , that is negative local spatial autocorrelation. 4.2.4 Spatial panel data models Spatial regression models estimate the relationship between a dependent variable explanatory variables and one or more while taking into account the spatial dependence among observations. In the literature, it is argued that when the observations are spatial units or locations, the standard regression models are usually misspecified because of the presence of spatial dependence (Pisati, 2001). 111 Stellenbosch University https://scholar.sun.ac.za Several Stata tools exist for fitting spatial regression models and their application depends on whether one is working with cross-sectional37 or panel data. In this study, we fit spatial panel data regressions through the use of xsmle by Belotti, Hughes, and Mortari (2013). This approach fits the models through the maximum likelihood (ML) estimation, which is the widely used method for spatial models. Anselin (2003) provides a good discussion on the different approaches for estimating spatial regression models including ML estimation, the method of moments estimators and spatial two-stage least squares. Our discussion begins by considering the following general specification of spatial panel models from which specific types of spatial panel models are derived: Where: = + denotes an + + ; + other; denotes an rowdenotes denotes the spatially lagged dependent variable (DLAG) which assumes that the value taken by affected by the values taken by denotes an denotes the number of locations; matrix of observations on the explanatory variables; The term (4.6) 1 vector of observations on the outcome variable; standardised and inverse distance38 spatial weights matrix; and + = in each geographical area is in the neighbouring areas. 1 vector of normally distributed homoscedastic errors uncorrelated to each the spatial matrix for the spatially lagged independent variables; Z the spatially lagged regressors; and the spatial matrix for the idiosyncratic error. The component or random effects and is the time effect. The terms and is the individual fixed denote the spatial autoregressive parameters. From the general model specification given in (4.6), we derive and discuss five spatial panel models which are normally considered in the literature. First is the Spatial Auto-regressive model (SAR) with lagged dependent variable ( = given by: = + + (4.7) + where the standard SAR model is obtained by setting 37 For example, see Stata’s = 0) and it is by Pisati (2001) and = 0. by Jenty (2010) for a discussion on spatial cross-sectional analysis. 38 Row standardised means that all nonzero weights are rescaled so that they sum up to 1 within each row and inverse distance means that locations that are closer are given higher weights than those farther apart. 112 Stellenbosch University https://scholar.sun.ac.za Second is the Spatial Durbin model (SDM) with time-lagged dependent variable ( = 0) and it is specified as: = + + + (4.8) + where the standard SDM model is obtained by setting τ = 0. The package xsmle allows the use of a different weighting matrix for the spatially lagged dependent variable W and the spatially lagged regressors D together with a different set of explanatory and spatially lagged regressors Z . The default is to use W = D and X = Z . Third is the Spatial Autocorrelation Model (SAC) ( = weighting matrix for the spatially lagged dependent variable = + with + + with = and the error term . It is given as: + = The fourth model is the Spatial Error Model (SEM) ( = = = 0) and it allows the use of a different + (4.8) = = 0) and it is given as: Finally, we have the Generalised Spatial Random Effects (GSPRE) model ( = specification is as follows: = + with =∅ + and v = + = (4.9) = 0) whose (4.10) Table 4.1 provides the summary of the spatial available fixed and random effects spatial models which are the most commonly estimators with panel data39. The SAR and SDM allow for a time-lagged dependent variable which may not be feasible for a panel with 39 = 2. Williams (2015), Clark and Linzer (2015); Wooldridge (2002) provide a detailed discussion on panel data methods and the choice between random and fixed effects. 113 Stellenbosch University https://scholar.sun.ac.za Table 4.1: Summary of spatial panel model options Spatial Generalised Spatial Spatial Spatial Durbin Spatial Error Autocorrelation Panel Random Description Autoregressive Model (SDM) Model (SEM) (SAC) Effects (GSPRE) Model (SAR) Random effects (RE) No time lagged Yes Yes Yes N/A Yes dependent variable With time lagged Yes, DLAG Yes, DLAG N/A N/A N/A dependent variable Fixed effects (FE) No time lagged Yes Yes Yes Yes N/A dependent variable With time lagged Yes, DLAG Yes, DLAG N/A N/A N/A dependent variable Source: Compiled from Belotti et al. (2013 We use fixed effects models because we are able to control for the effects of time-invariant variables with time-invariant effects. Our main results are based on SAR because it combines coefficients for both the weighting matrix and the matrix for the idiosyncratic error term, namely and . SEM and SAC are used as robustness checks. Since SDM uses a time-lagged dependent variable, it is not possible to use for a panel with two periods of data. The non-spatial results are obtained with the help of the standard pooled ordinary least squares (OLS) and fixed effects estimation models. 4.3 Descriptive analysis In this section, we show the spatial distribution of some of the variables of interest on the map of Malawi. This presents a first step in the spatial analysis of the data. Trends in fertility, population age structure, labour force participation, occupational mobility and migration are also discussed. 4.3.1 Spatial distribution of employment, education, assets and fertility From Figure 4.2 to 4.5 we show choropleth maps for some of our indicators of interest drawn using two Stata tools, namely ℎ 2 and 40. These maps reveal some spatial interaction and heterogeneity in Malawi in terms of proportions of employment status, type of employment, education, asset index, years of schooling and fertility rates. The long white strip on the map with no (or missing) data is Lake Malawi. The two dots near the Lake are Likoma and Chizumula Islands. From the maps, we get an overall idea of the spatial distribution of the aggregated economic indicators under study. For example, most of the clustering in terms of the inactive population occurs in the 40 ℎ 2 converts shape boundary files to Stata datasets and 114 is for visualisation of spatial data. Stellenbosch University https://scholar.sun.ac.za Northern Region. The two main reasons for being inactive cited in the data are schooling (52%) and involvement in housework (33%). We also observe some clusters of self-employment activity in some parts of the Central and Southern regions of the country. Education levels are lowest in the Southern region where there are also high levels of fertility and low asset ownership. The statistics generally agree with what is known about Malawi in terms of poverty from Chapter 2. Figure 4.2: Spatial distribution of status in employment for 2008 Source: Own computation from census data Figure 4.3: Spatial distribution of employment type for 2008 Source: Own computation from census data 115 Stellenbosch University https://scholar.sun.ac.za Figure 4.4: Spatial distribution of educational attainment for 2008 Source: Own computation from census data Figure 4.5: Spatial distribution of asset index, schooling and fertility for 2008 Source: Own computation from census data 4.3.2 Fertility trends In Table 4.2, we present patterns of age-specific fertility rates41 for three years of census data, namely 1987, 1998 and 2008. The unadjusted fertility first declined from 5.53 children per woman in 1987 to 4.75 in 1998 before slightly increasing to 5.05 in 2008. Overall, there has been a decline in total fertility rate over the two decades under study. With respect to the child-bearing ages, the patterns in the age41 The age-specific fertility rate is calculated as the number of births in the last twelve months divided by the number of women for each age group. The result is then multiplied by 5 (the number in each of the age group). 116 Stellenbosch University https://scholar.sun.ac.za specific fertility rates have not been uniform. They consistently remained high in the 20-24, 25-29 and 30-34 age groups but particularly increased for women in the 20-24 age group. Table 4.2: Age-specific fertility rates (ASFR) and total fertility rates (TRF) Year 1987 Age group No. of women No. of births in last 12 months Age-specific fertility rate 15 – 19 20 – 24 25 – 29 30 – 34 35 – 39 40 – 44 45 – 49 Total 37,732 34,103 28,211 21,072 20,147 14,128 12,538 167,931 4,477 8,174 6,497 4,230 3,314 1,375 677 28,744 0.59 1.20 1.15 1.00 0.82 0.49 0.27 5.53 15 – 19 20 – 24 25 – 29 30 – 34 35 – 39 40 – 44 45 – 49 Total 55,640 54,149 39,260 29,465 24,550 17,782 16,637 237,483 5,537 12,029 7,981 5,049 3,251 1,415 699 35,961 0.50 1.11 1.02 0.86 0.66 0.40 0.21 4.75 7,558 16,863 12,797 7,614 4,252 1,495 513 51,092 0.55 1.22 1.12 0.94 0.72 0.34 0.15 5.05 1998 2008 15 – 19 68,134 20 – 24 69,177 25 – 29 57,291 30 – 34 40,575 35 – 39 29,428 40 – 44 21,712 45 – 49 17,057 Total 303,374 Source: Own computation from census data Studies have linked patterns of female employment to fertility rates. For example, the literature on feminisation U-hypothesis42 suggests that the female labour force participation rate exhibits a U-shaped trend with respect to fertility rates as countries develop. At the early stages of economic development, when gross domestic product (GDP) is also still very low, most women tend to participate in the labour force even when fertility rates are still high. The women are said to be able to combine economic activity with child-bearing (Gaddis & Klasen, 2014). In the context of Malawi, much of the work is on household 42 The feminisation hypothesis states that economic growth first lowers the participation of women in the labour market and increases it at later stages of economic development. 117 Stellenbosch University https://scholar.sun.ac.za enterprises and small-scale household farms. However as the countries become richer, the shift in the structure of the economy towards manufacturing combined with the emergence of a formal sector results in lower levels of female labour force participation and employment while fertility is high. Despite the improvement in economic activity and favourable structural change, the low levels of female education and the difficulty of combining wage employment with child-bearing contribute to the low participation of females in the emerging opportunities in industry and formal sector expansion. This may as well be influenced by social restrictions against females working in industries as well as allowing married women with children to get employment outside of the home. The subsequent increase in female participation rate could be as a result of a combination of several factors. One possible explanation could be the expansion of education among females. However, it has generally been the case that labour force participation is also quite high amongst those without education. Therefore, improvement in education does not explain everything. Another explanation could be the emergence of attractive employment opportunities for women in the white-collar service industries which give women a chance to enter the labour force without many restrictions. In this case, a decline in fertility rates possibly frees up the time women spend on child-bearing which can be in the end reallocated to other work. Improvements and increased access to child-care facilities may also have allowed women to combine work outside the home with raising children. 4.3.3 Changes in population age structure and labour supply Data for this section are drawn from the October 2011 International Labour Organisation (ILO) Economically Active Population, Estimates and Projections (EAPEP) (6th ed., October 2011). ILO’s EAPEP database contains country estimates and projections of the total population, the activity rates and the economically active population by sex and age groups. In Table 4.3, we use the EAPEP data to show changes in population and labour force between 1987 and 2008. 118 Stellenbosch University https://scholar.sun.ac.za Table 4.3: Changes in economically active population and labour force by sex (1987-2008) Population Age group 1987 1998 2008 Change (1987-2008) Labour force 1987 1998 2008 Change (1987-2008) Male 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 65+ Total Female 15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 65+ Total 407,691 558,786 782,638 343,770 493,273 629,291 284,125 372,529 533,372 223,757 298,164 449,713 192,948 246,465 314,596 157,269 195,192 236,352 120,458 167,955 192,212 97,986 147,204 155,313 77,201 110,054 137,155 57,122 87,802 119,605 92,185 153,561 199,020 2,054,512 2,830,985 3,749,267 420,190 557,397 775,215 363,609 495,638 626,106 306,338 386,990 522,124 247,318 318,279 426,374 219,557 267,212 299,434 178,186 214,836 241,890 135,505 181,539 216,010 110,125 155,631 185,938 88,609 113,442 162,105 68,982 90,758 137,274 116,213 173,485 229,905 2,254,632 2,955,207 3,822,375 92% 83% 88% 101% 63% 50% 60% 59% 78% 109% 116% 82% 84% 72% 70% 72% 36% 36% 59% 69% 83% 99% 98% 70% 139,838 165,401 378,598 249,233 350,717 369,706 262,531 347,942 515,089 215,254 290,114 438,683 187,160 241,536 311,360 152,708 191,483 233,980 116,965 165,100 190,379 94,850 144,407 153,884 74,808 107,743 135,824 54,387 85,344 118,235 81,330 144,040 193,049 1,629,065 2,233,826 3,038,788 171% 208,414 242,468 485,026 267,253 367,763 439,787 241,088 315,784 510,079 203,295 268,627 419,522 185,745 231,138 295,232 155,022 189,915 239,644 120,193 163,204 214,815 98,562 140,846 178,502 79,659 103,119 155,785 59,351 81,682 131,613 87,863 145,727 202,316 1,706,446 2,250,274 3,272,321 133% 48% 96% 104% 66% 53% 63% 62% 82% 117% 137% 87% 65% 112% 106% 59% 55% 79% 81% 96% 122% 130% 92% Source: Own computation from EAPEP data Firstly, we observe that the labour force growth is higher than that of the population. The magnitude is larger for females than for males. For males (females), the labour force has grown by 87% (92%) compared to 82% (70%) for the population. Secondly, the percentage of the youth has been on the increase. Using the International Labour Organisation (ILO) definition, the youth (15-24) made up 37.7% of the total working-age population in 2008 compared to 36.6% in 198743. This explains some of the large increases that have taken place within the youth groups in terms of both the economically active population and labour force participation. These observed changes in the age structure of the population have significant implications on the supply of labour and growth. As we see in Figure 4.6 the population pyramid for 2008 is different from the 43 Estimates from the 2010 IHS3 put the figures at 38% and 66% for the ILO and SADC definitions, respectively. 119 Stellenbosch University https://scholar.sun.ac.za other two in terms of labour force participation rates especially for the youth (ages 15-29) where the numbers have expanded, suggesting that a majority of the youths now participate in the labour market. Figure 4.6 : Pyramids showing populations and labour force participation by sex (1987) Figure 4.7: Pyramids showing populations and labour force participation by sex (1998) 120 Stellenbosch University https://scholar.sun.ac.za Figure 4.8: Pyramids showing populations and labour force participation by sex (2008) Source: Own computation from census data Changes in population depend on migration, fertility and mortality rates. The picture is, however, not complete because education levels are also increasing and this has implications on fertility and the time people spend in education (and therefore entry into the labour market). Fertility rates have not fallen significantly in Malawi between 1987 and 2008 as earlier shown. The mortality rate (measured by crude death rate) has dropped over time from 20.39 deaths per 1000 in 1987 to about 12.13 per 1000 in 2008. It is not straightforward to know whether the combined changes of fertility, mortality and migration translate into significant differentials in the size, growth and structure of the labour force. In Table 4.4 we show the proportions of the population and labour force belonging to each of the age groups. With the exception of the (20-24) age group, the overall changes in the population proportions seem to mirror those in the proportions in the labour force. 121 Stellenbosch University https://scholar.sun.ac.za Table 4.4: Proportions of population and labour force by age group Age groups 15-19 20-24 25-29 30-34 35-39 Population 1987 19.2 16.4 13.7 10.9 9.6 1998 19.3 17.1 13.1 10.7 8.9 2008 20.6 16.6 13.9 11.6 8.1 Labour force 1987 10.4 15.5 15.1 12.5 11.2 1998 9.1 16.0 14.8 12.5 10.5 2008 13.7 12.8 16.2 13.6 9.6 Source: Own computation from EAPEP data 40-44 45-49 50-54 55-59 60-64 65+ Total 7.8 7.1 6.3 5.9 6.0 5.4 4.8 5.2 4.5 3.8 3.9 4.0 2.9 3.1 3.4 4.8 100 5.7 100 5.7 100 9.2 8.5 7.5 7.1 7.3 6.4 5.8 6.4 5.3 4.6 4.7 4.6 3.4 3.7 4.0 5.1 100 6.5 100 6.3 100 We also examine trends in the age dependency ratio and the proportions of the total labour force by age groups. We define dependency ratio as the ratio of dependents (people younger than 15 or older than 64) to the working-age population (those aged 15-64). Based on this definition, the dependency ratio was 98% in 1987, declined to 94% in 1998 before increasing again to 96%. 4.3.4 Spatial and temporal patterns of migration Migration data is only available for 1987 and 2008 and these can be consistently compared over time. The censuses collected migration data at the district level and geographical mobility was only defined where an individual’s district of residence one year ago was different from the district of residence at the time of the census. Movements within a district or city are not considered as migration. However, for districts which share boundaries with cities, if a person moves from rural areas into a city (e.g. from Lilongwe rural to Lilongwe city) and vice versa is recorded as a movement. Migration is thought to be responsible for changes in spatial distribution of populations and labour force in countries. For example, in Malawi, the urban population amongst individuals aged between 15 and 64 years has steadily increased from 11.7% in 1987 to 16.0% in 1998 to 16.6% in 2008. Although these are not very large changes, the trend is likely to increase and this may have implications for a wide range of social, economic and political dynamics in the country, including the general employment situation. The expansion of the labour force must be matched by the creation of new productive jobs into which migrants can be absorbed. While the Northern and Central regions have registered increases, the proportion of people living in the Southern region has dropped from 49.3% in 1987 to 44.5% in 2008. Our analysis assumes, for the sake of simplicity, that Malawi is a closed system. We, therefore, only concentrate on internal migration. Moreover, the census data provide information only on immigrants into the country and not on Malawians now abroad but who lived in the country a year earlier. Two main patterns of migration can be identified as taking place in the country, namely regional and district level migration. 122 Stellenbosch University https://scholar.sun.ac.za 4.3.4.1 Regional movements The first level of analysis involves movements across Malawi’s three administrative regions. The transition frequencies are given in Table 4.5 and accompanying probabilities in Table 4.6. As one might expect, the majority of the people remained in their home region with only a small percentage moving across. Approximately, 96.73% of the males and 97.71% of females resident in the Northern region in 1986 were still there in 1987. A slightly higher percentage than in the North remained in the Centre and South. The patterns largely remained the same in 2008 although slightly more people had migrated from their original region of residence. Much of the people are migrating towards the Centre and in the long-run, it might be expected to have the majority of the people. The reasons for migration are not captured in the census data. However, most migration seems to be undertaken as a way of improving incomes and economic status. Other reasons might include education, marriage and retirement. Table 4.5: Inter-regional migration transition frequencies by gender Year North 1987 North 204,270 Centre 4,990 South 4,270 Total 213,530 2008 North 365,230 Centre 24,830 South 17,580 Total 407,640 Male Centre South 4,030 2,870 720,780 12,790 16,460 876,540 741,270 892,200 22,780 1,312,940 93,200 1,428,920 15,510 54,470 1,389,170 1,459,150 Total 211,170 738,560 897,270 1,847,000 North 231,620 3,470 2,700 237,790 403,520 1,392,240 1,499,950 3,295,710 404,750 22,870 13,130 440,750 Female Centre South 3,280 2,150 778,960 9,390 9,800 1,027,740 792,040 1,039,280 22,810 1,381,350 70,840 1,475,000 14,470 52,400 1,537,130 1,604,000 Total 237,050 791,820 1,040,240 2,069,110 442,030 1,456,620 1,621,100 3,519,750 Source: Own computation from census data Table 4.6: Inter-regional migration transition probabilities by gender Year 1987 North Centre South Total 2008 North Centre South Total North 96.73 0.68 0.48 11.56 Centre 1.91 97.59 1.83 40.13 Male South 1.36 1.73 97.69 48.31 90.51 1.78 1.17 12.37 5.65 94.30 6.21 43.36 3.84 3.91 92.61 44.27 Source: Own computation from census data Total 100 100 100 100 North 97.71 0.44 0.26 11.49 100 100 100 100 91.57 1.57 0.81 12.52 123 Female Centre South 1.38 0.91 98.38 1.19 0.94 98.80 38.28 50.23 5.16 94.83 4.37 41.91 3.27 3.60 94.82 45.57 Total 100 100 100 100 100 100 100 100 Stellenbosch University https://scholar.sun.ac.za 4.3.4.2 District level migration As we will see, the district level migration patterns reveal another important aspect of migration in Malawi namely, rural-rural migration driven by people searching for agricultural land, considering the country’s dominantly agricultural base and high fertility rates. Just like with regional movements, we capture district movements as a transitional probability matrix obtained by a cross tabulation of the current and previous district of residence. The population proportions that migrated in each district by gender and year are shown in Figure 4.9. We also provide a detailed analysis of population figures for each district by gender and year, the number of people that moved between districts and the accompanying proportions in the appendix in tables A6, A7 and A8. Figure 4.9: Proportions that moved between districts Source: Own computation from census data 124 Stellenbosch University https://scholar.sun.ac.za Malawi has had policies that can potentially affect migration patterns such as those shown in the figure. We discuss two of these government policies. Firstly, the government has been supporting the establishment of rural growth centres which act as focal points for development within rural areas. This project falls under the Integrated Rural Development Programme outlined in the MGDS II. This might have the effect of keeping people in rural areas. Secondly, the Malawi Community Based Rural Land Development Project (CBRLDP), which was launched in 2004 and ran through 2009, enabled landless or land-poor households to voluntarily purchase plots from fallow estates and resettle there. The historical problem of land shortage is evident in Thyolo and Mulanje districts, which are traditionally home to tea estate farms since the early 1900s. The land reform policy was aimed at addressing emergent social conflicts related to unequal access to land. As a pilot programme, about 15,000 households from these rural districts were allowed to resettle in other rural districts of Machinga and Mangochi, but later it was expanded to Ntcheu and Balaka to ease pressure on land prices (World Bank, 2012). The policy has implications for migration patterns we observe post-2004 in the Southern region districts affected by CBRLDP and adjacent areas. The observed patterns of migration due to this policy are largely ruralrural migration since the programme targeted Malawi’s rural poor. We do not have data on these specific 15,000 households but in the sections that follow we simply take advantage of the natural experiment by looking at conditions of the affected districts (as a whole) before and after the policy. 4.3.5 Spatial autocorrelation in variables Table 4.7 extends the spatial comparisons in the maps to show the calculated Moran’s and Geary’s statistics for our variables of interest. For lack of space, we only report statistics based on the pooled sample, but we have also computed them on year by year basis where we find that they are generally stable over time as shown in the appendix in Tables A3, A4 and A5. The table shows that there is similarity between spatially close traditional authorities as shown by positive statistically significant Moran’s and Geary’s values. Technically, spatial dependencies justify spatial regression analysis. Table 4.7: Results showing spatial dependencies in variables (pooled 1987, 1998 and 2008) Variables Moran's I SD p-value Geary's c SD p-value Migration 0.397 0.009 0.000*** 0.612 0.011 0.000*** Employed 0.063 0.009 0.000*** 0.930 0.013 0.000*** Years of schooling 0.167 0.009 0.000*** 0.793 0.013 0.000*** Sex 0.153 0.009 0.000*** 0.857 0.013 0.000*** Age 0.056 0.009 0.000*** 0.954 0.015 0.002*** Married 0.127 0.009 0.000*** 0.863 0.012 0.000*** Dependency ratio 0.075 0.009 0.000*** 0.927 0.013 0.000*** Note: *, **, *** denote significance at 10%, 5% and 1% levels 125 Stellenbosch University https://scholar.sun.ac.za We compute and plot Moran’s spatial correlograms for employment and migration based on cumulative distance bands, namely (0; 100,000 metres], (0; 200,000 metres],…, (0; 700,000 metres] presented in Figure 4.10 and Figure 4.11, respectively. Figure 4.10: Spatial autocorrelation for employment Moran's I spatial correlogram (mean) employed I 0.10 0.05 0.00 0-100000 0-200000 0-300000 0-400000 0-500000 Distance bands 0-600000 0-700000 Figure 4.11: Spatial autocorrelation for migration Moran's I spatial correlogram (mean) migration 0.80 I 0.60 0.40 0.20 0.00 0-100000 0-200000 0-300000 0-400000 0-500000 Distance bands Source: Own computation from census data 126 0-600000 0-700000 Stellenbosch University https://scholar.sun.ac.za The correlograms help us to draw two conclusions. Firstly, spatial dependence is larger for shorter distance bands but diminishes with larger cumulative bands and even becomes negative for employment at large distances. Secondly, spatial dependence is higher for migration than for employment. Employment has lower dependence partly because of the fact that most of Malawi is agriculturally based and there seems to be little variation. Figure 4.12 shows pairs of choropleth maps of two measures of local spatial autocorrelation (Moran’s and Geary’s ) for employment and migration in Malawi. For each small geographical area, the maps confirm some degree of similarity between neighbouring areas with respect to both employment and migration. Generally, the two measures yield consistent results in spatial patterns. Figure 4.12: Measures of local spatial autocorrelation for employment and migration Source: Own computation from census data 4.4 Spatial panel regression results The spatial descriptive results and spatial autocorrelation statistics presented in Section 4.3.1 indicate that there is some clustering in our variables of interest at different locations. However, according to Anselin (1992b), this does not explain why the clustering occurs. We, therefore, extend our previous analysis through spatial panel regression analysis. Our analysis is segregated by gender and the 178 small geographical areas form our spatial units. We begin our analysis with the standard non-spatial ordinary least squares and fixed effects regressions. However, as discussed in Section 4.2.3, these are inadequate in providing answers to our question due 127 Stellenbosch University https://scholar.sun.ac.za to the problem of misspecification when spatial locations form our unit of analysis and in the presence of spatial dependence (Pisati, 2001). The rest of the models are spatial fixed effects models whose results we compare with OLS and non-spatial fixed effects (FE). In this section, we specifically explore the effects of land reform policy, explained earlier, on migration and employment in Malawi through a difference-in-difference (DID) estimation strategy. Considering that migration and employment are inherently spatial, we expect the programme to have spill-over effects affecting surrounding districts. We now provide a brief discussion on the set-up of the DID. Firstly, we created a dummy variable called time to indicate the time when the treatment started. The land reform started in 2004 and we assign a value of “0” to the period before 2004 and “1” post-2004. Secondly, we create a dummy variable treatment to identify the group exposed to the treatment. Small geographical areas in Thyolo and Mulanje are assigned a value of “1” and “0” is assigned for the rest of the country. Finally, we create an interaction term between time and treatment as follows (DID = time ∗ treatment). The coefficient for DID is the difference-in-difference estimator in which we are interested. On the basis of the foregoing discussion, the regression implementation of DID for estimating the impact of treatment on the outcome variable (y) is as follows: y = time ∗ + + + (4.13) It is possible to combine fixed effects with propensity score matching (PSM) on the baseline data to ensure that the comparison and treatment groups are similar before applying double differences to the matched sample. Alternatively, balancing tests can be used to check whether the mean of the observables are the same in the base period. The tests are necessary for obtaining valid results because the treatment and comparison groups must be balanced using observed characteristics. 4.4.1 Effects of land reform policy on migration Our econometric results are presented in Table 4.8 for the male and female subsamples, respectively. The key explanatory variables are DID, schooling and the spatially lagged migration (W*migration). We begin with results based on pooled OLS where the results show a positive significant impact of the land reform policy on migration while holding other factors constant. Next, we use the fixed-effects regression model (without spatial effects), which allows us to control for unobserved and time-invariant characteristics that may influence migration. The results again show a positive significant effect but only for the female subsample. The third regression (SAR) is also a fixed 128 Stellenbosch University https://scholar.sun.ac.za effects regression model but takes into account spatial dependencies between variables. Although the positive effect is retained, the sizes of coefficient reduce after controlling for spatial effects. There is also clear evidence of spatial spill-overs in migration between areas as shown by large and statistically significant coefficients for the spatial lag term, namely ∗ . Our results are also robust to the choice of spatial models because the coefficients also remain stable for the SEM and SAC models (see the results in Table A 11 in the appendix). Overall, three issues are worth noting from the results. Firstly, allowing for spatial effects further reduces the size of coefficients for our variable of interest (DID). Secondly, higher migration in one area is associated with higher migration in neighbouring areas, as indicated by statistically significant coefficients for and ∗ (shown in the appendix). Thirdly, there seems to be a gender dimension to our results. We observe that compared to males, the coefficients for females are not only larger but statistically significant even after controlling for spatial effects. Table 4.8: Effects of land reform policy on migration Time Treated DID OLS -0.180*** (0.010) -0.039*** (0.014) 0.035* (0.020) -0.004 (0.004) 0.056 (0.053) -0.001 (0.001) 0.229** (0.095) -0.036 (0.039) 0.039*** (0.005) Male Non spatial -0.188*** (0.028) SAR -0.089*** (0.029) 0.029 (0.020) -0.004 (0.014) 0.106 (0.083) -0.002 (0.001) 0.427** (0.177) -0.08 (0.057) 0.034** (0.014) OLS -0.153*** (0.009) -0.036*** (0.012) 0.036** (0.017) 0.000 (0.003) -0.045 (0.036) 0.001 (0.001) 0.083 (0.061) -0.112*** (0.022) 0.020*** (0.005) Female Non spatial -0.076** (0.031) 0.019 0.038* (0.014) (0.020) Schooling -0.009 -0.022 (0.009) (0.014) Age 0.097* -0.123 (0.056) (0.078) Age squared -0.002** 0.002 (0.001) (0.001) Married 0.389*** 0.174 (0.120) (0.152) Employed -0.081** -0.116*** (0.038) (0.035) Assets 0.034*** -0.002 (0.010) (0.015) W*migration 0.726*** (0.160) Observations 356 356 356 356 356 Spatial units 178 178 178 178 178 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis 129 SAR 0.001 (0.025) 0.027** (0.013) -0.023** (0.009) -0.130** (0.052) 0.002** (0.001) 0.108 (0.102) -0.114*** (0.023) -0.005 (0.010) 0.783*** (0.131) 356 178 Stellenbosch University https://scholar.sun.ac.za We further conducted balancing tests, which is a t-test of the difference in the means of the covariates between the control and treated groups in the base period. This test was performed Stata’s user defined command, namely . The balancing tests presented in Table 4.9 show that there are no differences in observed mean outcomes between the treatment and control samples. This further confirms that our results are robust. Moreover, the World Bank (2012) has rated the land reform policy as satisfactory and exceeding expectation. According to their findings, 15,142 households were successfully relocated against a target of 15,000. Therefore, our finding that the people successfully migrated is substantiated. Table 4.9: Balancing tests for base period Description Weighted Variable(s) Mean Control Mean Treated Male Migration 0.150 0.106 Schooling 2.565 2.472 Age 20.736 20.622 Married 0.333 0.326 Employed 0.646 0.636 Assets -0.410 -0.500 Female Migration 0.148 0.093 Schooling 1.275 1.233 Age 21.681 21.758 Married 0.364 0.361 Employed 0.623 0.627 Assets -0.463 -0.595 Note: *, **, *** denote significance at 10%, 5% and 1% levels 4.4.2 Diff. t P-value -0.044 -0.093 -0.115 -0.007 -0.01 -0.09 3.12 0.0021*** 0.74 0.4600 0.72 0.4720 1.48 0.1400 0.2410 1.18 0.56 0.5770 -0.054 -0.042 0.077 -0.003 0.004 -0.132 3.69 0.0003*** 0.4 0.6888 0.7268 0.35 0.82 0.4132 0.22 0.8271 0.9 0.3707 Effects of land reform policy on employment Regression results based on agricultural and government employment are presented in Table 4.10 and Table 4.11 respectively for both the male and female subsamples. It is important to note that our treatment dummy in this section is for the areas where the people were resettled. Effectively, our results should be understood in terms of the regions where the land reform was implemented. DID, schooling and the spatially lagged employment (W*employed) are the three explanatory variables of focus. The results show that the land reform had positive significant effects on agricultural employment, more especially for females where the coefficients for DID (non-spatial fixed effects and SAR) are not only significant but also generally larger than for males. Similar findings are established for the SEM and SAC as reported in Table A 12 in the appendix. The programme, therefore, might have improved the opportunities of women in agricultural employment. 130 Stellenbosch University https://scholar.sun.ac.za The gender dimension of our results has implications for policy and poverty reduction because the participation of females in employment contributes to economic growth. For example, in a recent study, the Food and Agriculture Organisation (2011) found that by closing the resource gap between men and women in developing countries, agricultural yields for females could potentially improve by between 20% and 30%. As a result, total agricultural output of developing countries would grow by between 2.5% and 4.0%. In turn, the levels of malnutrition would decline by between 12% and 17%, which would globally imply that 100 to 150 million people would be lifted out of hunger. Table 4.10: Effects of land reform policy on agricultural employment Time Treated DID OLS -0.289*** (0.035) 0.080*** (0.027) 0.036 (0.037) -0.064*** (0.010) 2.989*** (0.549) -0.048*** (0.009) 0.109 (0.282) 0.677*** (0.188) Male Non spatial -0.390*** (0.045) SAR -0.306*** (0.106) 0.025 (0.029) 0.077** (0.033) 0.225 (0.777) -0.004 (0.012) -0.071 (0.425) -0.262 (0.202) OLS -0.316*** (0.028) 0.032 (0.021) 0.042 (0.029) -0.042*** (0.006) 1.840*** (0.237) -0.030*** (0.004) 0.217 (0.146) 0.199* (0.118) Female Non spatial -0.443*** (0.056) SAR -0.252*** (0.087) 0.025 0.048* 0.045** (0.020) (0.028) (0.019) Schooling 0.074*** 0.060 0.057** (0.023) (0.040) (0.027) Age 0.237 0.986* 1.020*** (0.538) (0.507) (0.348) Age squared -0.004 -0.016* -0.016*** (0.009) (0.008) (0.006) Married -0.103 -0.100 -0.103 (0.297) (0.378) (0.260) Dependency -0.257* 0.105 0.111 (0.140) (0.155) (0.106) W*employed 0.268 0.569** (0.324) (0.233) Observations 356 356 356 356 356 356 Spatial units 178 178 178 178 178 178 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis The results further show that government employment increased for both males and females but the magnitude was larger for males (see Table 4.11 and Table A13). The analysis was repeated for other types of employment such as self-employment and private employment (see results in Table A 14 in the appendix) and we found a negative coefficient for DID, which is significant for both types of employment for males and also in the case of private employment for women. The opposite sign of these coefficients suggests some trade-off between wage sectors, particularly when contrasted with agricultural and government employment where the coefficient is positive for both males and females. 131 Stellenbosch University https://scholar.sun.ac.za We find positive and significant effects of schooling for all models, except for OLS where the coefficient is negative in agricultural employment. More generally, it is interesting to note that the coefficient for schooling reduces in size after taking into account spatial dependence. The reduction demonstrates the importance of taking into account spatial effects. The statistically significant spatially lagged term W ∗ employed also confirms evidence of spatial spill-overs which implies that the employment status in each small geographical areas affects the likelihood of employment in other neighbouring regions. Table 4.11: Effects of land reform policy on government employment Time Treated DID OLS 0.042*** (0.007) -0.011** (0.005) 0.029*** (0.008) 0.015*** (0.002) -0.335*** (0.113) 0.005*** (0.002) -0.004 (0.058) 0.039 (0.039) Male Non spatial 0.063*** (0.015) SAR -0.010 (0.011) 0.027*** (0.009) -0.005 (0.011) -0.415 (0.252) 0.007* (0.004) -0.008 (0.138) 0.061 (0.065) OLS 0.017*** (0.006) -0.011** (0.004) 0.015** (0.006) 0.008*** (0.001) -0.116** (0.049) 0.002** (0.001) -0.019 (0.030) 0.006 (0.024) Female Non spatial -0.006 (0.014) SAR -0.032** (0.013) 0.022*** 0.013* 0.012** (0.006) (0.007) (0.005) 0.027*** 0.029*** Schooling 0.004 (0.007) (0.010) (0.007) Age -0.344** 0.455*** 0.453*** (0.162) (0.130) (0.089) Age squared 0.006** -0.007*** -0.007*** (0.003) (0.002) (0.001) Married -0.058 0.161* 0.134** (0.089) (0.097) (0.067) Dependency 0.04 0.044 0.041 (0.042) (0.040) (0.027) W*employed 0.913*** 0.648*** (0.060) (0.195) Observations 356 356 356 356 356 356 Spatial units 178 178 178 178 178 178 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis The World Bank (2012) also cites some positive outcomes of the land reform policy which we briefly discuss. The positive effects include an expansion in the average land holding size from less than 0.5 hectares before to approximately 2.2 hectares after the project. In addition, households that participated reported an improvement in maize production (maize stocks after relocation lasted 10.7 months compared to about 3.6 months before); incomes of relocated households grew by 6 times when compared to the control group; maize and tobacco yields were respectively 4 and 2.6 times higher compared to the base period; yields also reached an average level of 50 to 60% higher as compared to control groups in the surrounding areas. 132 Stellenbosch University https://scholar.sun.ac.za Due to the positive results and favourable experience, the government and the World Bank are willing to scale up the programme to cover the whole country, where at least 100,000 households would be resettled (World Bank, 2012). However, while the positive benefits of the reform are widely acknowledged, Chinsinga (2008) raises a number of issues that need to be addressed to ensure its sustainability. The main concern surrounds access to functioning services such as health, water, transport and markets. For example, it was reported that due to lack of markets for produce some farmers ended up selling at very low prices. Related to this, an outbreak of cholera almost led to a collapse of one of the settlement trusts. Specifically, about 20 out of 35 households belonging to Kalungu trust immigrated back to Thyolo for lack of access to portable water facilities. 4.5 Conclusions In this chapter, we created panel data at the level of traditional authorities from national censuses to examine spatial and temporal patterns of employment and migration in Malawi. We make a distinction between males and females because this has implications for development policy. The results showed that over the long-term changes in the population structure have effects on labour force participation rates. We also found clear evidence of spatial spill-overs in our phenomena of interest. Further, allowing for spatial effects reduces the sizes of coefficients and this confirms arguments in the literature that failure to take into account spatial dependencies tends to overstate the results. The results show that Malawi’s community-based land reform policy launched in 2004 influenced migration patterns which were largely rural-rural. Also as a result of the land reform, there was a trade-off in the employment sectors with respect to employment. On the one hand, agricultural and government employment improved, more particularly for the females. On the other hand, selfemployment and private employment declined. Finally, by matching GIS codes at the level of traditional authority areas, the work undertaken for this study makes it possible to integrate census data with other Malawian data sets for similar spatial analysis. In terms of policy, since most indicators are heterogeneous within the country, analysis at low level geographical units as we have done has benefits of allowing for a detailed understanding of phenomena and designing interventions adapted for specific areas. Spatial visualisation of data is also particularly useful for policy makers because it is intuitive. Our study uses aggregated data at the traditional authority level rather than individual level data. It has been argued in the literature that aggregated data ignores the rich cross-sectional evidence of the individual observations and fails to say anything about the industries, firms and individuals themselves. Moreover, geographical aggregation is arbitrary with respect to the definitions of the spatial observational units. One of the solutions suggested in the literature is to estimate models at a micro level 133 Stellenbosch University https://scholar.sun.ac.za (Arbia, 2016). However, the main challenge with this approach is the size of the spatial weighting matrix, considering that in censuses we are typically working with millions of observations. The current packages available to the researcher can only handle up to 11,000 x 11,000 for Stata 14.1 SE or MP (e.g., Belotti et al., 2013). This may be an area for further study as we explore other available options. 134 Stellenbosch University https://scholar.sun.ac.za Chapter 5 Conclusion 5.1 Introduction This thesis has used multiple data sources to investigate trends in non-monetary dimensions of welfare, labour market outcomes and migration in Malawi. As noted in Chapter 1, the issues are investigated in three separate chapters, namely Chapters 2, 3 and 4. Section 5.2 provides the summary of findings from each of the chapters. In Section 5.3, we discuss the conclusion of the thesis and implications for policy. Research implications are discussed in Section 5.4. Contributions made to the literature are summarised in Section 5.5. Finally, in Section 5.6, we provide suggestions for future research. 5.2 Summary of findings Chapter 2 was dedicated to the analysis of poverty and inequality over time using two non-monetary measures of welfare, namely child nutritional status and household asset ownership. Data for this study was drawn from nationally representative DHS data sets for 1992, 2000, 2004 and 2010. On the one hand, the temporal aspect of the study provided an understanding of long-term economic well-being in Malawi. On the other hand, the spatial aspect gave the profile of welfare across population groups such as regions and urban-rural areas. The study showed that poverty in Malawi has declined tremendously from as high as 80% in 1992 to about 50% in 2010. Having established that welfare has improved over time, the study set out to investigate who benefited from these gains in welfare. This was investigated through pro-poor growth analysis which showed that all population groups benefited but the greatest gains accrued to the poor as opposed to the rich. This finding implies that inequality in Malawi has declined between 1992 and 2010. The study also conducted rankings of welfare that are robust to the choices of the poverty line and the dimension of well-being. For example, we found that poverty and inequality are higher in rural areas than urban areas regardless of whether we use assets or child-nutritional status. Our results show that while welfare differs among regions, the regional differences are not as large as the difference between rural and urban areas. For example, using the asset index, only about 7% of the population is poor in urban areas compared to 53% in rural areas. The gap is huge when contrasted with regional poverty rates of 32.4%, 51.3% and 44.5% for the Northern, Central and Southern regions, respectively. Nevertheless, we conclude that poverty in Malawi is both a rural area and regional problem. Therefore, policy interventions should focus more on reducing poverty in rural areas while not neglecting the regional imbalances. 135 Stellenbosch University https://scholar.sun.ac.za One of the advantages of the measures of poverty and inequality employed in Chapter 2 is that they are decomposable. Poverty decomposition shows that urban areas and households headed by females contribute less to poverty compared to rural areas and households headed by males, respectively. We also decomposed total inequality into between-group (for example, rural and urban) and within-group (for example, rural-urban asset gap) inequality. According to Haughton & Khandker (2009), withingroup inequality is typically at least 75% of total inequality in a given country. Our results confirmed that inequality is indeed higher within groups as opposed to between groups; we find magnitudes of between 65% and 100% depending on the measure of welfare used as well as population group pair. Chapter 3 used the two waves of the IHS3 panel data, namely 2010 and 2013, to analyse returns to education in the wage sector and externalities to education in household enterprises. Three main issues were considered. Firstly, we discussed the various issues that could affect the reliability of the returns and externalities to education. The conclusion was that a robust treatment of outliers and inconsistencies in the data makes it possible to obtain trends in the labour market that are both consistent and reliable. Secondly, we tested the argument that educational attainment is positively associated with poverty reduction through the labour market. This assertion was investigated through an econometric analysis of the role played by education in the determination of the likelihood of labour force participation and employment. After controlling for other factors, we found that individuals with tertiary education are more likely to participate in the labour market than those without education. However, the results showed that those with primary and secondary are less likely to participate in the labour market than those without education. We reason that those without formal education drop out of schooling to immediately enter the labour market, unlike those with primary and secondary education who choose to continue with their education and, therefore, only enter the labour market at a later stage. Moreover, the largest proportion of people without education is engaged in informal sector jobs where earnings are low. We follow a similar line of thinking with the multivariate analysis of employment where we find that those with JCE and MSCE are less likely to get employed compared to those without education. Thirdly, the study investigated if higher levels of schooling or education are associated with higher incomes. Our findings, which are robust to sample selection, yielded large positive returns to years of schooling in Malawi for both wage employment and self-employment in household enterprises. The analysis was repeated with education categories instead of years of schooling. The results did not change, implying that the definition of education does not matter. However, using education categories gives us more information in the sense that we are able to show that the returns to education are heterogeneous rather than homogeneous. Not only do the returns increase by the level of education but are also higher amongst females compared to males. Through these findings, the study demonstrates the 136 Stellenbosch University https://scholar.sun.ac.za importance of distinguishing between groups such as economic sectors and gender when analysing labour market outcomes. In Chapter 4, trends in employment and migration were examined using the 1987, 1998 and 2008 census data which are the only three publicly available census data sets in the country. The study has shown that both geography and time matter in the understanding of long-term patterns of employment and migration. While time is important, we demonstrated that it is not everything we should be controlling for when analysing economic outcomes that are essentially spatial such as employment and migration. The importance of geography is proven by the fact that the magnitude of explanatory coefficients dramatically decline once we take spatial dependencies into account in our regressions. Our analysis was done in two stages. In the first stage, we confirmed that spatial dependencies exist in our data using the two most commonly used tests for spatial autocorrelation, namely the Moran’s I and Geary’s c statistics. Spatial distributions on the map of Malawi showed some heterogeneity and clustering in the variables of interest. The detection of spatial dependence only provided the first step in spatial data analysis since it is possible that such clustering could occur randomly. Therefore, as a second step, we conducted some diagnostic tests in order to understand the type of clustering. This further informed the specification of the spatial regression model. The diagnostic tests showed that it is possible to fit both the spatial lag and error models or their variants with our data. Therefore, our regression results are robust to the choice of model specification from a whole range of the available model options commonly used in spatial panel data analysis. The study also explored the effects of the Malawi Community Based Rural Land Development Project (CBRLDP) on migration and employment. The results show that the land reform policy positively influenced migration patterns and also resulted in increased employment opportunities for the individuals who migrated. These results have implications for policy. By showing that it is possible for people to migrate in the pilot phase, the programme can potentially be scaled up to cover the whole country given the positive benefits of the programme in terms of gains in employment and incomes for the participating households. The World Bank and Government of Malawi already have plans to expand the programme to cover the whole of the country for similar settlements. The results also revealed an interesting gender dimension in the sense that employment effects differ between males and females; not only did agriculture employment improve by larger magnitudes for the female subsample when compared to males, but was also significant, unlike the male subsample where we obtained nonsignificant results. Therefore, we expect the programme to have improved the incomes of women involved in agricultural employment for a living. This finding is important because the Food and Agriculture Organisation (2011) states that if resource gap in developing countries between men and women were to be closed, the latter could potentially increase yields by between 20% and 30% and lift 137 Stellenbosch University https://scholar.sun.ac.za many out of food poverty. Although government employment increased for both the male and female subsamples, the magnitude was larger for males. 5.3 Conclusions The poverty and inequality profiles derived in Chapter 2 suggest that although the welfare changes over time have pro-poor, Malawi remains a poor country in terms of both assets and child-nutritional status. Malawi’s poverty is, therefore, multidimensional. A robust ranking of welfare shows that poverty and inequality levels are highest in rural areas and amongst households headed by females. Rankings for the three regions of Malawi depend on the welfare measure used. While asset poverty is the highest in the Central region, child-nutritional status is highest in the Southern region. With respect to inequality, while no robust ranking is determined with respect to assets, height-for-age z-scores (HAZ) and weight-forage z-scores (WAZ), we are able to conclude that inequality is the lowest in the Northern region when weight-for-height z-scores (WHZ) are used as a measure of welfare. Amongst the three measures of child-nutritional status, HAZ yields the highest levels measures of poverty and inequality. Regardless of the welfare measure used, poverty decompositions seem to suggest that poverty is the highest in groups where the population shares are the largest. For example, rural areas which constitute about 85% of the population, contribute to about 91.4% of the poverty using WHZ as a measure of childnutritional status. Similarly, the Central region and households headed by males contribute more to poverty compared to other population groups. Using multivariate analysis, we have identified factors associated with asset poverty and child nutritional status in Malawi. With respect to child-nutritional status, the study has shown that childindividual characteristics such as age, sex and birth order play an important role in addition to household welfare and the levels of education by both the father and mother. It was also established in Chapter 2 that household characteristics such as household size, dependency ratio, levels of education in the household and sex of the household head are important correlates to asset poverty. Furthermore, asset poverty is geographical and depends on the area of residence. Chapter 3 confirms the human capital theory. First, we show that education indeed improves chances of employment, particularly for those with tertiary education. Secondly, we show that returns to education increase with levels of education. We also find that the returns are higher for females than males. Furthermore, given that wage employment is a small proportion of total employment, we segregate our results by economic sector. Our results show that modelling returns to education based on the formal sector only is misleading as it assumes a homogeneous labour market which is not true for developing countries such as Malawi. Finally, our results show that we should be correcting for sample selection when analysing labour market outcomes in Malawi. 138 Stellenbosch University https://scholar.sun.ac.za In Chapter 4, we confirmed that employment and migration are spatial economic phenomena. Econometric analysis of these variables of interest should ideally, therefore, control for both time and geography. We also showed that, between 1987 and 2008, there has been an increase in labour force participation of the youth in Malawi, particularly those aged between 15 and 19 years. This might have implications for youth unemployment in the country. The analysis of long-term occupational mobility showed that a larger proportion of women remains trapped in agricultural employment compared to males. The results also show that the land reform policy in Malawi caused people to migrate to other areas in search of agricultural land. In terms of the effects on employment, agricultural employment increased for the female subsample while government employment particularly increased for the males. 5.4 Implications of the research The main aim of this research was to contribute to the empirical understanding of poverty, inequality and the labour market in Malawi. Using multiple sources of data which stretch over time, the study shows interesting dynamics of the aforesaid economic phenomena in a country which has been largely static in terms of economic growth and urbanisation. One of the main conclusions one can draw from the study is that Malawi’s poverty profile is a ‘bad picture’ given that almost 50% of the population was still poor in 2010, but a ‘good movie’ in that the incidence of poverty had fallen from as high as 80% in 1992. This holds for both monetary and non-monetary indicators of welfare. A number of implications can be drawn from this research and these may have broader policy applications. Following the structure of the thesis, the implications are discussed separately for each of the three main chapters. Chapter 2 provided research evidence on poverty and inequality in Malawi using two non-monetary dimensions of welfare, namely an asset index and child-nutritional status. The first implication arising from this chapter emanates from the finding that Malawi has very high levels of poverty regardless of whether assets or child-nutritional status are used as the measure of welfare. Other research based on consumption (e.g. Mussa, 2013) derives similar implications with respect to this finding. As shown in Chapter 2, despite improvements over time, the data shows very low levels of asset ownership in Malawi. There is also poor access to sanitation, and poor quality of dwellings as reflected in the floor material, as well as poor water services. These have either remained the same or declined over time. Therefore, despite gains in poverty reduction, Malawi remains a very poor country; addressing this problem remains extremely important. A second implication relates to the finding that there is not much regional variation in terms of both poverty and inequality, as shown by the narrow bands in the poverty and inequality mapping shown in Chapter 2. This suggests that the population subgroups are largely undifferentiated. This homogeneous feature of welfare seems to be a feature that is quite unique to Malawi, as few other developing countries show such small differences in poverty measures across geographic space. Similarly, we find that there 139 Stellenbosch University https://scholar.sun.ac.za are no large differences between male and female-headed households, a finding not so common in most developing countries, including most of Malawi’s neighbours. Perhaps these findings relate to the largely undiversified nature of the livelihoods of the Malawian population. The research further established that rural areas dominate urban areas in terms of poverty. This finding is not surprising, considering that 85% of the population in Malawi lives in rural areas and that urban areas usually offer at least some improved formal sector wage earning opportunities. Nevertheless, it is important to note that despite the rural dominance, poverty is also quite high in urban areas. For example, incidences of stunting in urban and rural areas stand at 40.8% and 48.3%, respectively. Thus, while government policy should focus on rural areas, it is important not to neglect urban areas. In stark contrast with consumption-based inequality estimates normally used in official reports, this study interestingly shows that non-money metric inequality is higher in rural areas compared to urban areas. This is a new finding and responds to the call by some, such as Grosse et al. (2008), who have highlighted the need to extend pro-poor growth analysis to non-monetary dimensions. Clearly, this has implications for how government considers its policy on inequality. It may be necessary to rethink the policy approach to reducing inequality in Malawi. Specifically, policy makers need to explicitly take into account non-income dimensions of welfare when formulating public policy. Chapter 3 used panel data to estimate externalities and returns to education in Malawi. Although the chapter shows that returns to education are positive in both the formal and informal sectors, they are lower in the latter where the majority of the population is employed. Therefore, unless workers move from the informal sector to the formal sector, the benefits from education remain low for the majority of the workforce. A second feature evident from the analysis in Chapter 3 relates to the structure of employment in Malawi. Casual employment and self-employment activities (household enterprises) continue to make up the largest share of total employment both in and outside of agriculture. They jointly constitute at least 75% of total employment. The chapter also shows that Malawi has very high labour force participation rates, ranging between 71% and 91%, depending on whether the broad or narrow definition is used. These traditionally high rates of labour participation can be linked to the existence of a large agricultural sector which in turn makes it difficult to clearly identify actual labour force participation rates, as subsistence activities in the agricultural sector are categorised as labour force participation. 140 Stellenbosch University https://scholar.sun.ac.za The chapter also shows that the labour force in Malawi remains poorly educated; only 17% of individuals in the working age population has more than primary education. The low levels of education imply low productivity levels that partly account for the large earning differentials between the formal and informal sector. Consequently, there is a need to continue investing in education which has been found to yield positive returns for those employed. The fact that returns to education are higher for females than males with similar skills (more particularly at high levels of education) shows that the continued expansion of education for girls should translate into gains in the fight against poverty and the economic empowerment of women. It is further shown in the study that the returns to education in non-farm enterprises are higher when maximum household education is used (signifying externalities) as opposed to the level of education of the owner of the enterprise. This suggests that provision of free primary education and subsidies in higher education can be justified on these grounds. In this case, government’s investment in education is not only the right thing to do but also have much wider benefits for society in terms of poverty reduction and improvements in food security. Furthermore, the presence of education externalities raises issues for further study. For example, motivated by findings emergent from this study, one may want to further investigate how the allocation of education to different household members of different ages or gender involved in various activities would affect household income. Simply put, whose education matters in the determination of household income? The use of panel data has enabled an examination of changes in occupational status as well as movements into and out of the labour market. This is not only important for monitoring trends but also for identifying the sources of changes over time. For example, from the data, it is possible to identify individuals who are employed or unemployed in both waves. This is not possible with cross-sectional data. Consequently, panel data not only provided transparent ways of dealing with data issues but also allow a better understanding of the economic phenomena of interest, including how earnings have changed between surveys. The data shows that compared to those only employed in either wave, the initial earnings amongst individuals employed in both years were not only higher (almost double as much) but also increased by a greater magnitude. Specifically, the earnings of those employed in both waves increased by 46% between 2010 and 2013. While most of the individuals employed in both waves were educated, the majority of those unemployed had no education. This seems to suggest that education not only improves chances of finding employment but also that higher education is associated with higher earnings. Furthermore, with panel data, it was possible to control for individual heterogeneity since the individuals served as their own controls. Chapter 4 looked at migration and employment in Malawi using spatial data. The government has generally regarded the movement of people within Malawi in the light of rural-urban migration which 141 Stellenbosch University https://scholar.sun.ac.za is seen as a natural and inevitable process. Policies have, therefore, tended to focus on developing rural centres to prevent massive movements of people into urban areas, particularly the four main urban centres, namely Mzuzu, Lilongwe, Blantyre and Zomba. However, given the low levels of urbanisation, rural-urban migration is not as massive as perceived. On the contrary, the observed patterns of migration in Malawi are largely rural-rural. Specifically, while most people stay in the home regions there are significant movements across districts. This points to the continued dominance of rural livelihoods, even for those in search of better opportunities. Furthermore, the study has shown that the land reform policy enabled people to move from the rural areas of sending districts into the rural areas of receiving districts. This policy has had large implications for development because of its effects on employment. Consequently, government policy needs to go beyond simply providing access to land because this may potentially create as many problems as it seeks to solve. On a more substantive level, deliberate policies need to be put in place to support agricultural production in addition to the provision of markets and other necessary amenities. In Chapter 2, it was shown that there are no major differences between regions in terms of economic welfare. However, there exists some heterogeneity in terms of economic indicators when small-level geographical data is used as shown in the spatial maps in Chapter 4. Specifically, while there seems to be broad uniformity across the main regions, a descriptive spatial analysis of employment, schooling, fertility and assets shows some heterogeneity at the smallest geographical units, namely traditional authorities. One implication of this finding is that policy should not be based on the average picture but instead needs to be tailored to the small geographical areas. In addition, visualisation of spatial is a powerful tool and can be used to aid policies that are specifically adapted for small geographical units. Some studies (e.g. Gaddis & Klasen, 2014) link fertility rates to economic development. Chapter 2 of this thesis has shown that fertility rates in Malawi have dropped between 1987 and 2008. Although this is the broad trend for the population, fertility rates have increased by almost 2% amongst women aged between 20 and 24 years, contrary to what is desired; this may call for policy action. High fertility rates are known to have effects on education; education expenditures per child tend to be lower in large families. Furthermore, high fertility rates may also negatively affect the supply of labour of the parents, especially when the children are still young. This may, in turn, have a negative effect on the ability of families to save for the children. The analysis of the long-term patterns of labour force and population in Malawi shows that the percentage of the youth participating in the labour force has been on the increase, particularly for females. These observed changes in the age structure of the population and of labour force participation have significant implications for the supply of labour and therefore also the need for economic growth. 142 Stellenbosch University https://scholar.sun.ac.za There is a need for the corresponding improvement in education and skills development. The youth, particularly those aged between 20 and 24 years, constitutes the largest percentage of the unemployed. Youth unemployment has implications for development. For example, as education continues to expand, many of those educated may fail to find employment due to increased competition for scarce formal sector jobs. Even those who eventually enter into employment may actually end up in informal employment and subsistence agriculture where the earnings are low. Another important implication of the research is that it provides a platform for further spatial analysis in Malawi. Specifically, the study has demonstrated the feasibility of creating GIS codes that are consistent over time. This is an important step towards linking census data with other national data sets for similar analysis at the lowest geographical unit. This thesis has thus shown that Malawi, one of the poorest countries in the world, has many of the features of poor countries, but also some features that make it quite unique. These include the relatively undifferentiated economic activity in rural areas, which lies at the heart of the limited economic and spatial differentiation across the country as a whole. This thesis has also shown that there has been considerable pro-poor growth in Malawi, something that is to be commended and holds promises that future growth too may benefit Malawi’s many poor people. As the country develops, however, it is likely that differentiation would increase and that heterogeneity would become the norm in the economic field as in the labour market. The data used in this study offer a useful first attempt at evaluating the features and consequences of economic progress. Further work on new datasets that become available over time would be beneficial for following the trajectory of growth and development. 5.5 Summary of contributions Previous studies on poverty and inequality in Malawi have been static in nature, due to data limited availability. This study was able to conduct more inter-temporal analysis that was made possible by increased data availability. Chapter 2 has contributed to the literature by examining trends in poverty through pro-poor growth analysis which allows us to understand the distribution of the gains from reductions in poverty and inequality over time. More generally, this is the first attempt to apply an asset index to the measurement of poverty in Malawi. While it is generally understood that poverty is multidimensional, there has been a dearth of literature dedicated to the empirical understanding of asset poverty and how the results compare with what is already known with respect to other dimensions such as consumption, education and nutritional status. Until the release of the IHS3 panel data (IHSP), there has been a lack of panel data in Malawi. Therefore, previous evidence on education and earnings has been based on cross-sectional data. In Chapter 3, we take advantage of the recently made available panel survey results set to explore the interesting dynamic 143 Stellenbosch University https://scholar.sun.ac.za aspects of the observations as well as control for unobservable individual heterogeneity. Furthermore, we conduct robust treatment of outliers and bring to light some of the inconsistencies in the data. These are some of the issues that new users of the data will have to look out for in order to ensure quality data before further analysis. This more generally applies to most datasets in less developed countries with a large informal sector. The main contribution in Chapter 4 is the application of spatial econometric methods to the understanding of employment and migration in Malawi. For example, by comparing results based on spatial effects to those based on non-spatial regression analysis, we demonstrate that studies that do not take into account spatial effects tend to overestimate other coefficients. Potentially, this study provides an exploratory platform for further spatial analysis in Malawi. In order to achieve this, we have matched GIS codes that are consistent over time with the view to link census data with other data sets for similar spatial analysis. This kind of data integration is an area for further study, as noted in the ensuing section. 5.6 Future research Chapter 2 dealt with the dynamic aspects of poverty and inequality in Malawi by examining trends over time. However, since we are not tracking the same households, our results only show the overall population changes in welfare that have occurred over time. Another important aspect of the dynamic nature of poverty considered in the literature is the identification of the proportions of households or individuals that are either chronically or transiently poor. This kind of analysis, which can only be achieved with panel data, has not been done in this study. Future research can take advantage of the IHSP data which has been recently released by the National Statistical Office. Furthermore, in Chapter 3, we identified measurement error in the independent variables as one of the possible sources of bias when estimating returns and externalities to education. The most commonly suggested method of dealing with this kind of measurement error in the literature is differencing (short and longer differencing). Unfortunately, this approach only works for panels with at least three waves of data panel, which is not available in Malawi at the moment. However, as more waves are added to the IHSP in the future, there would be the possibility of accounting for measurement error in the independent variables. Another area for future research relates to the treatment of individuals with zero earnings. Our approach has been to exclude from the analysis observations with zero incomes. One of the criticisms of this approach is that it may affect the reliability of the estimates (Yu, 2012), especially if the observations are many. Therefore, researchers sometimes consider an alternative of adding a very small number to earnings before taking the log, which would allow considering zero incomes. This option is not considered in this study because zero incomes constitute about 10% percentage in our data which we 144 Stellenbosch University https://scholar.sun.ac.za believe is large and may, therefore, eventually end up distorting the distribution of the data. As a further study, one may set out to investigate the alternative treatment of zero earnings not only for the panel data set but also for other data sets in the country including consumption figures. The spatiotemporal analysis carried out in Chapter 4 can be generally applied to economic phenomena by matching GIS census data with other data in Malawi. This constitutes an area for future research. With the development of geosystems, it can be expected that geo-coded data will become more readily available in Malawi. This will make spatial analysis possible on the new datasets, but the richness of such data could perhaps be further enhanced by adding information from the census for smaller geographical areas. Thus, this study acts as a platform for further spatial analysis. Another extension of this study would be to conduct spatial analysis using individual level data instead of aggregated data as is the case in the present study. However, this approach requires working with very large matrices involving millions of observations, as is typically the case with census data. The current Stata packages only handle smaller matrix sizes as explained in Section 4.5. Consequently, use of individual data for spatial analysis is dependent on the identification of methodologies and tools that overcome this limitation. 145 Stellenbosch University https://scholar.sun.ac.za List of references Alkire, S. & Santos, M.E. (2010). Acute Multidimensional Poverty: A New Index for Developing Countries (OPHI Working Paper No. 38). Retrieved from https://www.ophi.org.uk/wpcontent/uploads/ophi-wp38.pdf. Anselin, L. (1992b). Spatial Data Analysis with GIS: An Introduction to Application in the Social Sciences. National Center for Geographic Information and Analysis, University of California, Santa Barbara. Anselin, L. (2003). Spatial Econometrics, in Baltagi, B.H. (Ed.), A Companion to Theoretical Econometrics. Malden, Massachusetts, USA: Blackwell Publishing Ltd. doi: 10.1002/9780470996249.ch15. Appleton, S., Bigsten, A. & Manda, D.K. (1999). Educational Expansion and Economic Decline: Returns to Education in Kenya, 1978-95 (Working Paper WPS/99-6). Centre for the Study of African Economies, University of Oxford. Araar, A. & Duclos, J.Y. (2013). DASP: Distributive Analysis Stata Package [Computer Software]. Laval University, World Bank, PEP and CIRPÉE. Arbia, G. (2016, June). Spatial econometrics: a broad view. Paper presented at the X World Conference Spatial Econometrics Association (SEA), Rome, Italy. Baltagi, B. H. & Li, Q. (1990). A Lagrange multiplier test for the error components model with incomplete panels. Econometric Reviews 9 (1), 103–7. Baltagi, B.H. (2013). Panel Data Methods. Handbook of Applied Economic Statistics. New York. Basu, K. & Foster, J. E. (1998). On measuring literacy, Economic Journal, 108(451), 1733-49. Becker, G. S. (1962). Investment in human capital: A theoretical analysis. The journal of political economy, 9-49. Belotti, F., Hughes,G. & Mortari, A.P.(2014). “XSMLE: Stata module for spatial panel data models estimation," Statistical Software Components S457610, Boston College Department of Economics. Bennell, P. (1996). Rates of Return on Education: Does the Conventional Pattern Prevail in SubSaharan Africa. World Development, 24 (1), 183-99. Blundell, R. & Preston, I. (1998). Consumption inequality and income uncertainty. Quarterly Journal of Economics. 113(2), 604-40. 146 Stellenbosch University https://scholar.sun.ac.za Bokosi, F.K. (2006). Household poverty dynamics in Malawi: A bivariate probit analysis. Journal of applied sciences, 7 (2), 258-62. Booysen, F., Van der Berg, S., Burger, R., Von Maltiz, M., & Du Rand, G. (2008). Using an Asset Index to Assess Trends in Poverty in Seven Sub-Saharan African Countries. World Development, 36 (6), 1113-1130. Bourguignon, F. & Chakravarty, S.R. (2003). The measurement of multidimensional poverty. Journal of Economic Inequality, 1 (1), 25-49. Burger, R. & Yu, D. (2007). Wage trends in post -Apartheid South Africa: Constructing an earnings series from household survey data (Working Paper 07/117). Development Policy Research Unit, University of Cape Town. Card, D. (1999). The causal effect of education on earnings. Handbook of labor economics, 3, 18011863. Casale, D. & Desmond, C. (2015). Recovery from stunting and cognitive outcomes in young children: Evidence from the South African Birth to Twenty Cohort Study. Journal of developmental origins of health and disease, available on CJO2015. doi:10.1017/S2040174415007175. Chen, M. A. (2012). The informal economy: Definitions, theories and policies. Women in informal economy globalizing and organizing: WIEGO Working Paper, 1. Chirwa, E. & Matita, M. (2009). The Rate of Return on Education in Malawi (Department of Economics Working Paper No. 2009/01). University of Malawi, Chancellor College. Chirwa, E.W. & Ngalawa, H.P.E. (2008). Determinants of Child Nutrition in Malawi. South African Journal of Economics, 76(4), 628-40. Chirwa, E.W. & Zgovu, E.K. (2002). Does the Return to Schooling Depend on the Type of Employment? Evidence from the Rural Labour Market in Malawi. Chinsinga, B. (2008). Exploring the politics of land reforms in Malawi: A case study of the Community Based Rural Land Development Programme (CBRLDP). University of Manchester: Manchester, UK. Clark, T.S. & Linzer, D. A. (2015). Should I Use Fixed or Random Effects? Political Science Research and Methods, 3(2), 399–408. Crow, K. (2015). SHP2DTA: Stata module to converts shape boundary files to Stata datasets. Cooke, T. J. (2013). Internal migration in decline. The Professional Geographer, 65(4), 664-675. Da Maia, C. (2012). Understanding Poverty and Inequality in Mozambique. The Role of Education and Labour Market Status (Doctoral thesis, Stellenbosch University, South Africa). Retrieved from http://scholar.sun.ac.za/handle/10019.1/71857. Dancer, D., Rammohan, A. & Smith, M. D. (2008). Infant mortality and child nutrition in Bangladesh. Health Economics, 17(9), 1015–35. doi:10.1002/hec.1379. 147 Stellenbosch University https://scholar.sun.ac.za Davidson, R., & Duclos, J. Y. (2000). Statistical inference for stochastic dominance and for the measurement of poverty and inequality. Econometrica, 68(6), 1435-1464. Deaton, A. (1997). The analysis of household surveys: A microeconomic approach to development policy. Baltimore: The John Hopkins University Press. Duclos, J. & Araar, A. (2006). Poverty and equity: Measurement, policy and estimation with DAD (1st ed.). Ottawa: Springer. Duclos, J.Y. & Verdier-Chouchane, A. (2010). Analyzing Pro-Poor Growth in Southern Africa: Lessons from Mauritius and South Africa (Working Papers Series No. 115). African Development Bank, Tunis, Tunisia. Ewoudou, J. & Vencatachellum, D. (2006). An Empirical Analysis of the Rates of Returns to Education in Cameroon. Paper presented at the Centre for the Study of African Economies Conference, University of Oxford, Oxford. Food and Agriculture Organisation. (2011). The state of food and agriculture: Women in agriculture- closing the gender gap for development. Rome, Italy: FAO. Retrieved from http://www.fao.org/docrep/013/i2050e/i2050e.pdf. Foster, J., Greer, J., & Thorbecke, E. (1984). A Class of Decomposable Poverty Measures. Econometrica, 52(3), 761-66. doi:1. Retrieved from http://www.jstor.org/stable/1913475 doi:1 Gaddis, I. & Klasen, S. (2014). "Economic development, structural change, and women’s labour force participation:," Journal of Population Economics, Springer, 27(3), 639-81. Geary, R. (1954). The contiguity ratio and statistical mapping. The Incorporated Statistician, 5 (3), 115–45. Government of Malawi. (2007). Malawi Growth and Development Strategy I: From Poverty to Prosperity (2006-2011). Lilongwe, Ministry of Economic Planning and Development. Retrieved from https://www.imf.org/external/pubs/ft/scr/2007/cr0755.pdf. Government of Malawi. (2012). Malawi Growth and Development Strategy II (2011-2016). Lilongwe, Ministry of Finance and Development Planning. Retrieved from https://www.imf.org/external/pubs/ft/scr/2012/cr12222.pdf. Gondwe, A.S. (2011). Spatial Comparisons of Multidimensional Poverty in Malawi (Unpublished Master’s thesis). Chancellor College, University of Malawi, Malawi. Greenacre, M. (2007). Correspondence analysis in practice. CRC press. Greenwood, M. J. (2005). Modeling migration. Encyclopedia of social measurement, 2, 725-734. Grosse, M., Harttgen, K., & Klasen, S. (2008). Measuring pro-poor growth in non-income dimensions. World Development, 36(6), 1021-1047. 148 Stellenbosch University https://scholar.sun.ac.za Harris, J. R., & Todaro, M. P. (1970). Migration, unemployment and development: a two-sector analysis. The American economic review, 126-142. Haughton, J. & Khandker, S.R. (2009). Handbook on poverty and inequality. Washington: The World Bank. Heckman, J. (1979). Sample Selection as a Specification Error. Econometrica, 47 (1), 15361. Human Development Report. (2015. Work For Human Development: Retrieved from http://hdr.undp.org/sites/default/files/2015_human_development_report.pdf. Hussmanns, R. (2004, February). Statistical definition of informal employment: Guidelines endorsed by the Seventeenth International Conference of Labour Statisticians (2003). In 7th Meeting of the Expert Group on Informal Sector Statistics (Delhi Group) (pp. 2-4). International Labour Organization. (2011). ILO estimates and projections of the economically active population: 1990–2020 (6th ed.). Retrieved from http://laborsta.ilo.org/applv8/data/EAPEP/eapep_E.html. Jeanty, P.W. (2010). spmlreg: Stata module to estimate the Spatial lag, the Spatial error, the Spatial Durbin, and the General Spatial models. Jolliffe, D. (2002). Whose education matters in the determination of household income? Evidence from a developing country. Economic Development and Cultural Change, 50(2), 287-312. Kahyarara, G. & Teal, F. (2008).The Returns to Vocational Training and Academic Education: Evidence from Tanzania. World Development, Elsevier, 36(11), 2223-42. Kakwani, N. (1980). On a class of poverty measures. Econometrica: Journal of the Econometric Society, 437-446. Kakwani, N., & Pernia, E. M. (2000). What is pro-poor growth? Asian development review, 18(1), 116. Kerr, A. & Teal, F.(2015). The Determinants of Earnings Inequalities: Panel data evidence from KwaZulu-Natal, South Africa. Journal of African Economies, 24(4), 530-58. Khandker, S.R., Koolwal, G.B. & Samad, H.A. (2010) Handbook on impact evaluation: quantitative methods and practices. Washington, DC: The World Bank. Kingdon, G. & Knight, J. (2004). Unemployment in South Africa: The Nature of the Beast. World Development, 32 (3), 391-408. Lassibille, G. & Tan, J. (2005).The Returns to Education in Rwanda. Journal of African Economies, 14 (1), 92-116. Linnemayr, S., Alderman, H. & Ka, A. (2008). Determinants of malnutrition in Senegal: Individual, household, community variables, and their interaction. Economics & Human Biology, 6 (2), 252-63. 149 Stellenbosch University https://scholar.sun.ac.za Matita, M. & Chirwa, E. (2009). The Impact of Education on Self-Employment, Farm Activities and Household Incomes in Malawi. (Department of Economics Working Paper No. 2009/02). University of Malawi, Chancellor College. McKay, A. (2000). Should the survey measure total household income? In Grosh, M. and Glewwe, P. (Eds.), Designing household survey questionnaire for developing countries: Lessons from 15 years of the living standards measurement study (pp. 83 – 104). Washington: The World Bank. Mincer, J. (1974). Schooling, Experience and Earnings. New York: National Bureau of Economic Research. Retrieved from http://papers.nber.org/books/minc74-1. Minnesota Population Center. (2015). Integrated Public Use Microdata Series, International: Version 6.4 [Machine-readable database]. Minneapolis: University of Minnesota. Retrieved from https://international.ipums.org/international-action/variables/group. Moran, P. (1948). The interpretation of statistical maps. Journal of the Royal Statistical Society, Series B (Methodological), 10 (2), 243–51. Murkherjee, S. & Benson, T. (2003). The determinants of poverty in Malawi, 1998. World Development, 31(2), 339-358. Mussa, R. (2013). Spatial Comparisons of Poverty and Inequality in Living Standards in Malawi. South African Journal of Economics, 81(2), 192-210. Mussa, R. (2014). A matching decomposition of the rural–urban difference in malnutrition in Malawi. Health Economics Review, Springer, 4(1), 1-10. Mussa, R. (2014). Externalities of Education on Efficiency and Production Uncertainty of Maize in Rural Malawi (MPRA Paper 54628). University Library of Munich, Germany. National Statistical Office. (2008). Population and Housing Census Report. Zomba, Malawi. Retrieved from http://www.nsomalawi.mw/images/stories/data_on_line/demography/census_2008/Main%2 0Report/ThematicReports/Migration%20Report.pdf. National Statistical Office. (2012). Integrated Household Survey 2010-2011: Household SocioEconomic Report. Zomba, Malawi. Retrieved from http://www.nsomalawi.mw/images/stories/data_on_line/economics/ihs/IHS3/IHS3_Report. pdf. 150 Stellenbosch University https://scholar.sun.ac.za National Statistical Office. (2012). Malawi Labour Force Survey Report 2013. Zomba, Malawi. Retrieved from http://www.nsomalawi.mw/component/content/article/8/209-malawilabour-force-survey-2013.html. National Statistical Office. (2014). Integrated Household panel Survey 2010-13: Household SocioEconomic Characteristics Report. Zomba, Malawi. Retrieved from http://www.nsomalawi.mw/images/stories/data_on_line/economics/ihs/IHPS%202013/IHP S%20Report.pdf. National Statistical Office. (2015). Welfare Monitoring Survey 2014. Retried from http://www.nsomalawi.mw/images/stories/data_on_line/agriculture/wms_2014/Welfare%20 Monitoring%20Survey%202014.pdf. National Statistical Office & ICF Macro. (2011). Malawi Demographic and Health Survey 2010. Zomba, Malawi, and Calverton, Maryland, USA. Retrieved from http://www.dhsprogram.com/pubs/pdf/FR247/FR247.pdf. Organisation for Economic Co-operation and Development. (2008). 2008 Survey on Monitoring the Paris Declaration: Making Aid More Effective by 2010. OECD Publishing, Paris. doi: 10.1787/9789264050839-en. Pisati, M. (2001). Tools for Spatial Data Analysis. Stata Technical Bulletin. March 2001. STB60:21-36. Retrieved from http://www.stata.com/products/stb/journals/stb60.pdf. Pisati, M. (2012). Spatial Data Analysis in Stata: An Overview. Department of Sociology and Social Research. University of Milano-Bicocca (Italy). Retrieved from http://www.stata.com/meeting/italy12/abstracts/materials/it12_pisati.pdf. Pischke, J. S., & Schwandt, H. (2014). Poorly Measured Confounders are More Useful on the Left Than on the Right (Working Paper). London School of Economics. Psacharopoulos, G. (1994). Returns to Investment in Education: A Global Update, World Development, 22 (9), 1325-43. Ravallion, M. (2011). On multidimensional indices of poverty. The Journal of Economic Inequality, 9(2), 235-248. Ravallion, M., & Chen, S. (2003). Measuring pro-poor growth. Economics letters, 78(1), 93-99. Reserve Bank of Malawi. (2015). Financial and Economic Review, 49 (4). Retrieved from https://www.rbm.mw/Publications/EconomicReviews/. Roy, A. D. (1951). Some thoughts on the distribution of earnings. Oxford economic papers, 3(2), 135-146. Rutstein, S.O. & Johnson, K. (2004). The DHS Wealth Index (DHS Comparative Reports No. 6). Calverton, Maryland: ORC Macro. 151 Stellenbosch University https://scholar.sun.ac.za Sahn, D.E. & Stifel, D.C., 2003. Urban-rural Inequality in Living Standards in Africa. Journal of African Economies, 12 (4), 564-97. Sen, A. (1976). Poverty: an ordinal approach to measurement. Econometrica: Journal of the Econometric Society, 219-231. Sen, A. (1985). Commodities and Capabilities. North Holland, Amsterdam. Sen, A. (1987). The Standard of living. In McMurrin, S. (Ed), Tanner lectures on Human Values, Cambridge: Cambridge University Press. Retrieved from http://tannerlectures.utah.edu/_documents/a-to-z/s/sen86.pdf. Singer, H., & Jolly, R. (1972). Employment, Incomes and Equality: A Strategy for Increasing Productive Employment in Kenya. Geneva: ILO. Sokal, R. R., Oden, N. L. & Thomson, B. A. (1998), Local Spatial Autocorrelation in a Biological Model. Geographical Analysis, 30: 331–54. doi: 10.1111/j.1538-4632.1998.tb00406.x StataCorp. (Version 13.1). Stata Statistical Software [Computer Software]. College Station, Texas: Stata Corporation. StataCorp. (Version 14.1). Stata Statistical Software [Computer Software]. College Station, Texas: Stata Corporation. Tobler, W. (1970). A Computer Movie Simulating Urban Growth in the Detroit Region. Economic Geography, 46, 234-40. doi:1. Retrieved from http://www.jstor.org/stable/143141 doi:1 Verduzco-Gallo, I., Ecker, O., & Pauw, K. (2014). Changes in Food and Nutrition Security in Malawi. IFPRI Working Paper 06. Villa, J.M. (2011). DIFF: Stata Module to Perform Differences in Differences Estimation. Statistical Software Components. Boston College Department of Economics. Willekens, F. (2008). Models of migration: Observations and judgement. International migration in Europe: Data, models and estimates, 117-147. Williams, R. (2015). Panel Data 4: Fixed Effects vs. Random Effects Models. University of Notre Dame. Retrieved from https://www3.nd.edu/~rwilliam/stats3/Panel04-FixedVsRandom.pdf. Wittenberg, M. (2014). Wages and wage inequality in South Africa 1994-2011: The evidence from household survey data. (SALDRU working paper No. 135). Southern Africa Labour and Development Research Unit, University of Cape Town. Wittenberg, M., Leibbrandt, M. (2015). Measuring Inequality by Asset Indices: A general approach with application to South Africa. (SALDRU Working Paper No. 141). Southern Africa Labour and Development Research Unit, University of Cape Town. Wooldridge, J. (2002). Econometric Analysis of Cross Section and Panel Data. Cambridge: The MIT Press. 152 Stellenbosch University https://scholar.sun.ac.za World Bank. (2004). Poverty reduction strategies papers sourcebook. Washington, D.C.: World Bank. World Bank. (2012). Malawi - Community-Based Rural Land Development Project. Retrieved from http://documents.worldbank.org/curated/en/2012/03/16215843/malawi-community-based-ruralland-development-project. World Health Organisation. (2006). WHO Child Growth Standards: Methods and development. Retrieved from http://www.who.int/childgrowth/standards/Technical_report.pdf. Yu, K.C.D. (2012). Using household surveys for deriving labour market, poverty and inequality trends in South Africa (Doctoral thesis, Stellenbosch University, South Africa). Retrieved from http://scholar.sun.ac.za/handle/10019.1/71638. 153 Stellenbosch University https://scholar.sun.ac.za Appendices Appendix A1: Coefficient comparison test The coefficient comparison was achieved by seemingly unrelated regression (SURE) which takes the following code form in Stata 13.1: .svy: regress lwage edu x if Male .estimates store Male .svy: regress lwage edu x if! Male .estimates store Female .suest Male Female .test_b [Male:edu]=_b[Female: edu] .test_b [Male:x]=_b[Female: edu] Since suest (and its alternatives) do not work with xtreg , we conduct the coefficient comparison tests for pooled Ordinary Least Squares (OLS) only. Appendix A2: Dealing with data inconsistencies In order to ensure data quality, we created a salary payment period that is consistent across the waves using the following Stata code (in Stata 13.1) for the formal and informal sectors, respectively: .bys PID: egen period1=max (salary_period) .bys PID: egen period2=max (ganyu_dys) It does not matter whether we use maximum (max) or minimum (min). The most important thing is to come up with a period that is consistent across waves. Otherwise, the data will show that wages increased while in actual fact it is as a result of changes in the period of payment. 154 Stellenbosch University https://scholar.sun.ac.za Figure A 1: FISP evaluation panel data on real ganyu wages Source: Farm Input Subsidy Programme (FISP) evaluation panel data 155 Stellenbosch University https://scholar.sun.ac.za Table A 1: Table showing the distribution of households by year and population subgroups 2010 2004 2000 1992 Description Number Percent Number Percent Number Percent Number Percent Areas Urban 4,116 16.6 2,262 16.6 1,949 13.7 603 11.3 Rural 20,709 83.4 11,402 83.5 12,264 86.3 4,720 88.7 Region Northern 2,716 10.9 1,584 11.6 1,496 10.5 589 11.1 Central 10,627 42.8 5,589 40.9 5,744 40.4 2,043 38.4 Southern 11,482 46.3 6,491 47.5 6,973 49.1 2,691 50.6 Residence Rural North 2,476 10.0 1,325 9.7 1,215 8.6 529 9.9 Rural Centre 8,876 35.8 4,615 33.8 5,049 35.5 1,822 34.2 Rural South 9,356 37.7 5,462 40.0 6,000 42.2 2,369 44.5 Urban North 240 1.0 259 1.9 281 2.0 60 1.1 Urban Centre 1,751 7.1 974 7.1 695 4.9 221 4.2 Urban South 2,126 8.6 1,029 7.5 973 6.9 322 6.1 Sex Male head 17,857 71.9 10,295 75.3 10,431 73.4 4,013 75.4 Female head 6,968 28.1 3,369 24.7 3,782 26.6 1,310 24.6 Total 24,825 100 13,664 100 14,213 100 5,323 100 Table A 2: Table showing the distribution of children by year and population subgroups 2010 2004 2000 1992 Description Number Percent Number Percent Number Percent Number Percent Age 0-23 2,050 42.7 4,083 46.9 4,497 46.11 1,591 47.45 2,751 57.3 4,624 53.1 5,256 53.89 1,762 52.55 24-59 Sex 2,331 48.6 4,355 50.0 4,821 49.4 1,676 50.0 Male Female 2,470 51.4 4,352 50.0 4,932 50.6 1,677 50.0 Areas Urban 721 15.0 1,129 13.0 1,272 13.0 360 10.7 Rural 4,080 85.0 7,578 87.0 8,481 87.0 2,993 89.3 Region Northern 514 10.7 1,174 13.5 1,064 10.9 399 11.9 Central 2,226 46.4 3,417 39.3 4,229 43.4 1,387 41.4 Southern 2,061 42.9 4,116 47.3 4,460 45.7 1,567 46.7 Residence Rural North 460 9.6 1,001 11.5 855 8.8 354 10.6 Rural Centre 1,903 39.6 3,024 34.7 3,726 38.2 1,235 36.8 Rural South 1,717 35.8 3,553 40.8 3,899 40.0 1,404 41.9 Urban North 54 1.1 173 2.0 209 2.1 45 1.3 Urban Centre 323 6.7 393 4.5 502 5.2 152 4.5 Urban South 344 7.2 563 6.5 561 5.8 163 4.9 Total 4,801 100 8,707 100 9,753 100 3,353 100 156 Stellenbosch University https://scholar.sun.ac.za Table A 3: WHZ regression results by age category Age<24 months Variables Age>=24 months Coeff SE Coeff SE Age in months -0.080** (0.029) 0.038 (0.025) Square of age 0.286* (0.114) -0.050 (0.029) Female child 0.036 (0.087) -0.162** (0.054) Child is twin -0.417 (0.250) -0.101 (0.163) Birth order number 0.045 (0.027) 0.032* (0.013) Incomplete primary 0.200 (0.146) 0.059 (0.093) Complete primary -0.009 (0.209) 0.186 (0.126) Incomplete secondary 0.173 (0.201) 0.144 (0.131) Complete secondary 0.480 (0.252) 0.257 (0.201) Post-secondary -0.089 (0.763) 0.012 (0.370) Incomplete primary -0.052 (0.164) -0.028 (0.099) Complete primary 0.084 (0.218) 0.096 (0.135) Incomplete secondary -0.141 (0.190) -0.028 (0.113) Post-secondary 0.476 (0.317) -0.331 (0.288) Asset index -0.002 (0.086) 0.000 (0.060) Square of asset index 0.007 (0.023) 0.009 (0.016) Rural area -0.163 (0.142) -0.042 (0.109) Central -0.162 (0.131) -0.044 (0.094) Southern -0.215 (0.124) -0.142 (0.094) 0.652 (0.358) -0.127 (0.517) Mother's education Father's education Region Constant R-squared 0.026 0.017 Prob > F 0.004 0.097 Observations 1,863 2,668 Notes: *, **, *** denote significance at 10%, 5% and 1% levels 157 Stellenbosch University https://scholar.sun.ac.za Table A 4: List of regions, districts and traditional areas consistent over time Region District Traditional area Northern Region Chitipa Chitipa Boma, Kameme, Mwabulambya Mwenemisuku Nyika National Park, Mwenewenya Karonga SC Mwakaboko, Silupula Karonga, Kyungu Wasambo Karonga Boma SC Mwirang'ombe Nkhata Bay, Likoma Nkhata Bay Boma, Mkumpa, SC Mkumbira, Mankhambira Malenga Mzoma, SC Fukamalaza, SC Malanda Kabunduli Timbiri Fukamapiri, SC Zilakoma Boghoyo, SC Mkondowe, SC Nyaluwanga, Musisya Rumphi Rumphi Boma, SC Mwahenga, Katumbi, SC Zolokere, Vwaza Marsh Game, Chikulamayembe SC Chapinduka, Mwamlowe, SC Mwankhunikira SC Kachulu, SC Mwalweni Mzimba SC Jaravikuba Munthali, Mtwalo Chindi M'mbelwa Mzikubola Mabulabo SC Kampingo Sibande Vwaza Marsh Game Reserve, Mpherembe SC Khosolo Gwaza Mzukuzuku Mzimba Boma, Mzuzu city Central Region Kasungu Wimbe Santhe Chulu SC Chisikwa, Kaluluma SC Kawamba 158 Stellenbosch University https://scholar.sun.ac.za Kaomba and Kasungu national park Kasungu Township SC Njombwa SC Chilowamatambe Kapelula SC M'nyanja SC Lukwa SC Simlemba Nkhota Kota SC Kafulazira, Kanyenda Mwadzama Nkhota Kota Game Reserve, Malenga Chanzi SC Mphonde Nkhota Kota Boma Mwansambo Ntchisi Ntchisi Boma, Kalumo SC Chilooko Kasakula, SC Nthondo Chikho Dowa SC Chakhaza SC Kayembe Chiwere Dowa Boma, Mkukula Dzoole Msakambewa Mponela Urban, SC Mponela Salima SC Msosa, Khombedza, Kuluunda Chipoka Urban, SC Kambalame, Ndindi Maganga Salima Township SC Kambwiri SC Mwanza Pemba Lilongwe Chiseka Kalolo Kabudula Chadza 159 Stellenbosch University https://scholar.sun.ac.za Khongoni Mazengera Malili Chimutu Kalumbu SC Mtema SC Chitekwele SC Njewa Tsabango Chitukula Kalumba Lilongwe city Mchinji Mchinji Boma, Zulu SC Mavwere Mkanda SC Mduwa SC Dambe Mlonyeni Dedza Kaphuka Pemba Kachindamoto Kasumbu Tambala SC Chilikumwendo SC Kamenya Gwaza Dedza Township SC Chauma Ntcheu Goodson Ganya Champiti, Makwangwala Ntcheu Boma, Kwataine Phambala Njolomole Mpando Chakhumbira Masasa Southern Region Mangochi Palm Beach forest, Monkey Bay urban 160 Stellenbosch University https://scholar.sun.ac.za Chimwala Mponda Chowe SC Mbwana Nyambi Jalasi Makanjila Katuli Mangochi Boma SC Namabvi Machinga Kawinga Liwonde National Park, Liwonde, Lake Malawi national park SC Chikweo Nyambi and SC Chiwalo SC Mlomba Machinga Boma, Sitola, Liwonde Township SC Ngokwe SC Mposa Chamba Zomba SC Mbiza Mlumbe Mwambo Kuntumanji, Mkumbira Malemia Chikowi Zomba municipality Chiradzulu Chiradzulu Boma, Chitera, Mpama Kadewere Likoswe Nkalo Mchema Blantyre Kapeni Kuntaja Somba Chigeru Makata, Machinjili Kunthembwe 161 Stellenbosch University https://scholar.sun.ac.za Lundu Blantyre city Mwanza Mwanza Boma, Nthache Kanduku Symon Mlauli Dambe Ngozi Thyolo Luchenza Township, Chimaliro Bvumbwe Thyolo Boma, Nchilamwela Kapichi SC Thukuta, Nsabwe SC Mphuka SC Kwethemule SC Mbawela Changata Thomas Mulanje Mabuka Nkanda, Mulanje Mountain Mulanje Boma, Chikumbu Juma Laston Njema Nthiramanja Phalombe Phalombe Boma, Mkhumba Nazombe Chikwawa Ngabu Urban, Ngabu Chapananga, Lengwe National Park Mankhwira Lundu Chikwawa Boma, Katunga Kasisi, Majete Game Reserve Maseya Nsanje Mlolo 162 Stellenbosch University https://scholar.sun.ac.za Malemia, Tengani, Mwabvi Game Reserve SC Mbenje SC Makoko, Nyachikadza, Ndamera Chimombo, Ngabu Nsanje Boma Balaka Msamala and Balaka town Kalembo Source: Own compilation from census data Table A 5: Results showing spatial dependencies in variables for 1987 Variables Moran's I SD p-value Geary's c Migration 0.010 0.005 0.004*** 0.998 Employed 0.015 0.005 0.000*** 0.996 Years of schooling 0.100 0.005 0.000*** 0.866 Sex 0.000 0.005 0.540 0.997 Age 0.060 0.005 0.000*** 0.935 Married 0.081 0.005 0.000*** 0.899 Dependency ratio 0.039 0.005 0.000*** 0.966 Note: *, **, *** denote significance at 10%, 5% and 1% levels SD 0.017 0.015 0.007 0.005 0.009 0.007 0.008 p-value 0.909 0.792 0.000*** 0.540 0.000*** 0.000*** 0.000*** SD . 0.013 0.007 0.005 0.01 0.007 0.008 p-value . 0.000*** 0.000*** 0.540 0.003*** 0.000*** 0.000*** SD 0.006 0.007 0.008 0.005 0.011 0.008 0.009 p-value 0.000*** 0.017*** 0.000*** 0.540 0.890 0.000*** 0.000*** Table A 6: Results showing spatial dependencies in variables for 1998 Variables Moran's I SD p-value Geary's c Migration . . . . Employed 0.059 0.005 0.000*** 0.924 Years of schooling 0.126 0.005 0.000*** 0.841 Sex 0.000 0.005 0.540 0.997 Age 0.043 0.005 0.000*** 0.971 Married 0.069 0.005 0.000*** 0.913 Dependency ratio 0.033 0.005 0.000*** 0.960 Note: *, **, *** denote significance at 10%, 5% and 1% levels Table A 7: Results showing spatial dependencies in variables for 2008 Variables Moran's I SD p-value Geary's c Migration 0.420 0.005 0.000*** 0.591 Employed 0.025 0.005 0.000*** 0.983 Years of schooling 0.107 0.005 0.000*** 0.870 Sex 0.000 0.005 0.540 0.997 Age 0.010 0.005 0.005*** 1.002 Married 0.043 0.005 0.000*** 0.928 Dependency ratio 0.074 0.005 0.000*** 0.912 Note: *, **, *** denote significance at 10%, 5% and 1% levels 163 Stellenbosch University https://scholar.sun.ac.za Table A 8: Population figures for individuals aged (15 years and older) 1987 District Male Female Chitipa 20,100 24,450 Karonga 32,580 37,540 Nkhata Bay, Likoma 31,780 35,390 Rumphi 22,100 24,750 Mzimba 108,440 117,170 Kasungu 89,990 78,670 Nkhota Kota 41,230 39,810 Ntchisi 30,030 31,700 Dowa 81,260 86,150 Salima 47,140 52,590 Lilongwe 246,180 254,090 Mchinji 63,850 56,860 Dedza 83,060 108,420 Ntcheu 64,030 87,060 Mangochi 103,950 127,900 Machinga 110,070 140,180 Zomba 105,430 128,100 Chiradzulu 48,330 64,420 Blantyre 168,480 145,910 Thyolo 102,140 118,680 Mulanje 132,890 177,440 Chikwawa 68,390 70,110 Nsanje 33,870 39,900 Neno 23,600 29,200 Total 1,858,920 2,076,490 Source: Own computation from census data 1998 Male Female 29,740 35,780 49,130 56,430 2008 Male Female 42,900 47,600 66,590 73,190 45,300 49,980 33,720 35,840 161,690 173,590 139,600 125,510 63,140 61,800 44,960 46,330 111,530 114,430 67,400 71,110 381,780 371,920 87,940 86,530 120,660 142,510 92,040 110,580 158,050 179,370 160,250 186,870 151,760 164,380 62,160 76,890 252,610 229,830 120,750 141,180 170,600 210,110 95,930 98,280 49,520 55,770 35,320 38,970 2,685,580 2,863,990 56,980 61,000 45,120 46,780 228,090 244,440 170,160 164,410 79,230 79,550 63,000 63,740 152,060 158,700 89,810 93,730 540,870 534,880 121,150 119,380 159,680 182,870 123,020 140,760 194,000 218,630 196,960 223,400 174,900 192,770 74,070 88,790 299,600 288,050 150,130 171,600 206,610 244,320 117,710 117,230 62,890 64,510 50,140 54,810 3,465,670 3,675,140 164 Stellenbosch University https://scholar.sun.ac.za Table A 9: Number of people who migrated to other districts (15 years and older) 1987 Male Female Chitipa 206 281 Karonga 375 411 Nkhata Bay, Likoma 434 418 Rumphi 453 460 Mzimba 1,914 1,877 Kasungu 2,040 1,438 Nkhota Kota 955 814 Ntchisi 464 410 Dowa 1,198 1,105 Salima 956 757 Lilongwe 3,751 3,236 Mchinji 1,203 903 Dedza 1,129 897 Ntcheu 637 668 Mangochi 1,817 1,672 Machinga 2,098 1,888 Zomba 1,800 1,503 Chiradzulu 593 595 Blantyre 4,457 3,101 Thyolo 1,626 1,522 Mulanje 1,462 1,310 Chikwawa 862 687 Nsanje 328 325 Neno 252 283 Total 31,010 26,561 Source: Own computation from census data Total 487 786 852 913 3,791 3,478 1769 874 2,303 1713 6,987 2106 2026 1305 3,489 3,986 3,303 1188 7,558 3,148 2,772 1549 653 535 57,571 165 2008 Male Female Total 61 64 125 182 185 367 296 213 509 223 199 422 1,153 1,036 2,189 792 664 1,456 244 212 456 164 146 310 275 286 561 306 270 576 3,201 2,590 5,791 295 297 592 250 228 478 281 265 546 705 659 1,364 713 662 1,375 931 836 1,767 259 276 535 4,428 3,805 8,233 733 676 1,409 685 646 1,331 563 527 1,090 257 264 521 346 327 673 17,343 15,333 32,676 Stellenbosch University https://scholar.sun.ac.za Table A 10: Proportions of people who migrated to other districts (15 years and older) 1987 Male Female Chitipa 10.2% 11.5% Karonga 11.5% 10.9% Nkhata Bay, Likoma 13.7% 11.8% Rumphi 20.5% 18.6% Mzimba 17.7% 16.0% Kasungu 22.7% 18.3% Nkhota Kota 23.2% 20.4% Ntchisi 15.5% 12.9% Dowa 14.7% 12.8% Salima 20.3% 14.4% Lilongwe 15.2% 12.7% Mchinji 18.8% 15.9% Dedza 13.6% 8.3% Ntcheu 9.9% 7.7% Mangochi 17.5% 13.1% Machinga 19.1% 13.5% Zomba 17.1% 11.7% Chiradzulu 12.3% 9.2% Blantyre 26.5% 21.3% Thyolo 15.9% 12.8% Mulanje 11.0% 7.4% Chikwawa 12.6% 9.8% Nsanje 9.7% 8.1% Neno 10.7% 9.7% Total 15.8% 12.9% Source: Own computation from census data Total 10.9% 11.2% 12.7% 19.5% 16.8% 20.6% 21.8% 14.2% 13.8% 17.2% 14.0% 17.5% 10.6% 8.6% 15.1% 15.9% 14.1% 10.5% 24.0% 14.3% 8.9% 11.2% 8.9% 10.1% 14.3% Male 1.4% 2.7% 5.2% 4.9% 5.1% 4.7% 3.1% 2.6% 1.8% 3.4% 5.9% 2.4% 1.6% 2.3% 3.6% 3.6% 5.3% 3.5% 14.8% 4.9% 3.3% 4.8% 4.1% 6.9% 4.2% 2008 Female 1.3% 2.5% 3.5% 4.3% 4.2% 4.0% 2.7% 2.3% 1.8% 2.9% 4.8% 2.5% 1.3% 1.9% 3.0% 3.0% 4.3% 3.1% 13.2% 3.9% 2.6% 4.5% 4.1% 6.0% 3.7% Total 1.4% 2.6% 4.3% 4.6% 4.6% 4.4% 2.9% 2.5% 1.8% 3.1% 5.4% 2.5% 1.4% 2.1% 3.3% 3.3% 4.8% 3.3% 14.0% 4.4% 3.0% 4.6% 4.1% 6.4% 3.9% Note: As earlier stated, four of the districts in Malawi consist of cities, namely Mzuzu city (in Mzimba district), Lilongwe city (in Lilongwe district), Zomba municipality or city (in Zomba district) and Blantyre city (in Blantyre district). A movement from the rural areas of the district into the city (e.g. from Blantyre rural to Blantyre city) and vice versa is recorded as migration. As a result, the high migration patterns observed in Blantyre and Lilongwe (which are the two major cities in Malawi) could largely be rural-urban migration involving people moving rural areas to urban areas within the district boundaries. 166 Stellenbosch University https://scholar.sun.ac.za Table A 11: Effects of land reform policy on migration Male Female SEM SAC SEM SAC Time -0.170*** -0.102*** -0.028 -0.016 (0.025) (0.038) (0.023) (0.039) DID 0.015 0.015 0.006 0.017 (0.016) (0.016) (0.017) (0.015) Schooling -0.013 -0.013 -0.037*** -0.029*** (0.010) (0.010) (0.010) (0.010) Age 0.088 0.088 -0.179*** -0.151*** (0.056) (0.056) (0.051) (0.052) Age squared -0.002** -0.002** 0.002*** 0.002** (0.001) (0.001) (0.001) (0.001) Married 0.384*** 0.382*** 0.074 0.076 (0.125) (0.123) (0.116) (0.110) Employed -0.084** -0.085** -0.103*** -0.109*** (0.039) (0.038) (0.023) (0.023) Assets 0.033*** 0.033*** -0.012 -0.008 (0.010) (0.010) (0.010) (0.010) W*migration 0.564** 0.447 (0.260) (0.284) Lambda 0.755*** 0.528* 2.024*** 0.789*** (0.155) (0.302) (0.145) (0.152) Observations 356 356 356 356 Spatial units 178 178 178 178 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis 167 Stellenbosch University https://scholar.sun.ac.za Table A 12: Effects of land reform policy on agricultural employment Male Female SEM SAC SEM SAC Time -0.387*** -0.352** -0.436*** -0.324*** (0.033) (0.137) (0.043) (0.123) DID 0.028 0.027 0.053** 0.050** (0.021) (0.021) (0.021) (0.021) Schooling 0.074*** 0.073*** 0.054* 0.054* (0.024) (0.024) (0.028) (0.028) Age 0.286 0.279 1.018*** 1.031*** (0.542) (0.543) (0.341) (0.343) Age squared -0.004 -0.004 -0.016*** -0.017*** (0.009) (0.009) (0.006) (0.006) Married -0.156 -0.153 -0.166 -0.151 (0.309) (0.309) (0.271) (0.269) Dependency -0.268* -0.265* 0.096 0.101 (0.142) (0.142) (0.113) (0.111) W*employed 0.113 0.342 (0.438) (0.352) Lambda 0.334 0.275 0.611*** 0.479 (0.319) (0.412) (0.218) (0.309) Observations 356 356 356 356 Spatial units 178 178 178 178 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis 168 Stellenbosch University https://scholar.sun.ac.za Table A 13: Effects of land reform policy on government employment Male Female SEM SAC SEM SAC Time 0.079** -0.002 -0.011 -0.024 (0.038) (0.021) (0.012) (0.017) DID 0.028*** 0.024*** 0.013** 0.012** (0.007) (0.007) (0.006) (0.005) Schooling 0.007 0.011 0.033*** 0.033*** (0.008) (0.007) (0.007) (0.007) Age -0.277* -0.264* 0.432*** 0.436*** (0.162) (0.159) (0.087) (0.088) Age squared 0.005* 0.004* -0.007*** -0.007*** (0.003) (0.003) (0.001) (0.001) Married -0.102 -0.113 0.111 0.110 (0.092) (0.089) (0.070) (0.070) Dependency 0.021 0.016 0.042 0.041 (0.044) (0.043) (0.029) (0.029) W*employed 0.799*** 0.324 (0.130) (0.369) Lambda 0.922*** 0.830*** 0.707*** 0.582** (0.054) (0.115) (0.174) (0.288) Observations 356 356 356 356 Spatial units 178 178 178 178 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis 169 Stellenbosch University https://scholar.sun.ac.za Table A 14: Effects of land reform policy on self, wage and private employment (OLS only) Male Female Self Private Self Private Time 0.228*** 0.185*** 0.213*** 0.108*** (0.020) (0.018) (0.019) (0.013) Treated -0.018 -0.007 0.002 -0.014 (0.015) (0.014) (0.015) (0.010) DID -0.043** -0.040** -0.021 -0.041*** (0.021) (0.019) (0.020) (0.014) Schooling -0.001 0.020*** 0.008* 0.011*** (0.006) (0.005) (0.004) (0.003) Age -1.436*** -0.298 -0.691*** -0.106 (0.310) (0.284) (0.161) (0.110) Age squared 0.023*** 0.005 0.011*** 0.002 (0.005) (0.005) (0.003) (0.002) Married -0.274* -0.047 -0.241** 0.090 (0.159) (0.146) (0.099) (0.068) Dependency -0.134 -0.324*** -0.030 -0.129** (0.106) (0.097) (0.080) (0.055) Constant 22.764*** 4.543 10.835*** 1.62 (4.826) (4.416) (2.428) (1.658) R-squared 0.580 0.572 0.657 0.428 Observations 356 356 356 356 Notes: *, **, *** denote significance at 10%, 5% and 1% levels; standard errors in parenthesis 170