WPS8376 Policy Research Working Paper 8376 Risk Preferences and the Decision to Flee Conflict Lidia Ceriani Paolo Verme Fragility, Conflict and Violence Global Theme March 2018 Policy Research Working Paper 8376 Abstract Despite the growing numbers of forcibly displaced per- maximizing gains by choosing the lottery with the best sons worldwide, many people living under conflict choose outcome at the higher end of the distribution. Using a rich not to flee. Individuals face two lotteries—staying or set of household and conflict panel data from Nigeria, the leaving—characterized by two distributions of poten- paper finds that risk-tolerant individuals have a significant tial outcomes. This paper proposes to model the choice preference for staying and risk-averse individuals have a between these two lotteries using quantile maximization significant preference for fleeing, in line with the predic- as opposed to expected utility theory. The paper posits that tions of the quantile maximization model. These findings risk-averse individuals aim at minimizing losses by choos- are contrary to findings on economic migrants, and call ing the lottery with the best outcome at the lower end of for separate policies toward economic and forced migrants. the distribution, whereas risk-tolerant individuals aim at This paper is a product of the Fragility, Conflict and Violence Global Theme. It is part of a larger effort by the World Bank to provide open access to its research and make a contribution to development policy discussions around the world. Policy Research Working Papers are also posted on the Web at http://econ.worldbank.org. The authors may be contacted at pverme@worldbank.org or Lidia.Ceriani@georgetown.edu. The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those of the authors. They do not necessarily represent the views of the International Bank for Reconstruction and Development/World Bank and its affiliated organizations, or those of the Executive Directors of the World Bank or the governments they represent. Produced by the Research Support Team Risk Preferences and the Decision to Flee Conflict Lidia Ceriani Paolo Verme∗ Georgetown University The World Bank JEL Codes: D01; D1; D3; D6; D7; D8; I3. Keywords: Conflict; migration, expected utility, forced displacement, quantile maximiza- tion. ∗ Corresponding author: pverme@worldbank.org. The authors wish to thank participants to the follow- ing meetings for their useful comments: seventh meeting of the Society for the Study of Income Inequality (ECINEQ) held in New York in July 2017; annual workshop of the Household in Conflict Network (HiCN) held in Brussels in November 2017; Georgetown Mortara International Development Seminar held in Wash- ington DC in February 2018. The authors also thank peer-reviewers at the World Bank, including Xavier Devictor, Toan Do, Kevin McGee, David McKenzie and C ¸ ha˘ ¨ glar Ozden. This work is part of the program “Building the Evidence on Protracted Forced Displacement: A Multi-Stakeholder Partnership”. The pro- gram is funded by UK aid from the United Kingdom’s Department for International Development (DFID), it is managed by the World Bank Group (WBG) and was established in partnership with the United Nations High Commissioner for Refugees (UNHCR). The scope of the program is to expand the global knowledge on forced displacement by funding quality research and disseminating results for the use of practitioners and policy makers. This work does not necessarily reflect the views of DFID, the WBG or UNHCR. 1 1 Introduction The United Nations High Commissioner for Refugees (UNHCR) estimates that there were 65.6 million forcibly displaced people worldwide at the end of 2016.1 This estimate is the largest on record since the creation of the UNHCR in 1950 (Figure 1) and is constituted by Internally Displaced Persons (IDPs, 40.3 m), refugees (22.5 m) and asylum-seekers (2.8 m). The growth in these numbers is evidently associated with episodes of conflict and violence. The Syrian Arab Republicn, Afganistan and South Sudan conflicts have been responsible for the major outflows of refugees whereas conflicts in Colombia, Syrian Arab Republic, Iraq, Democratic Republic of Congo, Sudan and Nigeria are the ones that generated the largest numbers of IDPs. Despite these numbers, most of the people who are affected by conflict do not migrate. Some people may have nowhere to go and others may not have the means or the opportunity to flee but many of those who live in conflict areas choose not to move.2 What are the factors that explain this choice remains a rather unexplored question. In this paper we posit that individuals living under conflict are Quantile Maximizers (QM) as opposed to Expected Utility Maximizers (EUM). Individuals face two lotteries - staying or leaving - corresponding to two distributions of outcomes that are observed and used to take migration decisions. In an EUM framework, individuals would focus on expected utilities (mean expected outcomes) and take decisions accordingly. In a QM framework, individuals self-select into groups that focus on different parts of the distribu- tions of outcomes. In its most extreme form, we can imagine individuals partitioning into two groups. The first group focuses on the lowest outcomes of the two lotteries, and picks the lottery where this outcome is higher (maxmin group). This is the group of individu- als that exhibit the largest aversion to risk and prioritize minimizing losses. The second group focuses on the highest outcomes of the two lotteries and picks the lottery with the higher outcome (maxmax group). This is the group of least risk-averse individuals (or risk-tolerant individuals) who seek to maximize gains. When compared to the EUM approach, the QM approach is more parsimonious, flexible and realistic. Individuals are not expected to be knowledgable of all possible prospected outcomes, since they focus on one particular segment of the distribution of outcomes. This is a more realistic assumption in a context where information is scarce and erratic. It is also not necessary to speculate on the shape of individual utility functions. If individual utilities are identical for all, the EUM model would not be able to predict who flees and who does not because all individuals would face the same expected utility and would take the same decision. If utilities vary across individuals, an EUM approach would require aggregating utilities across the population, which is complex. Instead, with a QM approach, individuals’ risk preferences are revealed by the quantile of the distribution they focus on: 1 http://www.unhcr.org/globaltrends2016. 2 One study estimated that Conflict Affected Residents (CARs) accounted for about 87 percent of the total number of people affected by conflict (file:///C:/Users/wb201247/Downloads/PAC2013.pdf). 1 at the extremes, maxmin individuals are the most risk-averse and maxmax individuals are the least risk-averse (risk-tolerant). This is simpler, closer to the reality of conflict and also easier to test with empirical data. In our knowledge, this is the first paper that uses a QM approach to study risk pref- erences in a conflict scenario. The first paper that formalized a QM approach to decision making is Rostek [2010]3 . The paper provided the theoretical characterization of quantile maximization in decision theory but did not offer any empirical application and we are not aware of empirical works that used this approach. The first and only paper we found that studies risk preferences as a possible driver of forced migration is Mironova and Whitt [2017]. This is an unpublished experiment conducted in Syrian Arab Republic and Turkey between 2013 and 2014. Using a variation of the Eckel and Grossman (2002 and 2008) risk game, the authors find combatants and non-combatants in rebel-held conflict zones to be more risk-tolerant than Syrian Arab Republicns living in other parts of Syrian Arab Re- public or Syrian Arab Republicns who fled abroad to Turkey. These results are consistent with our results but the paper does not build on a QM model. Two recent papers looked at the role of conflict (or violence) in shaping risk pref- erences. Research across the social sciences has shown how traumas of different kinds (natural calamities, economic slumps, or personal traumas), or even the recollection of traumas, can modify risk preferences temporarily or permanently (Malmendier and Nagel, 2011; Voors et al., 2012; Eckel et al., 2009; Lerner and Keltner, 2001). Similar findings are emerging in studies related to conflict. Voors et al. [2012] run a number of experiments among individuals who have been affected by violence in Burundi and find that individuals who experienced violence or lived in communities that experienced violence are signifi- cantly more risk-tolerant. Callen et al. [2014] looked at violence and risk preferences in Afghanistan. They find that preferences for certain outcomes increase when individuals are exposed to violence or even recollect violent episodes (what they refer to as “the certainty premium”). These studies are closely related to our work but they focused on understand- ing how conflict (or violence) affects risk preferences rather than understanding whether risk preferences contribute to select migrants and nonmigrants. They are important for us because they point to a potentially confounding factor. Differences in risk preferences between migrants and nonmigrants may be a reflection of the longer impact that conflict has on risk-aversion for the nonmigrants rather than the outcome of a self-selection process, a hypothesis we will test in the paper. The other body of literature that is related to our paper is the one on economic mi- gration and risk preferences. This is quite vast and emerged in response to the seminal migration model proposed by Todaro [1969]. The original model focussed on rural-urban migration and identified expected income as the core driver of migration in line with EU theory. This hypothesis was later found to be inconsistent with some of the empirical 3 Chambers [2009] shows that quantiles are an essentially unique ordinal decision criterion which does not violate weak first-order stochastic dominance. 2 evidence that emerged from developing economies and one of the early explanations for this failure was risk-aversion. Simon [1964] and Myrdal [1968] had already noticed that migrants are selectively drawn from rural areas based on personal characteristics that are closely associated with risk-taking such as young age so that one should expect risk-tolerant individuals to be more likely to migrate. In a theoretical model devised as a critique to the Todaro model, Katz and Stark [1986] showed that a small chance of reaping high rewards is a sufficient condition to trigger migration irrespective of wage differentials, a finding that would make risk-taking behavior consistent with economic migration. Recent empirical work in developed and developing economies supports this view. Using data from the German Socio-Economic Panel (GSOEP), Jaeger et al. [2010] find that Germans who had spells of migration between 2000 and 2006 were more likely to be risk-tolerant individuals as opposed to those who did not move. Using the 2009 round of the Rural Household Survey (RHS) of China, Akg¨ ¸ et al. [2016] find that risk-tolerance positively uc affects the decision to migrate and this holds for households as well as for household heads and their spouses. It is important to note here that this literature focuses on self-selection of individuals into migrants and nonmigrants. It does not address the question of whether exposure to poverty (instead of conflict) has an impact on risk preferences.4 This paper uses household and conflict panel data from Nigeria for the period 2010-2016 to predict migration from conflict areas with a QM model. It finds the model to predict risk- averse individuals to leave conflict areas and risk-tolerant individuals to stay. Using a direct measure of risk preferences, it also finds these predictions to match observed risk preferences among leavers and stayers. If these results are confirmed, there is clearly a sharp difference between economic migrants and forced migrants in terms of risk preferences. Economic migrants are less risk-averse and migrate with the expectation of reaping large rewards and better living conditions. Forced migrants are more risk-averse and migrate in an effort to protect their lives and minimum living standards. As discussed in the conclusions, these findings have important policy implications for both the place of origin and the place of destination of migrants. The paper is organized as follows. Section 2 summarizes traditional models of decision 4 The theory and evidence on economic migration and risk preferences is more complex. For example, Bonin et al. [2009] find that first generation immigrants to Germany have a higher risk-aversion than natives using the 2004 GSOEP, the same survey used by Jaeger et al. [2010]. This is not necessarily in contrast with this latter paper because the samples compared are different but shows the importance of distinguishing between internal and external migration and between first and second generations of migrants. Some scholars have also provided arguments for economic migrants to be more risk averse. One hypothesis is that migration decisions are household based and that households strategically select individual members for migration in an effort to diversify and reduce risks (Stark and Levhari, 1982; Katz and Stark, 1986). While plausible, there is not much empirical evidence supporting this hypothesis and such hypothesis is not necessarily in contrast with the fact that, within the household, risk-tolerant individuals would be more willing to migrate. Another hypothesis states that risks associated with the place of destination decreases over time as people settle and find work and that these risks may be below those of the place of origin, encouraging therefore risk averse individuals to migrate. Again, the empirical evidence for this hypothesis remains inconsistent as compared to the findings described in the text. 3 under uncertainty and outlines the model we propose; Section 3 describes the empirical strategy. Section 4 illustrates the data, Section 5 shows the results and Section 6 concludes summarizing results and discussing policy implications. 2 Modeling Decisions under Uncertainty 2.1 Expected utility and recent developments Orthodox economic theory frames decision making under uncertainty using models of expected utility maximization.5 The backbone of these models is the von Neumann- Morgenstern Expected Utility (EU) model (1944), where individuals’ utilities over n pos- sible states of the world are described as n U (x) = πi u(xi ) (1) 1=1 where πi is the probability that state of the world i occurs, with n i=1 πi = 1, and u(xi ) is the utility associated with outcome xi , which we will consider some monetary value, such as levels of income or expenditure. Let Fx denote the cumulative distribution function associated to the random variable (also referred to as prospect, or lottery) x = [x1 , π1 ; x2 , π2 , . . . ; xn , πn ], where Fx = P r(x ≤ xi ), i = 1, 2, . . . , n. In standard models of these kinds, individuals are expected to be risk-averse so that the function u(.) is expected to be concave. A prospect y is non preferred (riskier) to a prospect x when y is a mean preserving spread of x (see Rothschild and Stiglitz, 1970). Figure 2 provides an example of a mean-preserving spread. Distributions x and y have the same expected outcome of 5 but distribution x has a smaller spread of outcomes than y . EU theory predicts that U (x) > U (y ), or that x y (x is preferred to y ). Risk-aversion induces individuals to choose the prospect with the lowest spread of outcomes (variance), given the same expected outcome. Note that for individuals to make a decision among two prospects, they must have complete information on the entire domain of the distribution of outcomes. While the von Neumann-Morgenstern EU model remains the model of choice for most economists, several critiques have emerged over the past few decades. One early critique is the Ellsberg paradox (Ellsberg, 1961) or the question of ambiguity aversion. When 5 The original idea dates back at least to Daniel Bernoulli (Versuch einer neuen Tizeorie der Wertbe- stirnrnung von Gliicksfdllen, Leipzig, 1896, translated by A. Pringsheim from “Specimen theoriae novae de mensura sortis” Commentarii acaderniae scienliarum imperialis Petropolitanae, Vol. V, for the years I730 and I731, published in 1738). It was rejected by classical economists such as Marshall on the ground that maximizing expected utility with a utility curve with diminishing returns implies that: “The gain in utility from winning a dollar will be less than the loss of utility from losing a dollar (...), individuals would always have to be paid to induce them to bear a risk. But this implication is clearly contrary to actual behavior” (Friedman and Savage, 1948). 4 individuals are put in front of two alternatives, one with a certain low outcome and one with an uncertain high outcome, they would generally opt for the certain low outcome because of preferences for known odds to unknown odds and this violates the postulates of expected utility theory. A second and more recent critique to the EU model is Kahneman and Tversky’s [1979] prospect theory. The central theme of the critique is that people underweight outcomes that have very low probability of occurring and overweight outcomes that have a very high probability (this is called the certainty effect ). Considering equal weighting as in EU theory can lead to the Allais paradox (Allais, 1953) where different choice frameworks can lead to opposite conclusions about the dominance of alternative choices. Kahneman and Tversky [1979] showed that, for negative prospects, preferences are reversed as compared to positive prospects (this is called the reflection effect). Therefore, “ [...] certainty increases the aversiveness of losses as well as the desirability of gains ” (Kahneman and Tversky, 1979, p. 269). The authors introduce the notion of decision weight to weigh the importance that people give to different probabilities so that the expected utility function becomes the following equation (2): n U (x) = w(πi )u(xi ). (2) i=1 where the weighting function w(·) : [0, 1] −→ [0, 1] infinitely underweighs infinitesimal probabilities and infinitely overweighs near-one probabilities. Both the Ellsberg and Khanemann and Tversky critiques show that decisions under uncertainty cannot be driven simply by a mean and spread of a probabilistic distribution but that individuals attribute different importance to and have different preferences for different parts of the probability distribution. Behavioral and experimental economics have now provided countless examples supporting these views but it remains difficult to test the predictions of these models with non experimental and empirical data. Recent models of economic migration have also advanced our understanding of inter- temporal decisional processes emphasising the importance of past experiences and learning processes in decision making in a dynamic or life-cycle framework (Kenna and Walker [2013]), a feature that is also shared by EU types of models that include subsistence thresholds. These models provide alternatives to classic EU models and allow for out- comes that could explain, for example, why individuals may choose not to migrate even when migrating would clearly improve their present welfare and utility.6 The general idea is that individuals have long-term rather than short-term goals and adapt to lower present consumption for ensuring long-term subsistence or other consumption goals. In the context of the Boko Haram conflict in North-East Nigeria and, more generally, in the context of forced migration, these models are less relevant as the decision to move is taken quickly 6 In some cases, even material incentives to migrate have proved not to increase migration (see a recent review of this literature in Beam et al. [2016]). 5 and it is less likely that individuals can learn from past experiences. Indeed, one of the factors that explains Boko Haram’s success is related to the fact that its incursions are audacious and unpredictable such as the kidnapping of 200 schoolgirls in 2014. Under such circumstances, behavioral factors such as risk aversion become more important to decision making than rational planning based on past experiences. Our interest is to see how we can devise a model that draws on the intuitions provided by behavioral economics and the economics of migration but remains sufficently simple to predict migration decisions in the short-term and be tested with empirical data that are commonly available to researchers. The model presented in the next section is not necessarily a substitute of existing models but a viable alternative for empirical researchers working with real world data. 2.2 A simplified Quantile Maximization Model We saw that in EU theory risk-aversion is a measure of how much individuals dislike the spread of prospect outcomes. We argue, instead, that a more relevant interpretation of risk-aversion for individuals facing conflict is their concern for downside risks. Consider the two prospects x and y reported in Figure 2, and assume that an individual is concerned not about the spread, but about falling below any given value with probability greater than 0.5. With such (downside) risk of 0.5, this individual obtains 2 in lottery x and 3 in lottery y . Therefore, this individual will choose lottery y . Note that this different interpretation of risk-aversion (aversion to downside risk as opposed to spread), allows a more parsimonious assumption on the set of information available to individuals. In particular, individuals are not expected to have information about Fx and Fy over their entire domain, but only up to the smallest realization of the prospects x and y such that (in this specific example) Fx ≥ 0.5 and Fy ≥ 0.5. Also, no hypothesis on the shape of individual utility functions is needed. The only assumption is that people prefer more to less. If individuals are concerned with downside risk and not with mean preserving spreads, EU Maximization is not helpful in modeling and predicting individual choices. An alterna- tive framework which models the attitude of individuals towards downside risk is Quantile Maximization (QM). In QM, individuals concentrate only on selected segments (quantiles) of the distribution of prospective outcomes, according to their risk attitude. On one ex- treme, the most cautious individual will choose the lottery with the highest return in the worst case scenario (maxmin decision maker); on the other extreme, the least cautious will pick the lottery with the highest return in the best case scenario (maxmax decision maker). More in general, individuals may be somewhere in between the two extreme cases, being τ -quantile decision makers. Rostek [2010] provides the theoretical characterization of quantile maximization in de- cision theory. Let Fx be the cumulative probability distribution of the prospect x. Then, the τ -th quantile of x is a (generalized) inverse of the cumulative distribution at τ , defined 6 as the smallest value xi such that the probability that the prospect x will be less than xi is not smaller than τ : Qτ (Fx ) = inf {xi |Fx ≥ τ } , (3) while for τ = 0, the quantile is defined as Qτ (Fx ) = sup {xi |Fx ≤ 0} (4) A decision maker is said to be a τ −quantile maximizer if there exists a unique τ ∈ [0, 1], such that for all x, y ∈ F , x y ⇐⇒ Qτ (Fx ) > Qτ (Fy ). (5) Similarly to the EU maximizer, where individuals evaluate lotteries on the basis of a single statistics, namely the mean, the τ -maximizer assesses the value of each lottery by the τ -th quantile realization. As mentioned above, at the extreme, quantile maximization includes the maxmin maximizer who picks the lottery with the highest payoff among the lowest outcomes: x y ⇐⇒ min u(x) > min u(y ) (6) xi |(πi >0) yi |(πi >0) and the maxmax maximizer selects the lottery with the highest payoff among the highest outcomes: x y ⇐⇒ max u(x) > max u(y ) (7) xi |(πi >0) yi |(πi >0) As an example, let us assume that x and y are two lotteries with four different equally plausible states of the world each, as represented in Figure 3. The corresponding cumulative distribution functions Fx and Fy are graphically represented in Figure 4. The EU maximizer will select lottery y to lottery x, since the two lotteries have the same mean but x has a wider spread around the mean. Instead, different τ -maximizers will choose a different lottery depending on the quantile they focus on. For τ =0, y will be preferred over x because the highest outcome in the worse-case scenario belongs to y . This is true for all values of τ ∈ (0, 1/2). For τ ≥ 1/2, the preferred lottery becomes x because, with a probability equal or larger than 1/2, the minimum outcome that individuals can realize under x is always higher than the minimum outcome which they can realize under y . In our model, the two lotteries correspond to the prospects of migrating and non mi- grating. We claim that the different risk-aversion gradient of individuals results in different migration choices because they concentrate on a different quantile to take their decision. If x is the prospect of migrating and y is the prospect of staying, only relatively more risk-tolerant individuals would leave their homes. 7 Note that, in the QM framework, individuals can be ordered with respect to their degree of risk-aversion by looking at the quantile chosen to make decisions among different prospects. In particular, as noted by Rostek [2010], in “the Quantile Maximization model, τ < τ if and only if a τ −maximizer is weakly more averse toward downside risk than a τ −maximizer” (Rostek, 2010, p.354). To conclude, the advantages of QM over EU can be summarized as follows: (i) QM does not require any hypothesis on the shape of individual utility functions, only that more (income or expenditure) is better than less; (ii) QM does not require full information on the entire support of distribution functions of future prospects, only on part of the domain (even just the extreme realizations, without any probabilistic information, in the extreme cases of maxmin and maxmax decision makers); (iii) QM allows to model a type of risk-aversion, namely, aversion to downside risks, which appears to be better suited to model decision under conflict; (iv) QM does not require a learning process based on past experiences similar to recent EU types of models proposed in the context of economic migration and is therefore more suitable for choices made in a forced displacement context. 3 Empirical strategy Our empirical strategy is structured in four steps: 1) Estimation of the distributions of outcomes (lotteries) for people living under conflict who decide to stay or leave; 2) Predic- tions of migration choices for people with different risk preferences (focusing on different quantiles of the distributions) based on the QM model; 3) Test whether the QM predic- tions match observed risk preferences among stayers and leavers; 4) Test for a potential cunfounding factor. The first step of our empirical strategy is to determine the distribution of potential outcomes for people opting to stay or leave conflict-affected areas where outcomes are measured in terms of household expenditure per capita. The outcomes for stayers are observed and the distribution of these outcomes can be derived directly from data. The potential gains (outcomes) of moving from conflict to non conflict areas are not observed and require estimation. In order to estimate these potential gains for households living in conflict areas, we first estimate a welfare equation for non-conflict areas as a log-linear equation as follows: ln(yi,nc ) = α + βnc Xi,nc + i (8) where y is household expenditure per capita, i = 1, 2, . . . , n are all households living in non-conflict affected areas (nc), and Xi,nc is a matrix of household-specific characteristics. The estimated coefficients β ˆnc are then used to predict the expenditure levels that house- holds living in conflict areas could potentially have if they migrated to non-conflict-affected areas: 8 ln(ˆ yi,c ) = α ˆnc Xi,c ˆ+β (9) We then use Duan’s smearing re-transformation to obtain levels of expenditure starting from the predicted log expenditures (Duan, 1983). This procedure allows us to obtain two comparable distributions of potential outcomes (lotteries) for people living under conflict if they choose to stay or to leave. The first distribution can be interpreted as the distribution of gains for people who decide to stay (stayers ) and the second distribution can be interpreted as the distribution of gains for people who decide to leave (leavers ). As in all prediction models of this kind, selection on unobservables is of course pos- sible but less likely than in models looking at mean outcomes for treated and untreated individuals. In our case, we are not comparing two groups (treated and non treated) but two lotteries for the same group. The two lotteries relate to the same people with the same observable and unobservable characteristics. We are also assuming that people take decisions based on the same observables we have in the data. There may be unobservables relevant to predict consumption that bias consumption estimates but they are irrelevant for the decision to flee conflict if they are not observed by the decision makers. The other assumption we make (common to all cross-imputation exercises) is that the Betas of the prediction equation are constant if one migrates or stays. This is a strong assumption of course and we could still miss on some unobservables that are important to people when deciding to flee conflict, they are observed by the decision makers but not captured by the data and are unrelated to risk aversion. One of the likely factors that fall in this category is having kin in safe areas, a factor we do not observe and is likely to affect the decision to migrate. To address this question, we have added language to the prediction of consumption in the case of migration as a proxy for kin networks. The second step consists of comparing the two distributions of gains to find the mi- gration choice that the QM model would predict for people with different risk preferences, or at the extreme, maxmin (risk averse) and maxmax (risk tolerant) individuals. This is done by means of stochastic dominance analysis of first degree, i.e. by comparing the Cumulative Distribution Functions (CDFs) of the two lotteries. This analysis reveals the highest gain (dominance) between the two lotteries for each quantile of the probability distributions.7 It can tell us, therefore, what the migration preference of people focusing on different quantiles would be. Or, in a world of only maxmin and maxmax individuals, it will predict the migration choice made by these two groups. The third step is to test whether the predictions of the QM model match the risk preferences observed among stayers and leavers. This is done using a direct question on risk preferences administered to respondents and studying the association between migration status and risk preferences as follows: 7 Recall that the y-axis of CDFs built on probabilistic functions represent the sum of probabilities up to the corresponding outcome. This is why we referred to downside risks when we described the model. 9 Mi = α + βRAi + γXi + ηi (10) where M (0, 1) is the migration status of heads of households i, RA is the observed risk aversion of heads of households, X is a vector of household and individual caracteristics and η is the error term. If the predictions of the QM model for risk averse and risk tolereant individuals match the risk preferences observed among stayers and leavers, we consider our model successfully tested. Note that we expect RAi to be independent of ηi because many of the known correlates of RAi are observed and present in Xi . The fourth and final step is to test for a possible confounding factor. As already discussed, one possible confounding factor of our results is that conflict could increase risk-tolerance as found by Voors et al. [2012] for Burundi and by Callen et al. [2014] for Afghanistan. If this is the case, results could be explained solely by the fact that stayers and leavers are exposed to conflict for different lenghts of time. To test for this confounding factor we carry out a Difference-in-Differences (DiD) test. We consider risk-aversion as an outcome, conflict as treatment and migration as a control and interact conflict and migration to obtain the DiD estimator as follows: RAi = α + βCi + γMi + θCi Mi + φXi + πi (11) Where RAi is risk-aversion, Ci is a dummy variable taking value 1 if the individual has experienced conflict and 0 otherwise; Mi is a dummy variable taking value 1 if the individual never migrated over the period taken into account and 0 otherwise, Xi is a matrix of individual’s socio-economic characteristics, and π is the error term. 4 Data and context The paper uses household and conflict data from Nigeria, the largest country in Africa in terms of population and GDP and also the largest country in terms of IDPs in spite of relatively low migration flows. Estimates of external migration are gross and point to relatively low stocks. It is estimated that in 2010 the stock of immigrants was about 1.1 million people with the quasi totality originating in other African countries whereas the stock of emigrants was estimated at just about over one million with most of these emigrants found in the Sudan, United States and United Kingdom (IOM [2009]). More significant is internal migration with an estimated 6-7 percentage of the population living in areas other than those where they were born but still a low figure when compared to countries with high internal migration. In this migration context, forced migration is a very significant factor and most of this migration can be reconduced to one conflict: The Boko Haram conflict. The current number of IDPs is estimated at over 2.2 million, a very large number when compared to external migration and even to internal migration. The quasi-totality of these IDPs derived from the conflict between the Nigerian state and the Boko Haram insurgents in the North 10 East of the country. Nigeria has endured high degrees of violence in many parts of the country for decades, particulalry around the Niger delta in the South, but the conflict associated with Boko Haram has been the most intensive internal conflict to date in terms of number of deaths and displacement and certainly the most relevant for the recent period of time we observe. This conflict started to gain force around 2010, reached its peak in violence between 2014 and 2015 and continues to these days with frequent episodes. The household data we use is the General Household Survey (GHS) administered by the National Bureau of Statistics (NBS) of the Federal Government of Nigeria in collaboration with the World Bank Living Standards Measurement Study.8 Since 2010, this survey contains a panel component of 5,000 households administered during the post-planting and post-harvest seasons every other year. Each of these two components is a self-contained nationally and regionally representative survey and is administered in a different calendar year. This makes a total of six points in time with one point in time per calendar year between 2010 and 2016 (in 2015 the survey was not administered due to the degree of violence in the country). This is the period we consider, which fully captures the evolution of the Boko Haram conflict. We have four points in time before the peak in violence and two points in time after that. The panel is a fixed (non-rotating) panel where missing households are traced to their new addresses and interviewed when possible. This reduced the attrition rate at the end of the six rounds to 8.3 percent of households. The survey records the reason why households do not respond, as summarized in Table 1 for the case of the last visit in 2016. A third of the attrition is due to the impossibility of interviewing households living in crisis areas. In particular, 14 enumeration areas could not be visited in Borno and Yobe States at the peak of the conflict in 2015, which is potentially problematic for our analysis. In this paper, we do not use panel models because one of our key variables (risk preferences) was only measured during the last round in 2016. However, attrition could potentially bias our 2016 sample with respect to key variables and controls. We test therefore for attrition before describing the rest of the data. The original sample weights are adjusted by the national statiscal agency to account for attrition using household nonresponse rates by enumeration areas (we refer the reader to the Basic Information Document, p.21 and its Appendix 5 for further details, GHS, 2016). To test whether this correction restores the means of the original sample, we conducted means tests with the 2010 sample on all variables of interest between the full sample and the sample with attrition.9 Table 2 describes all relevant variables we use in the paper and Table 3 reports results of the means tests. The test shows no significant differences across all variables of interest with the exception of household size and the marital status of the head of the household. Both variables are only used as controls in the analysis and the difference in means between the two samples is very small. We do not expect these 8 All data and ancillary files can be found in the World Bank microdata library. For an example of the latest GHS survey see: http://microdata.worldbank.org/index.php/catalog/2734. 9 All covariates used in the paper are described in Table 2. 11 differences to affect our results. Instead, attrition could affect the prediction capacity of these variables when used together in a multivariate forms in models that include our key variables: expenditure per capita and risk aversion. The variables considered in Table 3 enter the models used to predict expenditure (equations 8 and 9) and to study the association between migration and risk aversion (equation 10) in a multivariate form. To test for non random attrition on per capita expenditure and risk-aversion respectively, we compare the Cumulative Distribution Functions (CDFs) of the predicted values of these variables (based on the predictors listed in Table 3) and test for stochastic dominance of first degree between the samples with and without attrition. This is a rather simple but effective test. We want to make sure that predictors are stable with respect to these variables whether we have attrition or not. If the predictions generated by the sample with attrition are different from those generated by the sample without attrition, our results would be biased. For per capita expenditure, we observe this variable all along the panel. We can there- fore use the 2010 full sample to make predictions and then compare the two distributions of predicted values for the sample with and without attrition. Results are shown in Table 4 and Figure 5 and show that there is no difference in the distributions of predicted values between the two samples. For risk-aversion, we observe this variable only in 2016. We assume that risk-aversion is time-invariant and we use individual characteristics in 2010 to predict the probability of being risk-tolerant for the full panel based on individuals left in the 2016 panel sample with attrition. Results are shown in Table 5 and Figure 6 and show again that there is no difference in the distribution of predicted values of risk-aversion between the overall sample and the sample with attrition. We can therefore conclude that attrition does not bias our results with respect to per capita expenditure and risk-aversion. The migration status of households is measured across the 774 Local Government Ar- eas (LGAs), the lowest administrative subdivision available in our data. We consider a household to have migrated if it changed LGA in any of the years considered. We consider three migration statuses for Equation 10. People who have migrated from areas with no conflict, people who have migrated from areas with some conflict (conflict in at least one year of the six years considered) and people who have migrated from areas with permanent conflict (conflict in all six years considered). Overall, we have 329 migrant households (6.6 percent of the sample) and 90.5 percent of these migrants migrated within one of the six geographical regions. Therefore, most of the migration observed is short range migration. To control for possible predictors of risk-aversion in equation 10, we include most known correlates of risk preferences. Risk-aversion has been found to be higher in women and lower in men or individuals with higher levels of testosterone (Eckel and Grossman, 2008; Apicella et al., 2008; Johnson et al., 2006), and higher for married and older individuals (Kuznets, 1964; Myrdal, 1968); it is associated with lower cognitive abilities (Dohmen et al., 2010), lower levels of skills, occupational status and income (Hartog et al., 2002). Risk preferences were also found to be genetic and transmissible through generations (Dohmen et al., 2011; Becker et al., 2016). Our models control for age, education, marital status and 12 occupational status. We do not have information on testosterone levels, genetic heritage or cognitive abilities but it can be argued that gender is a good proxy for testosterone levels whereas education and employment status are good proxies for cognitive abilities whereas we measure migration across small administrative areas where genetic diversity is not expected to be very large. Instead, income (proxied by expenditure in our data) is our measure of outcomes and should not be controlled for in the migration equation by design. The 2016 GHS also contains a battery of behavioral questions, which include questions on risk-aversion. To assess the degree of risk-aversion we used the following question: Suppose you want to invest some money. Which option do you prefer? A) Investing in a business where I can’t lose money but has low profits; B) Investing in a business where there is a chance I can lose money but potentially brings high profits. Empirical studies on migration and risk-aversion typically use as measures of risk-aversion a self-assessment measured on a scale from 0 to 10 (Bonin et al., 2009; Jaeger et al., 2010; Akg¨ uc¸ et al., 2016). These studies then argue that this scale is a proper measure of risk-aversion because it is closely associated with direct measurements of risk-aversion as shown by Jaeger et al. [2010] and Dohmen et al. [2011]. In our study, we do not need this extra verification as we have a question that measures risk preferences directly. The second data set we use is the The Armed Conflict Location & Event Data (ACLED). These data cover daily conflicts episodes in the country since 1997 and include information on the cause of conflict, perpetrators, casualties and fatalities. We will use the number of fatalities by LGA and match these data with the GHS panel data for the period 2010-2016. We then subdivide LGAs in areas that had no conflict (in any of the years considered), areas with some conflict (defined as areas with fatalities due to conflict in at least one of the years considered but less than six years) and areas always in conflict (defined as areas with fatalities due to conflict in all six years considered).10 The ACLED data set is not perfect as many episodes of violence are not reported or detected but the map of casualties constructed with these data shows correctly all the areas where the conflict was known to be intense (see Figure 7). 5 Results We report results following the order described in the empirical strategy. As a first step we estimate the distributions of outcomes. Table 6 reports coefficients and standard errors of the predicted model for households living in non-conflict areas (Equation 8).11 Next, 10 Data are available on-line and include codebooks, user guides, questionnaires and other relevant material (see http://www.acleddata.com/data/). 11 Note that the equation in Table 6 was also run without variables related to the quality of dwelling. This was to allow for the hypothesis that people may expect to live in makeshift shelters once they move. Results of this test showed that the R squared decreased by 4 percetage points whereas the following results on the dominance analysis remained intact. We also know that most migrants captured by the survey moved to accomodations not too dissimilar from their own homes with the exception of some of the IDPs hosted in 13 the estimated coefficients β ˆnc are used to predict the expenditure levels that households living in conflict areas could potentially have if they migrated to non-conflict-affected areas (Equation 9). We then use Duan’s smearing re-transformation to obtain levels of expendi- ture starting from the predicted log expenditures (Duan, 1983). As a second step, the observed distribution of expenditure per capita of people living in conflict areas is compared with the distribution of predicted values for the same people if they migrated to non-conflict areas. Figure 8 shows the results using the 2016 GHS data. It is clear that the two curves cross in one point splitting the samples into two parts. The left end side of the CDFs shows that the curve of predicted values (gains from leaving) dominates the curve of observed values (gains from staying). The QM model would therefore predict that maxmin (risk averse) individuals who focus on the left hand side of the distrbutions (low gains) would opt to leave. Instead, the opposite is true for the right end side of the CDFs. In this case, the QM model would predict that maxmax (risk tolerant) individuals who focus on the higher end of the distributions (high gains) would opt to stay. As shown in the right hand panel of Figure 8, these differences are statistically significant. Therefore, the QM model predicts maxmin (risk-averse) individuals to leave and maxmax (risk-tolerant) individuals to stay. As a third step, we test whether these QM predictions match reality. We test whether risk-averse (maxmin) individuals are effectively associated with migrating and risk-tolerant (maxmax) individuals are effectively associated with staying. Recall that migrants are defined as those households who have changed LGA at some point in time during the six panel waves and that risk-aversion is assessed using a question asked in a module of the 2016 GHS data as described in the data section. We test the association between migration and risk-tolerance for household heads controlling for a set of household socio-economic characteristics. Results are shown in Table 7.12 The association between migration and risk-tolerance is negative and significant in all three specifications of the equation. The three specifications differ in the dependent variable. In all three equations, “0” corresponds to migrants and “1” corresponds to non-migrants. However, the “1” changes in the three equations to capture different degrees of conflict in the area of origin. Equation (1) captures non-migrants irrespective of whether the area of origin had conflict in previous periods (2010-2014) or not. Equation (2) uses a sub-sample of non-migrants who originates from areas that had some conflict over the previous periods and Equation (3) reduces the sample to people in areas that had always conflict over the previous periods. Results show that the coefficient camps. Moreover, migrants are generally expected to overestimate rather than underestimate the benefits of migration. 12 Note that some of the covariates related to the quality of dwelling are dropped in Table 7 because of perfect prediction. The variable “language” is also not included in this table and the next because it almost perfectly predicts areas in conflict. For instance, people living in areas affected by the Boko Haram conflict speak almost invariably Kanun or Hausa whereas people living in violent areas in the South speak mostly Igbo. As one would expect, language is closely associated with location. 14 is always significant and that the size of the coefficient increases from Equation (1) to Equation (3). This shows that those who opt to migrate are the maxmin (risk averse) individuals and those who stay are the maxmax (risk tolerant) individuals as predicted by the QM model with the dominance analysis. It also shows that the positive association between staying and risk-tolerance is stronger for people who sustained longer spells of violence. As a fourth and last step, we test for a possible confounding factor. Recall that conflict could increase risk-tolerance as found elsewhere and that the lower risk-aversion that we find among stayers in Table 7 could be explained solely by the fact that stayers have experienced a longer period of violence as compared to leavers. To test for this confounding factor we carry out a Difference-in-Differences (DiD) test. Results are reported in Table 8 and show that conflict and the interaction between these two variables (the DiD coefficient) are both non significant. Note also that the low R-squared and the insignificance of the independent variables are further evidence that the covariates of risk-aversion in Table 7 are independent. We can therefore reject the hy- pothesis that conflict affects risk preferences and explains the difference in risk preferences between movers and stayers. 6 Conclusion The paper departed from standard utility theory under uncertainty positing that individ- uals living in conflict areas are quantile maximizers as opposed to expected utility maxi- mizers. In its most extreme form, quantile maximization implies that individuals tend to cluster into two groups, maxmin (risk-averse) and maxmax (risk-tolerant) individuals. We then proposed a four-step approach to test the model with empirical data: 1) Estimation of the distributions of outcomes (lotteries) for people living under conflict who decide to stay or leave; 2) Predictions of migration choices for people with different risk preferences (focusing on different quantiles of the distributions) based on the QM model; 3) Test whether the QM predictions match observed risk preferences among stayers and leavers; 4) Test for a potential cunfounding factor. Using household and conflict data from Nigeria, we found that the predictions of the QM model correctly identify stayers and leavers based on risk preferences. The CDFs of observed and predicted gains from staying and migrating crossed in one point. We also found that risk-averse individuals have a greater propensity to migrate whereas risk-tolerant individuals have a greater propensity to stay as predicted by the dominance analysis. This association is very significant and increases passing from general migration to migration from high conflict areas. We also tested for a possible confounding factor related to the possibility that conflict affects risk preferences and explains the observed difference between migrants and nonmigrants. We used a DiD estimator to test for this possibility and found no evidence that conflict affects risk-aversion. 15 How do these findings fare with the existing literature? In our knowledge, this is the first paper of its kind. We are not aware of another paper that attempted to use a similar model of decision making under conflict and then test the model empirically with household data. The closest work to our study is an unpublished experiment conducted on combatants and non combatants in conflict and non conflict zones of Aleppo (Syrian Arab Republic) in 2013 and 2014 (Whitt and Mironova, 2017). Using a variation of the Eckel and Grossman (2002, 2008) risk game, the authors find combatants and non-combatants in rebel-held conflict zones to be more risk-tolerant than other Syrian Arab Republicns in other parts of Syrian Arab Republic or Syrian Arab Republicns who fled abroad to Turkey. Therefore, the only other similar study we found would confirm our results on Nigeria. Instead, these findings are in stark contrast with the literature on economic migration and risk preferences (Jaeger et al., 2010 and Akg¨ c et al., 2016). There is a remarkable u¸ difference between risk-tolerant economic migrants and risk-averse forced migrants. The cause of flight (economic or flight from violence) is therefore a discriminant in the self- selection process of migrant populations. Forced migration in Nigeria seems to be driven by risk-aversion and a desire to protect one’s own minimum living standards as opposed to leaving in search of greater opportunities and rewards. These findings have important policy implications for the place of origin of migrants. Stopping or reducing conflict would be a necessary condition for most of the forced migrants to return home and, possibly, a sufficient condition if the place of origin guarantees basic needs. Indeed, most of the forced migrants tend to settle close to the place of origin in the expectation of going home when peace is restored. Evidence on economic migrants shows instead that this group is more likely to focus on growth opportunities and is ready to travel far and stay away for extended periods of time irrespective of conflict. This group would require substantial improvements in the place of origin to justify return, a condition that is rarely met in the short or medium term in a development context. These same findings have also important policy implications for the place of destination of migrants. Risk-averse forced migrants who tend to settle close to the place of origin and expect to return when conditions allow would be less inclined to take entrepreneurial risks and invest in the place of destination whereas they are more likely to endure subsistence types of livelihoods. Instead, economic migrants who are more risk tolerant and seek long- term opportunities in better off places would be less likely to endure subsistence conditions and more likely to travel to a further destination even if the journeys imply important risks. This has implications for the type of policies that host governments may want to adopt for economic migrants as opposed to forced migrants. When compared with results on economic migration, our results suggest that bundling economic and forced migrants into one class does not serve well neither the place of origin nor the place of destination. 16 References u¸ Mehtap Akg¨ c, Xingfei Liu, Max Tani, and Klaus Zimmermann. Risk atti- tudes and migration. China Economic Review, 37(C):166–176, 2016. URL https://EconPapers.repec.org/RePEc:eee:chieco:v:37:y:2016:i:c:p:166-176. M Allais. Le comportement de l’homme rationnel devant le risque: Critique des postulats et axiomes de l’ecole americaine. Econometrica, 21(4):503–546, 1953. Coren L. Apicella, Anna Dreber, Benjamin Campbell, Peter B. Gray, Moshe Hoffman, and Anthony C. Little. Testosterone and financial risk preferences. Evolution and Human Behavior, 29(6):384 – 390, 2008. ISSN 1090-5138. E.A. Beam, D. McKenzie, and D. Yang. Unilateral facilitation does not raise international labor migration from the philippines. Economic Development and Cultural Change, 64: 323–368, 2016. Anke Becker, Benjamin, and Enke Armin Falk. Ancient origins of the global variation in economic preferences. August 2016. Holger Bonin, Amelie Constant, Konstantinos Tatsiramos, and Klaus Zimmermann. Native-migrant differences in risk atti- tudes. Applied Economics Letters, 16(15):1581–1586, 2009. URL https://EconPapers.repec.org/RePEc:taf:apeclt:v:16:y:2009:i:15:p:1581-1586. Michael Callen, Mohammad Isaqzadeh, James D. Long, and Charles Sprenger. Violence and risk preference: Experimental evidence from afghanistan. The American Economic Review, 104(1):123–148, 2014. ISSN 00028282. Christopher P Chambers. An axiomatization of quantiles on the domain of distribution functions. Mathematical Finance, 19:335–342, 2009. Thomas Dohmen, Armin Falk, David Huffman, and Uwe Sunde. Are risk aversion and impatience related to cognitive ability? American Economic Review, 100(3):1238–60, June 2010. Thomas Dohmen, Armin Falk, David Huffman, Uwe Sunde, J¨ urgen Schupp, and Gert G. Wagner. Individual risk attitudes: Measurement, determinants, and behavioral conse- quences. Journal of the European Economic Association, 9(3):522–550, 2011. Naihua Duan. Smearing estimate: A nonparametric retransforma- tion method. Journal of the American Statistical Association, 78 (383):605–610, 1983. doi: 10.1080/01621459.1983.10478017. URL http://www.tandfonline.com/doi/abs/10.1080/01621459.1983.10478017. 17 Catherine C. Eckel and Philip J. Grossman. Sex differences and statistical stereotyping in attitudes toward financial risk. Evolution and human behavior, 23(4):281–295, 2002. Catherine C. Eckel and Philip J. Grossman. Men, women and risk aversion: Experimental evidence. Handbook of experimental economics results, 1(1):1061–1073, 2008. Catherine C. Eckel, Mahmoud A. El-Gamal, and Rick K. Wilson. Risk loving after the storm: A Bayesian-Network study of Hurricane Katrina evacuees. Journal of Economic Behavior & Organization, 69(2):110–124, February 2009. Daniel Ellsberg. Risk, ambiguity, and the savage axioms. The Quarterly Journal of Eco- nomics, 75(4):643–669, 1961. ISSN 00335533, 15314650. Milton Friedman and L. J. Savage. The utility analysis of choices involving risk. Jour- nal of Political Economy, 56(4):279–304, 1948. ISSN 00223808, 1537534X. URL http://www.jstor.org/stable/1826045. GHS. Basic information document - nigeria general household survey–panel 2015/16. Tech- nical report, National Bureau of Statistics and LSMS, December 8 2016. Joop Hartog, Ada Ferrer-i Carbonell, and Nicole Jonker. Linking measured risk aversion to individual characteristics. Kyklos, 55(1):3–26, 2002. ISSN 1467-6435. IOM. Migration in Nigeria: A Country Profile 2009. International Organization for Mi- gration, Geneva, Switzerland, 2009. David A. Jaeger, Thomas Dohmen, Armin Falk, David Huffman, Uwe Sunde, and Holger Bonin. Direct evidence on risk attitudes and migration. The Review of Economics and Statistics, 92(3):684–689, 2010. Dominic DP Johnson, Rose McDermott, Emily S Barrett, Jonathan Cowden, Richard Wrangham, Matthew H McIntyre, and Stephen Peter Rosen. Overconfidence in wargames: experimental evidence on expectations, aggression, gender and testosterone. Proceedings of the Royal Society B: Biological Sciences, 273(1600):2513–2520, 10 2006. Daniel Kahneman and Amos Tversky. Prospect theory: An analysis of decision under risk. Econometrica, 47(2):263–291, 1979. ISSN 00129682, 14680262. Eliakim Katz and Oded Stark. Labor migration and risk aversion in less developed coun- tries. Journal of Labor Economics, 4(1):134–49, 1986. J. Kenna and R. Walker. Modeling individual migration decision. In A.F. Constant and F. Zimmermann, editors, Internal Handbook on the Economics of Migration. Edward Elgar, 2013. 18 J. S. Lerner and D Keltner. Fear, anger, and risk. Journal of Personality and Social Psychology, 81(1):146–159, 2001. Ulrike Malmendier and Stefan Nagel. Depression babies: Do macroeconomic experiences affect risk taking? The Quarterly Journal of Economics, 126(1):373–416, 2011. Vera Mironova and Sam Whitt. Risk tolerance during conflict: Evidence from aleppo, syria. Under Review, 2017. Gunnar Myrdal. Asian Drama: An Inquiry into the poverty of nations. Allen Lane The Penguin Press, 1968. Marzena Rostek. Quantile maximization in decision theory. The Review of Economic Studies, 77(1):339–371, 2010. doi: 10.1111/j.1467-937X.2009.00564.x. M. Rothschild and J. Stiglitz. Increasing risk: I. a definition. Journal of Economic Theory, 2:225–243, 1970. Kuznets Simon. Economic growth and the contribution of agriculture: Notes for measure- ment. In C. Eicher and L. Witt, editors, Agriculture in Economic Development. New York: McGraw-Hill, 1964. Oded Stark and David Levhari. On migration and risk in ldcs. Economic Devel- opment and Cultural Change, 31(1):191–196, 1982. doi: 10.1086/451312. URL https://doi.org/10.1086/451312. Michael P. Todaro. A model of labor migration and urban unemployment in less developed countries. The American Economic Review, 59(1):138–148, 1969. ISSN 00028282. URL http://www.jstor.org/stable/1811100. Maarten J. Voors, Eleonora E. M. Nillesen, Philip Verwimp, Erwin H. Bulte, Robert Lensink, and Daan P. Van Soest. Violent conflict and behavior: A field experiment in Burundi. American Economic Review, 102(2):941–64, April 2012. 19 List of Figures Figure 1: Number of Forcibly Displaced Persons (1951-2016) 65 60 55 50 45 40 Millions 35 30 25 20 15 10 5 0 1951 1956 1961 1966 1971 1976 1981 1986 1991 1996 2001 2006 2011 2016 Source: Authors’ elaboration based on UNHCR Data. Figure 2: Two lotteries x and y , where y is a mean preserving spread of x 1/4  1     1/2 ¨B2 ¨  1/4 ¨B3 ¨ ¨¨  ¨¨ xr ¨¨ ¨¨ 1/4 yr   E7 rr 1/2 rr1/4 r r rr8 j rj9 r 20 Figure 3: Probabilities and outcomes associated to two lotteries x and y 1/4  1   1/4  2        1/4B3 ¨ ¨  1/4B4 ¨ ¨¨ ¨¨   ¨ 1/4   ¨ x¨   r E7 ¨¨ 1/4 yr   E6 rr1/4 rr 1/4 r r j9 r r rj8 r Figure 4: Cumulative Distribution Function of x(blue) and y (red) T t t 1 3 t t d d 4 1 t t d d 2 1 t t d d 4 d d E 0 1 2 3 4 6 7 8 9 21 Figure 5: Cumulative distribution functions of predicted values for per capita expenditure, first visit Source: Authors’ estimations based on GHS Data 22 Figure 6: Cumulative distribution functions of predicted values for risk-tolerance Source: Authors’ estimations based on GHS Data 23 Figure 7: Incidence of Fatalities by Local Government Areas, 2011-2016 2011 2012 2013 200 - 1000 100 - 200 50 - 100 10 - 50 1 - 10 2014 2015 2016 0-1 No Fatalities Source: Authors’ elaboration based on ACLED Data (http://www.acleddata.com/data/) 24 Figure 8: Cumulative distribution functions of households living in conflict areas Source: Authors’ estimations based on GHS and ACLED Data 25 List of Tables Table 1: Household interview status during the last visit Observations Frequency Interviewed (in main survey phase) 4,534 90.68 Tracked (and interviewd) 48 0.96 Refused 47 0.94 Not found 100 2.00 Dead 84 1.68 Moved away (not tracked) 48 0.96 Crisis area 139 2.78 Total 5,000 100 Total number of migrants 329 6.6 Source: Authors’ estimations based on GHS Data 26 Table 2: Description of variables used in Equation 8 Variable label Description ln exp log expenditure Per capita Household Expenditure expressed in Loga- ritm. This variable is the expenditure aggregate con- structed by the World Bank for the purpose of poverty measurement hhsize Household Size Number of Household Members living in the household at the moment of the interview demo sh depentants Share of Dependent Members Dependent household members, defined as individuals aged less than 15 and over 64 living in the household, as a share of total household members state State Categorical variable where each level is one of the 36 Nigerian States plus the Federal Capital Territory of Abuja: Abia; Adamawa; Akwa Ibom; Anambra; Bauchi; Bayelsa; Benue; Borno; Cross-rivers; Delta; Ebonyi; Edo; Ekiti; Enugu; Gombe; Imo; Jigawa; Kaduna; Kano; Katsina; Kebbi; Kogi; Kwara; Lagos; 27 Nassarawa; Niger; Ogun; Ondo; Osun; Oyo; Plateau; Rivers; Sokoto; Taraba; Yobe; Zamfara; FCT Abuja lga LGA Categorical variable where each level is one Local Gov- ernment Areas. Of the total 774 LGAs Nigeria is di- vided into, the dataset contains observations from 467 LGAs. rururb Type of Settlement Type of settlement (1=rural; 0=urban) hh sex HH Gender Gender of the Household Head (1=male; 0=female) hh agey HH Age Age of the household head hh eduyrs HH Year of Education Number of years of education attained by the house- hold head hh empl HH Employed Status in employment of the household head (1=em- ployed; 0=unemployed or inactive) hh marstat HH Married Marital Status of the household head (1=married; 0=Divorced, separated, widowed, or never married) hh language HH Principal Language Principal language spoken in the household (1=Hausa; 2=Igbo; 3=Yoruba; 4=Other) Table 2: Description of variables used in Equation 8 (cont.) Variable Label Description dwel rooms Number of Rooms Number of habitable rooms in the dwelling dwel roof Quality of Roof Quality of the material used for the dwelling’s roof (1=Low quality, 1=Medium quality , 3=High quality) dwel wall Quality of Wall Quality of the material used for the dwelling’s walls (1=Low quality, 1=Medium quality , 3=High quality) dwel floor Quality of Floor Quality of the material used for the dwelling’s floors (1=Low quality, 1=Medium quality , 3=High quality) dwel toilet Type of Toilet Type of main toilet facility (1=No facility, 2=Flush toilet, 3=Improved pit latrine, 4=Uncovered pit la- trine, 5=Other) dwel fuellight Main source of Lighting Main source of lighting (1=Electricity/gas, 2=Kerosene, 3=Candles/battery, 4=Other) dwel fuelcook Main heat source for cooking Main heat source for cooking (1=Firewood, 2=Char- 28 coal, 3=Kerosene/oil, 4=Electricity/gas, 5=Other) dwel gdisp Type of garbage disposal Type of garbage disposal (1=In bins collected by pub- lic service; 2= Disposal in compound; 3=Discarded in empty lots/streets/rivers) own tv TV Household owns a TV own fridge Fridge Household owns a Refrigerator own stove Stove Household owns a Stove own bcycle Bicycle Household owns a Bicycle own car Car Household own a Car own iron Iron Household own an Iron conflict Conflict Dummy variable equal to 1 if in the Local Government Area at the time of GHS survey there have been fa- talities due to conflict events as reported in the Acled Dataset Table 3: Descriptive Statistics Sample with Overall F Prob > F Attrition Sample Social and Economic Variables Household Size 5.491 5.365 3.361 0.067 Share of Dependent Members 0.432 0.433 0.038 0.845 Type of Settlement (Rural==1) 0.400 0.409 0.535 0.465 HH Gender (1=Male) 0.851 0.843 0.888 0.346 HH Age 49.578 49.602 0.004 0.947 HH Years of Education 6.545 6.542 0.000 0.985 HH Employed 0.916 0.910 0.740 0.390 HH Married 0.805 0.790 2.520 0.112 HH Gender * HH Married 0.788 0.771 2.613 0.106 HH Language . . . . Hausa 0.280 0.267 1.383 0.240 Igbo 0.174 0.173 0.028 0.867 Yoruba 0.238 0.252 1.549 0.213 Other 0.308 0.308 0.002 0.966 Quality of Dwelling Number of Rooms 3.440 3.386 1.140 0.286 Quality of Roof . . . . Low 0.176 0.184 0.781 0.377 Medium 0.808 0.799 0.867 0.352 High 0.016 0.017 0.061 0.805 Quality of Wall . . . . Low 0.415 0.417 0.016 0.901 Medium 0.100 0.098 0.116 0.734 High 0.485 0.486 0.006 0.938 Quality of Floor . . . . Low 0.103 0.108 0.491 0.483 Medium 0.191 0.189 0.078 0.780 High 0.706 0.703 0.054 0.816 Type of Toilet . . . . No Facility 0.184 0.185 0.026 0.871 Flush Toilet 0.188 0.191 0.059 0.809 Improved Pit Latrine 0.026 0.026 0.001 0.972 Uncovered Pit Latrine 0.476 0.467 0.550 0.458 Other 0.126 0.131 0.474 0.491 Main Source of Lighting . . . . Electricity/Gas 0.353 0.349 0.118 0.731 29 Kerosene 0.371 0.371 0.004 0.949 Candles/Batterty 0.134 0.133 0.009 0.924 Other 0.142 0.147 0.474 0.491 Main heat source for cooking . . . . Firewood 0.659 0.645 1.346 0.246 Charcoal 0.009 0.010 0.185 0.667 Kerosene/Oil 0.288 0.293 0.182 0.670 Electricity/Gas 0.016 0.017 0.105 0.745 Other 0.029 0.036 3.023 0.082 Type of Garbage disposal . . . . Bins 0.146 0.150 0.164 0.685 Compound 0.329 0.317 1.385 0.239 Empty Lots/streets/rivers 0.525 0.534 0.505 0.478 Assets TV 0.428 0.427 0.012 0.915 Fridge 0.195 0.194 0.012 0.912 Stove 0.494 0.499 0.182 0.669 Bicycle 0.191 0.187 0.163 0.686 Car 0.098 0.097 0.031 0.861 Iron 0.389 0.390 0.001 0.971 Source: Authors’ estimations based on GHS Data 30 Table 4: Per Capita Expenditure Prediction Models, overall sample and sample with attrition (OLS) Sample with Attrition Overall Sample (1) (2) (3) (4) VARIABLES coef se coef se Social and Economic Variables Household Size -7,941*** -482.8 -8,151*** -465.2 Share of Dependent Members -26,090*** -4,074 -26,289*** -3,963 State Fixed Effect yes Type of Settlement (Rural=1) 2,011 -3,112 2,841 -3,000 HH Gender (1=Male) 23,215*** -6,538 20,081*** -5,794 HH Age 27.81 -66.04 20.13 -65.94 HH Years of Education 1,149*** -184.4 1,303*** -181.8 HH Employed 288.9 -4,596 271.3 -4,579 HH Married 6,884 -11,319 1,742 -10,388 HH Gender * HH Married -38,980*** -12,082 -32,311*** -10,501 HH Language (baseline= Hausa) Igbo -5,250 -7,139 -6,390 -6,910 Yoruba -3,696 -5,314 -2,704 -5,409 Other -1,258 -3,195 -2,030 -3,169 Quality of Dwelling Number of rooms 1,705*** -504.3 1,753*** -468.4 Quality of Roof (baseline=Low) Medium 6,209*** -2,269 4,346** -2,168 High -2,701 -10,023 -6,430 -8,746 Quality of Wall (baseline=Low) Medium 225.2 -2,976 -507.8 -2,833 High 1,321 -2,605 941.9 -2,619 Quality of Floor (baseline=Low) Medium -2,200 -2,597 -1,068 -2,545 High 573.7 -2,645 1,534 -2,737 Type of Toilet (baseline=No Facility) Flush Toilet 16,712*** -4,927 17,853*** -5,157 Improved Pit Latrine -4,038 -5,750 -1,774 -6,563 Uncovered Pit Latrine 4,931* -2,705 5,148* -2,658 31 Other 5,618* -3,325 5,083 -3,291 Main source of lighting (baseline=Electricity/Gas) Kerosene 460.7 -2,876 1,533 -2,835 Candles/Batteries 2,693 -3,471 2,451 -3,373 Other -1,926 -2,882 -1,960 -2,692 Main heat source for cooking (baseline=Firewood) Charcoal -3,786 -6,211 -1,354 -5,766 Kerosene/Oil 7,966*** -2,853 6,802** -2,796 Electricity/Gas 32,053** -12,680 32,040*** -12,194 Other 13,071* -6,961 12,845** -5,687 Type of garbage disposal (baseline=Bins) Compound -6,878 -4,986 -10,325** -5,095 Empty Lots/streets/rivers -6,616 -5,013 -9,727* -5,054 Assets TV 8,951*** -2,368 10,813*** -2,282 Fridge 3,858 -3,024 2,408 -3,106 Stove 6,867*** -2,470 6,178*** -2,366 Bicycle 1,321 -2,063 1,776 -2,046 Car 19,695*** -4,281 19,015*** -4,147 Iron 6,645*** -1,888 7,761*** -1,945 Constant 120,075*** -14,477 124,696*** -15,740 Observations 4,545 4,925 R-squared 0.453 0.462 Source: Authors’ estimations based on GHS Data 32 Table 5: Logit Model on the probability of being risk-averse (0-risk-tolerant 1-risk-averse) (1) (2) VARIABLES coef se Social and Economic Variables Per Capita Expenditure -1.79e-06** (8.92e-07) Household Size 0.00192 (0.0180) Share of Dependent Members -0.116 (0.174) State Fixed Effect yes Type of Settlement (Rural=1) 0.0186 (0.120) HH Gender (1=Male) -0.458** (0.205) HH Age -0.00376 (0.00306) HH Years of Education 0.000545 (0.0107) HH Employed -0.558*** (0.153) HH Married -0.669 (0.428) HH Gender * HH Married 0.871** (0.437) HH Language (baseline= Hausa) Igbo -0.33 (0.285) Yoruba -0.0613 (0.258) Other -0.0399 (0.162) Quality of Dwelling Number of Rooms -0.036 (0.0232) Quality of Roof (baseline=Low) Medium -0.102 (0.142) High -0.463 (0.344) Quality of Wall (baseline=Low) Medium 0.294* (0.173) High -0.00575 (0.137) Quality of Wall (baseline=Low) Medium 0.0472 (0.168) High 0.098 (0.165) Type of Toilet (baseline=No Facility) Flush Toilet -0.147 (0.207) Improved Pit Latrine 0.0731 (0.244) Uncovered Pit Latrine -0.19 (0.139) 33 Other -0.133 (0.174) Main source of lighting (baseline=Electricity/Gas) Kerosene -0.0824 (0.118) Candles/Batteries 0.0745 (0.173) Other -0.386** (0.164) Main heat source for cooking (baseline=Firewood) Charcoal 1.286** (0.592) Kerosene/Oil 0.12 (0.142) Electricity/Gas -0.111 (0.397) Other 0.392* (0.232) Type of garbage disposal (baseline=Bins) Compound 0.12 (0.175) Empty Lots/streets/rivers 0.0721 (0.165) Assets TV -0.129 (0.122) Fridge 0.0795 (0.140) Stove -0.135 (0.119) Bicycle 0.143 (0.104) Car 0.0267 (0.162) Iron -0.0684 (0.116) Constant 1.436*** (0.522) Observations 4,545 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 Source: Authors’ estimations based on GHS Data 34 Table 6: Expenditure Prediction Model (Logaritm of total per-capita household expenditure) for households living in non-conflict areas (OLS) (1) (2) VARIABLES coef se Social and Economic Variables Household Size -0.0949*** (-0.00309) Share of Dependent Members -0.286*** (-0.0247) State Fixed Effect yes Type of Settlement (Rural=1) 0.0332 (-0.0233) HH Gender (1=Male) 0.201*** (-0.0289) HH Age 0.00313*** (-0.000531) HH Years of Education 0.0236*** (-0.00162) HH Employed 0.128*** (-0.0179) HH Married -0.0806 (-0.0529) HH Gender * HH Married -0.189*** (-0.0601 ) HH Language (baseline= Hausa) Igbo 0.0117 (-0.0516) Yoruba -0.0349 (-0.0473) Other -0.0325 (-0.0358) Quality of Dwelling Number of Rooms 0.0346*** (-0.00347) Quality of Roof (baseline=Low) Medium 0.0656*** (-0.0191) High 0.0231 (-0.0536) Quality of Wall (baseline=Low) Medium 0.00103 (-0.0256) High -0.0255 (-0.0163) Quality of Floor (baseline=Low) Medium 0.0343 (-0.0257) High 0.125*** (-0.027) Type of Toilet (baseline=No Facility) Flush Toilet 0.234*** (-0.0266) Improved Pit latrine -0.0628 (-0.0563) Uncovered Pit Latrine 0.0431** (-0.0209) 35 Other 0.0262 (-0.03) Main source of lighting (baseline=Electricity/Gas) Kerosene -0.0832*** (-0.016) Candles/Battery -0.0583*** (-0.0222) Other -0.122*** (-0.0234) Main heat source for cooking (baseline=Firewood) Charcoal 0.0625 (-0.0486) Kerosene/Oil 0.130*** (-0.0197) Electricity/Gas 0.283*** (-0.0392) Other 0.339*** (-0.0501) Type of garbage disposal (baseline=Bins) Compound -0.0722*** (-0.0233) Empty Lots/streets/rivers -0.0560*** (-0.0191) Assets TV 0.0408*** (-0.0155) Fridge 0.0259 (-0.0167) Stove -0.00359 (-0.0161) Bicycle -0.0800*** (-0.015) Car 0.0593*** (-0.0208) Iron 0.00704 (-0.0134) Constant 11.33*** (-0.0847) Observations 21,271 R-squared 0.473 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 Source: Authors’ estimations based on GHS and ACLED Data 36 Table 7: Association between migration and risk-aversion (Probit) (1) (2) (3) VARIABLES 0 = HH migr. 0 = HH migr. 0 = HH migr. 1 = HH not migr. 1 = HH not migr. 1 =HH not migr. and some conflict and always conflict Behavioural Risk Aversion -0.179** -0.363*** -0.400** Social and Economic Variables Household Size 0.0324** 0.0465** 0.153** Share of Dependent -0.398*** -0.593*** -0.00753 HH Gender = Male 0.104 0.201 0.209 HH Age 0.0158*** 0.0188*** 0.0165*** HH Years of Education -0.0273*** -0.0369*** -0.0426* HH Employed 0.364*** 0.503*** 0.877*** HH Married 0.188 0.0615 1.192*** HH Gender * HH Married -0.21 -0.221 -1.532** Quality of Dwelling Number of Rooms 0.0614* 0.0527 -0.0260 Quality of Walls (baseline=Low) Medium -0.175 -0.314 0.509* High -0.444*** -0.564*** -0.254 Assets TV -0.0738 -0.0483 0.382 Fridge 0.078 0.259* -0.527 Stove -0.273*** -0.372*** -0.939** Bicycle 0.354*** 0.408* 0.368 Car -0.280** -0.486*** -0.319 Iron 0.0453 0.113 -0.260 Constant 0.819*** 0.0513 -2.585*** Observations 4,577 1,289 359 Pseudo R-squared 0.141 0.19 0.357 Robust standard errors (not reported) *** p<0.01, ** p<0.05, * p<0.1 Source: Authors’ estimations based on GHS and ACLED Data 37 Table 8: Difference in Differences Test (OLS) (1) (2) Risk Aversion Risk Aversion Migration Group (1=never migrated) -0.105*** -0.0873** Conflict (1=some conflict) -0.049 -0.0501 did -0.0124 -0.00969 Social and Economic Variables Household Size 0.000533 Share of Dependent 0.0426 HH Gender = Male -0.0172 HH Age -0.000869 HH Years of Education -0.00298 HH Employed -0.023 HH Married -0.055 HH Gender * HH Married 0.0738 Quality of Dwelling Number of Rooms -0.0146*** Quality of Walls (baseline=Low) Medium 0.0649* High 0.0128 Assets TV 0.0187 Fridge 0.0147 Stove -0.0222 Bicycle -0.0406* Car 0.00511 Iron 0.0198 Constant 0.820*** 0.905*** Observations 4,577 4,577 R-squared 0.006 0.016 Robust standard errors (not reported) *** p<0.01, ** p<0.05, * p<0.1 Source: Authors’ estimations based on GHS and ACLED Data 38