Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

Predictive models for charitable giving using machine learning techniques

  • Leily Farrokhvar ,

    Roles Conceptualization, Investigation, Methodology, Project administration, Resources, Validation, Writing – review & editing

    leily.farrokhvar@mail.wvu.edu

    Affiliation Department of Industrial and Management Systems Engineering, West Virginia University, Morgantown, West Virginia, United States of America

  • Azadeh Ansari,

    Roles Data curation, Formal analysis, Writing – original draft

    Affiliation Department of Industrial and Management Systems Engineering, West Virginia University, Morgantown, West Virginia, United States of America

  • Behrooz Kamali

    Roles Conceptualization, Supervision, Validation, Writing – review & editing

    Affiliation Department of Industrial and Management Systems Engineering, West Virginia University, Morgantown, West Virginia, United States of America

Abstract

Private giving represents more than three fourths of all U.S. charitable donations, about 2% of total Gross Domestic Product (GDP). Private giving is a significant factor in funding the nonprofit sector of the U.S. economy, which accounts for more than 10% of total GDP. Despite the abundance of data available through tax forms and other sources, it is unclear which factors influence private donation, and a reliable predictive mechanism remains elusive. This study aims to develop predictive models to accurately estimate future charitable giving based on a set of potentially influential factors. We have selected several factors, including unemployment rate, household income, poverty level, population, sex, age, ethnicity, education level, and number of vehicles per household. This study sheds light on the relationship between donation and these variables. We use Stepwise Regression to identify the most influential variables among the available variables, based on which predictive models are developed. Multiple Linear Regression (MLR) and machine learning techniques, including Artificial Neural Networks (ANN) and Support Vector Regression (SVR) are used to develop the predictive models. The results suggest that population, education level, and the amount of charitable giving in the previous year are the most significant, independent variables. We propose three predictive models (MLR, ANN, and SVR) and validate them using 10-fold cross-validation method, then evaluate the performance using 9 different measuring criteria. All three models are capable of predicting the amount of future donations in a given region with good accuracy. Based on the evaluation criteria, using a test data set, ANN outperforms SVR and MLR in predicting the amount of charitable giving in the following year.

Introduction

Charities are Non-Profit Organizations (NPO) focused on humanitarian and social issues [1]. The NPOs are listed as tax-exempt organizations which cannot benefit people or other corporations [2]. According to the National Center for Charitable Statistics, there are 1,406,820 tax-exempt organizations in the United States, including 945,415 public charities. Charitable contributions include cash and non-cash gifts [3]. Over two-thirds of households making a significant portion of charitable giving in the United States announce their own giving amount [4]. Americans recently assigned approximately 2% of their disposable income to charitable goals in 2010, a value unchanged over 5 decades before that [5]. Over three-fourths of all contributions come from private or individual donors earn more than 2% of total U.S. Gross Domestic Product (GDP). Their contribution is essential to the nonprofit sector of the U.S. economy [5].

The National Center for Charitable Statistics stated at there were $1.59 trillion in total revenues and $1.49 trillion in total expenses reported by public charities in 2011 [6]. Contributions and government grants made up 22% of the total revenue while program service revenues and other resources made 72% and 6% of the revenue, respectively [1]. Charitable giving has had an upward trend since 1975 and in 2014, it exceeded the peak level right before the recession in 2008 [5]. It was estimated that charitable giving would increase to $358 billion in 2015 from $298 billion in 2011. Individual donors are the most common sources of charitable giving [7]. There is growing competition over time, money, and resources among non-profit organizations, so there is a need to predict donations [8]. It has always been debated how and when donors decide to donate, and many factors are identified in the literature that can affect decisions [9].

The factors that have been reported to drive charitable giving are awareness of need [10], solicitation [11,12], mode of communication [13], donor characteristics [14] including age [4,15, 16], sex [17], ethnicity [4,17], personal income [4,15] and tax itemizing [15], education [4,15], volunteerism/civic-minded donor [4,18], moral norms/values & guilt [11, 12], religiosity [4, 16, 19, 20], obligation (wealthy) [21] or lack of family need [11], and attitudes toward charitable organizations [22], historical data on donation [16], size of request [23], altruism [10, 11], reputation of individual/charity [24], psychological benefits [10], and efficacy [10]. Bekkers and Wiepking [25] performed an extensive literature review on how age, education, religion, and solicitation are correlated with the amount of charitable giving. They stated that most of the previous works had found that education and age have positive relationship with the amount of donation and investigated the effects of different characteristics of religion and solicitation on the giving amount. Wiepking and Bekkers [26] completed their literature review and investigated how sex, family composition, and income are related to charitable giving. They claimed that there is strong evidence showing a positive relationship between income and charitable giving value.

In this manuscript, we aim to develop models capable of predicting levels of charitable giving using measurable and readily available variables describing donor characteristics for different zip codes that are most closely associated with charitable giving on a national level. We validated the models using a U.S. nationwide dataset and analyzed the results.

Methodology

Giving and demographic data

In this section, we describe the process of collecting and preparing the initial set of variables that are used to identify the factors with the most predictive value for the amount of charitable giving. The following demographic data from 2010 U.S. census is gathered for a sample of 9410 U.S. zip codes, each of which includes at least one NPO with charitable contributions in 2014 and 2015: (a) % unemployment, (b) median household income, (c) % of persons living at/below the poverty line, (d) population, (e) sex, (f) age, (g) ethnicity (% non-whites), (h) education (% college graduates), and (i) % households with 2 vehicles or more. This data was collected using Zip Atlas (www.zipatlas.com), which is a structured collection of zip code, area code, city and state demographic, social, and economic profiles. We also included the previous year’s total contribution amount for each zip code as a potential predictive variable. The data regarding charitable contributions to NPOs was obtained by reviewing U.S. federal tax forms (990 and 990EZ) for two consecutive years (2014 and 2015). All the organizations that have some portion of their income exempted from tax are required by the IRS to fill out one of these forms, including organizations that received charitable contributions and grants. These forms are publicly available on the Internal Revenue Service website (IRS; www.irs.gov). Overall, about 25,882 organizations were reviewed for 2014 and 28,517 for 2015. To have consistent scope for all the data elements, we aggregated the contribution data at the zip code level. All the variables are defined in Table 1. The values or ratios are all collected for each specific zip code.

As the donation data set is skewed, Tukey’s [27] outlier detection method was used to identify the outliers. To be more conservative, all data points falling outside the 3 Interquartile Range (IQR) were identified as outliers and Winsorized [27]. Winsor proposed to replacing the value of the potential outliers by the highest value that is not considered an outlier in the data set instead of removing them. To have the same scale for all the variables and coeffiecients, all the variables were normalized using equation.

(1)

In equation, X and XN denote the original and normalized data, respectively. Each entity in the data set is normalized using its own minimum and maximum values shown by Min{X} and Max{X}, respectively.

In this pre-processing step, aggregated contribution data associated with 131 zip codes were Winsorized as they were beyond the 3 IQR limits. All 9410 rows of data were then normalized using equation. Table 2 summarizes the descriptive statistics on all the variables including 9 demographic variables and 2014 and 2015 donation data.

The correlation matrix in Table 3 shows Pearson’s correlation coefficients between all the variables, including the dependent variable. Pearson’s correlation coeficients show how well these variables are linearly related to each other and the output.

Following data pre-processing, Stepwise Regression was applied to the entire data set to identify the most significant variables among 9 demographic variables and one variable for donation amounts in 2014 for 9,410 U.S. zip codes as the input variables to predict donation amounts for 2015.

Stepwise regression is a semi-automated process of model building by successively adding or removing variables based on the t-statistics of their estimated coefficients. This technique starts with a null model and adds the variable with the lowest p-value and continues with forward selection and backward elimination until it reaches to the point where no variable can be added or removed. The alpha-to-enter and alpha-to-remove are often selected as a value between 5% to 15%. We used 5% and 10% as the significant levels for the forward selection and backward elimination in our analysis, respectively.

All data analysis was done in MATLAB environment using a computer with an Intel® Core i5 CPU 2.20 GHz processor and 8 GB RAM. The analysis is described in detail in the following section. The result of the stepwise regression is shown in Table 4.

The stepwise regression selected these four variables to build the predictive models: average household income, population, percentage of college graduates, and amount of charitable giving in the previous year. However, the correlation matrix shows that the average household income has high collinearity with the percentage of college graduates. It is crucial to avoid multicollinearity as it can cause unstable coefficient estimation. Variance Inflation Factor (VIF) is a criterion that measures the multicollinearity between variables. The calculation of VIF for βi estimate is shown in equation.

(2)

In equation, is the coefficient of determination of the regression equation with Xi {\displaystyle X_{i}}on the left hand side, and all other independent variables on the right hand side. The minimum value of VIF is 1, which indicates a model with completely independent variables. A VIF greater than 5 is considered high and can be an evidence of multicollinearity. Table 5. VIF measurement for independent variables Table 5 shows the values of VIF for all independent variables in our model. As we suspected, average household income has a relatively high VIF, meaning that it can be removed from the model without affecting the prediction.

In the results from stepwise regression, giving in the previous year (2014) had the most significant effect, followed by percentage of college graduates. To summarize, the most significant variables among all 10 tested variables were population, percentage of college graduates, and amount of charitable giving in the previous year. Table 5 shows no collinearity in the model with these three variables.

Predictive modelling approaches

We used the most significant variables identified in the previous section to develop three predictive models based on Multiple Linear Regression (MLR), Artificial Neural Networs (ANN), and Support Vector Regression (SVR). All these techniques use input variables to predict future values of a dependent variable. Their main difference is the way they calculate the weights on the connections between input nodes and output nodes. MLR technique was described in the previous section. Here we describe ANN and SVR approaches in more details.

In MLR, the relationship between two or more explanatory (i.e., independent) variables and a response (i.e., dependent) variable is modeled by fitting a linear equation to the learning data. Table 6 shows a list of notations for MLR technique.

The general form of the MLR model is shown in equation.

(3)

In equation, β0 is the intercept and β1,β2,…,βP are the corresponding coefficients for independent variables, which will be obtained using the generalized least square method. The error term of the model is denoted by ε and the output or dependent variable is shown as Y.

The ANN model uses an algorithm to train the network and assign weights to connections between nodes in input layer, hidden layer(s), and output layer. This algorithm can be Back Probagation, Feed Forward, or Feed Backward. In this work, we use a Feed Forward Neural Network (FFNN) which has been widely used in forecasting applications [28]. Table 7 summarizes all the notatios used to explain ANN. All neural network models start with an initial random weight for each connection and improve the weights in their learning process to better predict the output. This process continues until it gets close enough to the output, which is determined by a threshold value.

Fig 1 shows the structure of a one-hidden layer neural network. Each connection is assigned a weight determined by the activation function of the neural networks. The weight of the connection from input node to the hidden node is denoted as wih and the weight of the connection between hidden node and output node is denoted as wh.

thumbnail
Fig 1. One-hidden layer artificial neural networks model.

https://doi.org/10.1371/journal.pone.0203928.g001

With only a few minor differences the SVR uses the same principles as the Support Vector Machine (SVM) for classification which was first developed by Vapnik and Lerner [29]. SVR uses the same concepts to train the model for prediction purposes. The notation used for describing SVR is shown in Table 8.

In SVR, a training data set is introduced as {(x1,t1),…,(xj,tj),…,(xm,tm)}, where xjϵRn and tjϵR represent the input vector and the target value, respectively. The main purpose is to obtain a function f(x) that has less deviation than the maximum acceptable deviation ϵ from the actual target tj for all data points in the training data set. SVR aims to determine this function such that the bias and variance trade-offs can be met. This function should be neither so as complex to cause an over-fitting problem nor so simple that it lacks the capability to capture the patterns. To avoid aforementioned problems, a proper value of C is crucial. The maximum deviation from the target value is denoted by ϵ shown in Fig 2. The points outside the maximum deviation region are assigned a positive or negative deviation depending on their location.

The regression function is defined in equation. In this expression, b is the intercept and are the weights attained from SVR. To have a simpler function, smaller sizes of are recommended.

(4)

The loss function defined in equation assigns a penalty if the point is outside the range of the predefined deviation size of ϵ shown in Fig 2, which assigns a cost “C” for the points outside of the predefined range.

(5)

Equation shows the objective function, which assigns a penalty cost, C, if the point is outside of the acceptable range. Equation shows the set of constraints in the SVR model.

(6)

Subject to: (7)

Where and are positive and negative deviations from acceptable region for the mth pattern, respectively.

Results

Using MATLAB, the 9410 data rows (zip codes) were randomly divided into 2 separate and exclusive data sets: test (20%) and training (80%). The models were developed using the training data set and tested using the test data set, which was not included in the training part. A 10-fold cross validation technique was used for model validation.

Multiple linear regression

MLR was applied to the training data set with the selected 3 input variables identified as the most significant variables. Table 9 shows a summary of the results from the MLR technique.

The Analysis of Variance (ANOVA) table for this model shown in Table 10 confirms that this model is significant at 5% significance level.

This model was tested using the test data set. Fig 3 shows the predicted versus actual donations in 2015 in a normalized scale using MLR technique.

thumbnail
Fig 3. Predicted vs. actual 2015 donations using MLR on training and test data sets.

https://doi.org/10.1371/journal.pone.0203928.g003

Artificial neural networks

The same training data set was used to train and build a model using ANN. This model has 1 hidden layer. The number of neurons in the hidden layer varies from 2 to 10, demonstrating that 2 neurons minimized the percentage of residual variance. In this study, we considered 1 hidden layer. The network with 1 hidden layer and 2 neurons in that single hidden layer gives the best ANN model among all the tested networks with learning rate of 0.001.

Table 11 shows the characteristics of our ANN model. Logistic and Linear activation functions have been adopted for the input and hidden layers, based on a trial and error, to produce less error.

The predicted versus actual donations in 2015 in a normalized scale using the ANN technique is shown in Fig 4.

thumbnail
Fig 4. Predicted vs. actual 2015 Giving using ANN on training and test data sets.

https://doi.org/10.1371/journal.pone.0203928.g004

Support vector regression

SVR is the the other machine learning technique that we use to build a predictive model for 2015 donations. The same training and test data sets are used in this approach. Table 12 demonstrates the parameters of the best model we could capture on this data set.

Fig 5 shows the results from the SVR predictive model on the training and test data sets, comparing the predicted vs the actual 2015 giving in normalized scale.

thumbnail
Fig 5. Predicted vs. actual 2015 giving using SVR on training and test data sets.

https://doi.org/10.1371/journal.pone.0203928.g005

Discussion

The difference between the actual (target) and the predicted value is the forecasting error which represents the accuracy measure. To evaluate the performance of the applied models in this study, some accuracy measures have been calculated as follows:

  1. Symmetric Mean Absolute Percentage Error (SMAPE) is calculated using equation. SMAPE is an alternative criterion for Mean Absolute Percentage Error when there are zero values for giving. (8)
  2. Mean Absolute Error (MAE) is calculated using equation. A MAE that approaches zero is an indication of the model’s high accuracy. (9)
  3. Root Mean Square Error (RMSE) is calculated using equation. Small RMSE values also denote good performance on the part of the model. (10)
  4. Normalized Root Mean Square Error (NRMSE) is calculated using equation. A NRMSE value close to 1 indicates a poor model performance, whereas value close to 0 shows a good model performance. (11)
  5. Mean Square Error (MSE) is calculated using equation. The value of MSE depends on the scale of data but we report the MSE values on the normalized data for easier comparison. (12)

Where is the predicted value and is the actual value of giving for the ith observation.

Using and comparing several accuracy measures alongside each other enables us to better evaluate the results, as each accuracy measure has its own advantages and limitations and there is no single measure that is universally applicable under all conditions [31]. The values of all accuracy measures for all three models is shown in Table 13.

thumbnail
Table 13. Test results comparison of all three techniques.

https://doi.org/10.1371/journal.pone.0203928.t013

As Table 13 shows, ANN outperforms SVR and MLR in predicting the charitable giving using the three significant variables comparing R-Squared (R2) and error terms on this data set. According to this Table 9 measuring criteria show that ANN performs slightly better in predicting the U.S. charitable giving.

Conclusions and future work

In this paper, we studied the effect of 10 factors (population, personal income, education level, unemployment rate, poverty, and charitable giving in a previous year) on the receipt of charitable giving. Stepwise regression identified the four most influential variables among these 10 tested variables to be average household income, population, percentage of college graduates, and giving amounts for the previous year. To avoid dealing with multicollinearity, average household income was excluded as it was highly correlated with percentage of college graduates. All variables have positive relationship with donation. Previous year donation is the most highly correlated factor and percentage of college graduates is the least correlated. Three forecasting models were developed using MLR, ANN, and SVR and tested on a data set. To compare, 9 criteria measures were calculated for the results of each model. All three models are capable of predicting the amount of future donation in a given region with good accuracy, however, ANN outperforms SVR and MLR in most cases.

Moving forward, we plan to expand our study using other machine learning techniques, and by developing comparative studies to find potential ways to improve our suggested models. There are many reasons why being able to forecast charitable giving will help organizations better plan. For example, the occurrence of natural and man-made disasters may affect giving both in positive and in negative ways [32]. As an instance, national level disasters could potentially decrease giving, while a regional natural disaster may increase giving from those unaffected. After the terrorist attacks of September 11, 2001 (New York City, Washington D.C., Somerset County, PA), 65% of US household made charitable contributions [33]. The largest not-for-profit recipient organization received approximately $2 billion USD by the end of 2001. This was the largest giving in recorded US history. Although a blessing, such a large unexpected and rapid charitable influx can create its own complications for the receiving organization as a lack of preparedness, may potentially result in mismanagement of the contributions [34]. This example illustrates why charitable organizations, especially those involved in disaster relief efforts, need to be able to accurately forecast charitable giving to promote more equitable and efficient use of resources.

We plan to study the effects of natural or man-made disasters on the donor’s behavior, and to develop models and evaluate them using the recent disasters, such as hurricane Sandy. Furthermore, future research could identify and incorporate national economic factors in the predictive models.

References

  1. 1. Lacasse R-M, Lambert B. Cracking the complexity code of charities. International Journal of Technology Management & Sustainable Development. 2016;15(2):159–75.
  2. 2. Mularz CM, Ülkü MA. Analytics for Nonprofits. Encyclopedia of Business Analytics and Optimization: IGI Global; 2014. p. 115–23.
  3. 3. Barrett KS. Panel-data estimates of charitable giving: A synthesis of techniques. National Tax Journal. 1991;44(3):365–81.
  4. 4. Gittell R, Tebaldi E. Charitable giving: Factors influencing giving in US states. Nonprofit and Voluntary Sector Quarterly. 2006;35(4):721–36.
  5. 5. Navigator C. Giving USA 2015, the annual report on philanthropy. 2015.
  6. 6. McKeever BS. The Nonprofit Sector in Brief 2015: Public Charities. Giving, and. 2015.
  7. 7. Brown S, Greene WH, Harris MN, Taylor K. An inverse hyperbolic sine heteroskedastic latent class panel tobit model: An application to modelling charitable donations. Economic Modelling. 2015;50:228–36.
  8. 8. de Vries NJ, Reis R, Moscato P. Clustering consumers based on trust, confidence and giving behaviour: Data-driven model building for charitable involvement in the Australian not-for-profit sector. PloS one. 2015;10(4):e0122133. pmid:25849547
  9. 9. Dickert S, Kleber J, Västfjäll D, Slovic P. Mental imagery, impact, and affect: A mediation model for charitable giving. PloS one. 2016;11(2):e0148274. pmid:26859848
  10. 10. Bekkers R, Wiepking P. A literature review of empirical studies of philanthropy: Eight mechanisms that drive charitable giving. Nonprofit and voluntary sector quarterly. 2011;40(5):924–73.
  11. 11. Sikkel D, Schoenmakers E. Bequests to health‐related charitable organisations: a structural model. International Journal of Nonprofit and Voluntary Sector Marketing. 2012;17(3):183–97.
  12. 12. Yörük BK. How responsive are charitable donors to requests to give? Journal of Public Economics. 2009;93(9–10):1111–7.
  13. 13. Sargeant A. Charitable giving: Towards a model of donor behaviour. Journal of Marketing Management. 1999;15(4):215–38.
  14. 14. Hernández-Murillo R, Roisman D. The economics of charitable giving: what gives? The Regional Economist. 2005(Oct):12–3.
  15. 15. Schervish PG, Havens JJ. Social participation and charitable giving: A multivariate analysis. Voluntas: International Journal of Voluntary and Nonprofit Organizations. 1997;8(3):235–60.
  16. 16. Leslie LM, Snyder M, Glomb TM. Who gives? Multilevel effects of gender and ethnicity on workplace charitable giving. Journal of Applied Psychology. 2013;98(1):49. pmid:22985116
  17. 17. Saunders TJ, Taylor AH, Atkinson QD. No evidence that a range of artificial monitoring cues influence online donations to charity in an MTurk sample. Royal Society open science. 2016;3(10):150710. pmid:27853533
  18. 18. Ranganathan SK, Henley WH. Determinants of charitable donation intentions: a structural equation model. International Journal of Nonprofit and Voluntary Sector Marketing. 2008;13(1):1–11.
  19. 19. Ostrower F. Why the wealthy give: The culture of elite philanthropy: Princeton University Press; 1997.
  20. 20. Webb DJ, Green CL, Brashear TG. Development and validation of scales to measure attitudes influencing monetary donations to charitable organizations. Journal of the academy of marketing science. 2000;28(2):299–309.
  21. 21. Winterich KP, Mittal V, Aquino K. When does recognition increase charitable behavior? Toward a moral identity-based model. Journal of Marketing. 2013;77(3):121–34.
  22. 22. Chiang K-P. Prediction of monetary donation: a path analysis. ADVANCES IN MARKETING. 2003:138.
  23. 23. Yen ST. An econometric analysis of household donations in the USA. Applied Economics Letters. 2002;9(13):837–41.
  24. 24. Ülkü MA, Bell KM, Wilson SG. Modeling the impact of donor behavior on humanitarian aid operations. Annals of Operations Research. 2015;230(1):153–68.
  25. 25. Bekkers R, Wiepking P. Who gives? A literature review of predictors of charitable giving part one: religion, education, age and socialisation. Voluntary Sector Review. 2011;2(3):337–65.
  26. 26. Wiepking P, Bekkers R. Who gives? A literature review of predictors of charitable giving. Part Two: Gender, family composition and income. Voluntary Sector Review. 2012;3(2):217–45.
  27. 27. Tukey JW. The future of data analysis. The annals of mathematical statistics. 1962;33(1):1–67.
  28. 28. Schalkoff RJ. Artificial neural networks: McGraw-Hill New York; 1997.
  29. 29. Vapnik V, Lerner AY. Recognition of patterns with help of generalized portraits. Avtomat i Telemekh. 1963;24(6):774–80.
  30. 30. Vapnik V. The nature of statistical learning theory: Springer science & business media; 2013.
  31. 31. Zhang G, Patuwo BE, Hu MY. Forecasting with artificial neural networks:: The state of the art. International journal of forecasting. 1998;14(1):35–62.
  32. 32. Brown MS, Rooney PM. Giving following a crisis: an historical analysis. 2010.
  33. 33. Steinberg KS, Rooney PM. America gives: A survey of Americans’ generosity after September 11. Nonprofit and voluntary sector quarterly. 2005;34(1):110–35.
  34. 34. Steuerle CE. Managing Charitable Giving in the Wake of Disaster. 2002.