dc.contributor.author |
Cannas, Massimo |
dc.contributor.author |
Arpino, Bruno |
dc.date.accessioned |
2018-01-26T10:54:15Z |
dc.date.available |
2018-01-26T10:54:15Z |
dc.date.issued |
2018-01 |
dc.identifier.uri |
http://hdl.handle.net/10230/33768 |
dc.description.abstract |
Using an extensive simulation exercise, we address two open issues in propensity score
analyses: how to estimate propensity scores and how to assess covariates balance. We
compare the performance of several machine learning algorithms and the standard logistic
regression in terms of bias and mean squared errors of matching and weighing estimators
based on the estimated propensity score. Additionally, we profit of the simulation
framework to assess the ability of several measures of covariate balance in predicting the
quality of the propensity score estimators in terms of bias reduction. Among the different
techniques we considered, random forests performed the best when propensity scores were used for matching. In the case of weighting, both random forests and boosted tree
outperformed other techniques. As for the performance of the several diagnostics of
covariate balance we considered, we found that the simplest and most commonly used one, the Absolute Standardized Average Mean difference of covariates (ASAM), predicts well the bias of causal estimators. However, our findings suggest the use of a stringent rule: researchers should aim (at least) at obtaining an average ASAM lower than 10% and/or a low proportion of covariates with ASAM exceeding the 10% threshold. Balancing
interactions among covariates is also desirable. |
dc.format.mimetype |
application/pdf |
dc.language.iso |
eng |
dc.relation.ispartofseries |
RECSM Working Paper Series;54 |
dc.rights |
This is an Open Access article distributed under the terms of the Creative Commons Attribution License Creative Commons Attribution 4.0 International, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properlyattributed. |
dc.rights.uri |
http://creativecommons.org/licenses/by/4.0/ |
dc.subject.other |
Causal inference |
dc.subject.other |
Propensity score methods |
dc.subject.other |
Covariate balance |
dc.subject.other |
Machine learning |
dc.subject.other |
Algorithms; simulation study |
dc.title |
Machine learning for propensity score matching and weighting : comparing different estimation techniques and assessing ifferent balance diagnostics |
dc.type |
info:eu-repo/semantics/workingPaper |
dc.rights.accessRights |
info:eu-repo/semantics/openAccess |