Modeling the Propensity Score with Statistical Learning

  • Kenshi Uchihashi
  • Atsunori KanemuraEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9950)


The progress of the ICT technology has produced data-sources that continuously generate datasets with different features and possibly with partial missing values. Such heterogeneity can be mended by integrating several processing blocks, but a unified method to extract conclusions from such heterogeneous datasets would bring consistent results with lower complexity. This paper proposes a flexible propensity score estimation method based on statistical learning for classification, and compared its performance against classical generalized linear methods.


Propensity scores Missing value estimation Observational studies Statistical learning Deep learning 



This study was supported in part by the New Energy and Industrial Technology Development Organization (NEDO), Japan, and by JSPS KAKENHI 26730130 and 15K12112.


  1. 1.
    Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data, 2nd edn. Wiley, Hoboken (2002)zbMATHGoogle Scholar
  2. 2.
    Carpenter, J., Kenward, M.: Multiple Imputation and its Applications. Wiley, Hoboken (2013)CrossRefzbMATHGoogle Scholar
  3. 3.
    Guo, S., Fraser, M.W., Analysis, P.S.: Statistical Methods and Applications, 2nd edn. Sage, Thousand Oaks (2015)Google Scholar
  4. 4.
    Rosenbaum, P.R., Rubin, D.B.: The central role of the propensity score in observational studies for causal effects. Biometrika 70(1), 41–55 (1983)MathSciNetCrossRefzbMATHGoogle Scholar
  5. 5.
    Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (2006)zbMATHGoogle Scholar
  6. 6.
    Rosenbaum, P.R.: Design of Observational Studies. Springer, Heidelberg (2010)CrossRefzbMATHGoogle Scholar
  7. 7.
    LeCun, Y., Bottou, L., Benjio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRefGoogle Scholar
  8. 8.
    Sainath, T.N., Weiss, R.J., Wilson, K.W., Vinyals, O.: Learning the speech front-end with raw wave from CLDNNs. In: Proceedings of INTERSPEECH (2015)Google Scholar
  9. 9.
    Lei, T., Barzilay, R., Jaakkola, T.: Molding CNNs for text: non-linear, non-consecutive convolutions. In: Proceedings of Empirical Methods on Natural Language Processing (EMNLP) (2015)Google Scholar
  10. 10.
    Rubin, D.B.: Matched Sampling for Causal Effects. Cambridge University Press, Cambridge (2006)CrossRefzbMATHGoogle Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. 1.National Institute of Advanced Industrial Science and Technology (AIST)TsukubaJapan

Personalised recommendations