Skip to main content

Open Source Integer Linear Programming Solvers for Error Localization in Numerical Data

  • Chapter
  • First Online:
Advances in Theoretical and Applied Statistics

Abstract

Error localization problems can be converted into Integer Linear Programming problems. This approach provides several advantages and guarantees to find a set of erroneous fields having minimum total cost. By doing so, each erroneous record produces an Integer Linear Programming model that should be solved. This requires the use of specific solution softwares called Integer Linear Programming solvers. Some of these solvers are available as open source software. A study on the performance of internationally recognized open source Integer Linear Programming solvers, compared to a reference commercial solver on real-world data having only numerical fields, is reported. The aim was to produce a stressing test environment for selecting the most appropriate open source solver for performing error localization in numerical data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Achterberg, T.: SCIP - a framework to integrate Constraint and Mixed Integer Programming. Technical Report 04-19, Zuse Institute Berlin (2004)

    Google Scholar 

  2. Banff Support Team: Functional Description of the Banff System for Edit and Imputation System. Statistics Canada, Quality Assurance and Generalized Systems Section Tech. Rep. (2003)

    Google Scholar 

  3. Bankier, M.: Canadian Census Minimum change Donor imputation methodology. In: Proceedings of the Workshop on Data Editing, UN/ECE, Cardiff, United Kingdom (2000)

    Google Scholar 

  4. Bianchi, G., Manzari, A., Reale, A., Salvi, S.: Valutazione dell’idoneità del software DIESIS all’individuazione dei valori errati in variabili quantitative. Istat - Collana Contributi Istat n. 1-2009 (2009)

    Google Scholar 

  5. Bruni, R.: Discrete models for data imputation. Discrete Appl. Math. 144(1), 59–69 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  6. Bruni, R.: Error Correction for Massive Data Sets. Optimization Methods and Software 20 (2-3), 295–314 (2005)

    MathSciNet  Google Scholar 

  7. Bruni, R., Bianchi, G.: A formal procedure for finding contradictions into a set of rules. Appl. Math. Sci. 6(126), 6253–6271 (2012)

    MathSciNet  MATH  Google Scholar 

  8. De Waal, T.: Computational Results with Various Error Localization Algorithms. UNECE Statistical Data Editing Work Session, Madrid, Spain (2003)

    Google Scholar 

  9. Fellegi, I.P., Holt, D.: A systematic approach to automatic edit and imputation. J. Am. Stat. Assoc. 71, 17–35 (1976)

    Article  Google Scholar 

  10. Forrest, J., Lougee-Heimer, R.: CBC User Guide. Online available at http://www.coin-or.org/Cbc (2005)

  11. Forrest, J., De la Nuez, D., Lougee-Heimer, R.: CLP User Guide. Online available at http://www.coin-or.org/Clp (2005)

  12. Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman and Company, San Francisco (1979)

    MATH  Google Scholar 

  13. Makhorin, A.: GLPK Reference Manual. Online at http://www.gnu.org/software/glpk/

  14. Nemhauser, G.L., Wolsey, L.A.: Integer and Combinatorial Optimization. Wiley, New York (1999)

    MATH  Google Scholar 

  15. Ralphs, T.: Symphony Documentation. Online available at https://projects.coin-or.org/SYMPHONY

  16. Riera-Ledesma, J., Salazar-Gonzalez, J.-J. New Algorithms for the Editing and Imputation Problem. UNECE Statistical Data Editing Work Session, Madrid, Spain (2003)

    Google Scholar 

  17. Winkler, W.E.: State of Statistical Data Editing and current Research Problems. In: Proceedings of the Workshop on Data Editing, UN/ECE, Rome, Italy (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gianpiero Bianchi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Bianchi, G., Bruni, R., Reale, A. (2013). Open Source Integer Linear Programming Solvers for Error Localization in Numerical Data. In: Torelli, N., Pesarin, F., Bar-Hen, A. (eds) Advances in Theoretical and Applied Statistics. Studies in Theoretical and Applied Statistics(). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35588-2_28

Download citation

Publish with us

Policies and ethics