Abstract
Error localization problems can be converted into Integer Linear Programming problems. This approach provides several advantages and guarantees to find a set of erroneous fields having minimum total cost. By doing so, each erroneous record produces an Integer Linear Programming model that should be solved. This requires the use of specific solution softwares called Integer Linear Programming solvers. Some of these solvers are available as open source software. A study on the performance of internationally recognized open source Integer Linear Programming solvers, compared to a reference commercial solver on real-world data having only numerical fields, is reported. The aim was to produce a stressing test environment for selecting the most appropriate open source solver for performing error localization in numerical data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Achterberg, T.: SCIP - a framework to integrate Constraint and Mixed Integer Programming. Technical Report 04-19, Zuse Institute Berlin (2004)
Banff Support Team: Functional Description of the Banff System for Edit and Imputation System. Statistics Canada, Quality Assurance and Generalized Systems Section Tech. Rep. (2003)
Bankier, M.: Canadian Census Minimum change Donor imputation methodology. In: Proceedings of the Workshop on Data Editing, UN/ECE, Cardiff, United Kingdom (2000)
Bianchi, G., Manzari, A., Reale, A., Salvi, S.: Valutazione dell’idoneità del software DIESIS all’individuazione dei valori errati in variabili quantitative. Istat - Collana Contributi Istat n. 1-2009 (2009)
Bruni, R.: Discrete models for data imputation. Discrete Appl. Math. 144(1), 59–69 (2004)
Bruni, R.: Error Correction for Massive Data Sets. Optimization Methods and Software 20 (2-3), 295–314 (2005)
Bruni, R., Bianchi, G.: A formal procedure for finding contradictions into a set of rules. Appl. Math. Sci. 6(126), 6253–6271 (2012)
De Waal, T.: Computational Results with Various Error Localization Algorithms. UNECE Statistical Data Editing Work Session, Madrid, Spain (2003)
Fellegi, I.P., Holt, D.: A systematic approach to automatic edit and imputation. J. Am. Stat. Assoc. 71, 17–35 (1976)
Forrest, J., Lougee-Heimer, R.: CBC User Guide. Online available at http://www.coin-or.org/Cbc (2005)
Forrest, J., De la Nuez, D., Lougee-Heimer, R.: CLP User Guide. Online available at http://www.coin-or.org/Clp (2005)
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman and Company, San Francisco (1979)
Makhorin, A.: GLPK Reference Manual. Online at http://www.gnu.org/software/glpk/
Nemhauser, G.L., Wolsey, L.A.: Integer and Combinatorial Optimization. Wiley, New York (1999)
Ralphs, T.: Symphony Documentation. Online available at https://projects.coin-or.org/SYMPHONY
Riera-Ledesma, J., Salazar-Gonzalez, J.-J. New Algorithms for the Editing and Imputation Problem. UNECE Statistical Data Editing Work Session, Madrid, Spain (2003)
Winkler, W.E.: State of Statistical Data Editing and current Research Problems. In: Proceedings of the Workshop on Data Editing, UN/ECE, Rome, Italy (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Bianchi, G., Bruni, R., Reale, A. (2013). Open Source Integer Linear Programming Solvers for Error Localization in Numerical Data. In: Torelli, N., Pesarin, F., Bar-Hen, A. (eds) Advances in Theoretical and Applied Statistics. Studies in Theoretical and Applied Statistics(). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35588-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-35588-2_28
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35587-5
Online ISBN: 978-3-642-35588-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)