Abstract
So far little attention has been paid to file format robustness, i.e., a file formats capability for keeping its information as safe as possible in spite of data corruption. The paper on hand reports on the first comprehensive research on this topic. The research work is based on a study on the status quo of file format robustness for various file formats from the image domain. A controlled test corpus was built which comprises files with different format characteristics. The files are the basis for data corruption experiments which are reported on and discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Avcıbas, I., Sankur, B., Sayood, K.: Statistical evaluation of image quality measures. Journal of Electronic Imaging 11(2), 206–223 (2002)
Bairavasundaram, L.N., et al.: An Analysis of Data Corruption in the Storage Stack. ACM Transactions on Storage 4(3) (2008)
Buonora, P., Liberati, F.: A Format for Digital Preservation of Images: A Study on JPEG 2000 File Robustness. D-Lib Magazine 7/8 (2008), http://www.dlib.org/dlib/july08/buonora/07buonora.html (accessed May 2009)
Chapman, S., et al.: Page Image Compression for Mass Digitization. In: Archiving 2007. Final program and proceedings, pp. 37–42 (2007)
Gilesse, R., Rog, J., Verheusen, A.: Life Beyond uncompressed TIFF: Alternative File Formats for the Storage of Master Image Files. In: Archiving 2008. Final program and proceedings, pp. 41–46 (2008)
Heydegger, V.: Analysing the Impact of File Formats on Data Integrity. In: Archiving 2008. Final program and proceedings, pp. 50–55 (2008)
Iraci, J.: The Relative Stabilities of Optical Disk Formats. Restaurator 26(2) (2005)
ISO/IEC 15444-5:2003. JPEG 2000 image coding system (2003)
Matsumoto, M., Nishimura, T.: Mersenne Twister: A 623-dimensionally equidistributed uniform pseudorandom number generator. ACM Trans. on Modeling and Computer Simulation 8(1), 3–30 (1998)
Panzer-Steindel, B.: Data Integrity, internal CERN/IT study (2007), http://indico.cern.ch/getFile.py/access?contribId=3&sessionId=0&resId=1&materialId=paper&confId=13797 (accessed May 2009)
Schroeder, B., Gibson, G.A.: Disk failures in the real world: What does an mttf of 1,000,000 hours mean to you? In: Proceedings of the 5th USENIX Conference on File and Storage Technologies, FAST (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Heydegger, V. (2009). Just One Bit in a Million: On the Effects of Data Corruption in Files. In: Agosti, M., Borbinha, J., Kapidakis, S., Papatheodorou, C., Tsakonas, G. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2009. Lecture Notes in Computer Science, vol 5714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04346-8_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-04346-8_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04345-1
Online ISBN: 978-3-642-04346-8
eBook Packages: Computer ScienceComputer Science (R0)