Compression of clustered inverted files

  • Communications
  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 64)

Abstract

One way to save memory space in inverted file organizations is to map each address list to a bit-vector and compress that bit-vector with a suitable compression technique. This study discusses eight such techniques for nonuniformly distributed bit-vectors. Occurrences of clusters with high 1-bit densities are simulated using an n-state bit-vector generating process. Experiments with a real-life file are also reported.
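
To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' method. It maps an address list to a bit-vector, compresses it with plain run-length encoding (chosen only for illustration; the abstract does not name the eight techniques studied), and generates clustered test vectors from a simple multi-state process. The function names, the per-state densities, and the `stay` probability are all assumptions introduced here.

```python
import random


def address_list_to_bits(addresses, n_records):
    """Map a list of record addresses (0-based) to a bit-vector of length n_records."""
    bits = [0] * n_records
    for a in addresses:
        bits[a] = 1
    return bits


def run_lengths(bits):
    """Run-length encode a bit-vector as (first_bit, list of run lengths)."""
    if not bits:
        return 0, []
    runs = []
    current, length = bits[0], 0
    for b in bits:
        if b == current:
            length += 1
        else:
            runs.append(length)
            current, length = b, 1
    runs.append(length)
    return bits[0], runs


def decode_runs(first_bit, runs):
    """Invert run_lengths: rebuild the bit-vector from alternating runs."""
    bits, bit = [], first_bit
    for r in runs:
        bits.extend([bit] * r)
        bit ^= 1
    return bits


def clustered_bits(n, densities, stay, seed=0):
    """Hypothetical n-state generator (an assumption, not the paper's model).

    State i emits a 1-bit with probability densities[i]. After each bit the
    process stays in its state with probability `stay`, otherwise it jumps
    uniformly to one of the other states.
    """
    rng = random.Random(seed)
    state, out = 0, []
    for _ in range(n):
        out.append(1 if rng.random() < densities[state] else 0)
        if len(densities) > 1 and rng.random() >= stay:
            others = [s for s in range(len(densities)) if s != state]
            state = rng.choice(others)
    return out


bits = address_list_to_bits([2, 3, 4, 9], 12)
first, runs = run_lengths(bits)
print(bits)         # [0, 0, 1, 1, 1, 0, 0, 0, 0, 1, 0, 0]
print(first, runs)  # 0 [2, 3, 4, 1, 2]
```

With a sparse state and a dense state, for example `clustered_bits(1000, [0.02, 0.8], stay=0.95)`, the generated vector tends to contain long runs of 0-bits interrupted by dense clusters of 1-bits, which is the kind of nonuniform input the abstract refers to.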

Editor information

J. Winkowski

Copyright information

© 1978 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nevalainen, O., Jakobsson, M., Berg, R. (1978). Compression of clustered inverted files. In: Winkowski, J. (eds) Mathematical Foundations of Computer Science 1978. MFCS 1978. Lecture Notes in Computer Science, vol 64. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-08921-7_87

  • DOI: https://doi.org/10.1007/3-540-08921-7_87

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-08921-6

  • Online ISBN: 978-3-540-35757-5

  • eBook Packages: Springer Book Archive
