On the Security of Individual Data

Demetrovics, János; Katona, Gyula O. H.; Miklós, Dezső

doi:10.1007/978-3-540-24627-5_5

János Demetrovics⁶,
Gyula O. H. Katona⁷ &
Dezső Miklós⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2942))

Included in the following conference series:

International Symposium on Foundations of Information and Knowledge Systems

328 Accesses
3 Citations

Abstract

We will consider the following problem in this paper:

Assume there are n numeric data {x ₁,x ₂,...,x _n} (like salaries of n individuals) stored in a database and some subsums of these numbers are disclosed by making public or just available for persons not eligible to learn the original data. Our motivating question is: at most how many of these subsums may be disclosed such that none of the numbers x ₁,x ₂,...,x _n can be uniquely determined from these sums. These types of problems arise in the cases when certain tasks concerning a database are done by subcontractors who are not eligible to learn the elements of the database, but naturally should be given some data to fulfill there task. In database theory such examples are called statistical databases as they are used for statistical purposes and no individual data are supposed to be obtained using a restricted list of SUM queries. This problem was originally introduced by Chin and Ozsoyoglu [1], originally solved by Miller et al. [5] and revisited by Griggs [4].

It turned out [5] that the problem is equivalent to the following question: If there are n real, non-zero numbers X={x ₁,x ₂,...,x _n} given, what is the maximum number of 0 subsums of it, that is, what is the maximum number of the subsets of X whose elements sum up to 0. This approach, together with the Sperner theorem shows that no more than \(\left(\begin{array}{c} n \\ n/2 \\ \end{array}\right)\) sub-sums of a given set of secure data may be disclosed without disclosing at least one of the data, which upper bound is sharp as well.

However, it is natural to assume that the disclosed sub-sums of the original elements of the database will contain only a limited number of elements, say at most k (in the applications databases are usually huge, while the number of operations is in most of the cases limited). We have now the same question: at most how many of these subsums of at most k members may be given such that none of the numbers x ₁,x ₂,...,x _n can be uniquely determined from these sums. The main result of this paper gives an upper bound on this number, which turns out to be sharp if we allow subsums of only k or k-1 members and asymptotically sharp in case of subsums of at most k members.

The work was supported by the Hungarian National Foundation for Scientific Research grant numbers 37846 and 42706 and the European Community’s Centre of Excellence Grant numbers ICA1-CT-2000-70009 and ICA1-CT-2000-70025.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chin, F.Y., Ozsoyoglu, G.: Auditing and inference control in statistical databases. IEEE Transactions on Software Engineering SE-8, 574–582 (1982)
Article MathSciNet Google Scholar
Denning, D.E.: Cryptography and Data Security. Addison-Wesley, Sydney (1982)
MATH Google Scholar
Denning, D.E., Schlorer, J.: Inference controls for statistical databases. Computer, 69–82 (1983)
Google Scholar
Griggs, J.R.: Concentrating subset sums at k points. Bull. Inst. Comb. Applns. 20, 65–74 (1997)
MATH MathSciNet Google Scholar
Miller, M., Roberts, I., Simpson, I.: Application of Symnmetric Chains to an Optimization Problem in the Security of Statistical Databases. Bull. ICA 2, 47–58 (1991)
MATH MathSciNet Google Scholar
Miller, M., Roberts, I., Simpson, I.: Preventation of Relative Compromise in Statistical Databases Using Audit Expert. Bull. ICA 10, 51–62 (1994)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Computer and Automation Institute, Hungarian Academy of Sciences, Kende u. 13-17, H-1111, Hungary
János Demetrovics
Alfréd Rényi Institute of Mathematics, Hungarian Academy of Sciences, Budapest, P.O.B. 127, H-1364, Hungary
Gyula O. H. Katona & Dezső Miklós

Authors

János Demetrovics
View author publications
You can also search for this author in PubMed Google Scholar
Gyula O. H. Katona
View author publications
You can also search for this author in PubMed Google Scholar
Dezső Miklós
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universität Würzburg, Am Hubland, 97074, Würzburg, Germany
Dietmar Seipel
Department of Information Systems, Massey University, Information Science Research Centre, PO Box 756, Wellington, New Zealand
José María Turull-Torres

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Demetrovics, J., Katona, G.O.H., Miklós, D. (2004). On the Security of Individual Data. In: Seipel, D., Turull-Torres, J.M. (eds) Foundations of Information and Knowledge Systems. FoIKS 2004. Lecture Notes in Computer Science, vol 2942. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24627-5_5

Download citation

DOI: https://doi.org/10.1007/978-3-540-24627-5_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20965-2
Online ISBN: 978-3-540-24627-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics