Pattern Discovery for High-Dimensional Binary Datasets

Snášel, Václav; Moravec, Pavel; Húsek, Dušan; Frolov, Alexander; Řezanková, Hana; Polyakov, Pavel

doi:10.1007/978-3-540-69158-7_89


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4984)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

In this paper we compare the performance of several dimension reduction techniques used as tools for feature extraction. The tested methods include singular value decomposition, semi-discrete decomposition, non-negative matrix factorization, a novel neural-network-based algorithm for Boolean factor analysis, and two cluster analysis methods. The so-called bars problem is used as the benchmark: a set of artificial signals, each generated as a Boolean sum of a given number of bars, is analyzed by these methods. The resulting images show that Boolean factor analysis is the most suitable method for this kind of data.
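To make the benchmark concrete, the following is a minimal sketch of how bars-problem signals could be generated as a Boolean sum (element-wise OR) of bars. The grid size, the number of bars per signal, the uniform random choice of bars, and the function names `make_bars` and `generate_signals` are all assumptions for illustration; the abstract does not state the parameters used in the paper.

```python
import numpy as np


def make_bars(size):
    """Return all 2*size bars (horizontal, then vertical) as flat Boolean vectors."""
    bars = []
    for i in range(size):
        horizontal = np.zeros((size, size), dtype=bool)
        horizontal[i, :] = True
        bars.append(horizontal.ravel())
    for j in range(size):
        vertical = np.zeros((size, size), dtype=bool)
        vertical[:, j] = True
        bars.append(vertical.ravel())
    return np.array(bars)


def generate_signals(n_signals, size=8, bars_per_signal=3, seed=0):
    """Generate binary images, each the Boolean sum of randomly chosen bars.

    All defaults here are illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    bars = make_bars(size)
    n_bars = bars.shape[0]

    # scores[k, b] is True when bar b takes part in signal k.
    scores = np.zeros((n_signals, n_bars), dtype=bool)
    for k in range(n_signals):
        chosen = rng.choice(n_bars, size=bars_per_signal, replace=False)
        scores[k, chosen] = True

    # Boolean matrix product: a pixel is on if any chosen bar covers it.
    signals = (scores.astype(int) @ bars.astype(int)) > 0
    return signals, scores, bars


if __name__ == "__main__":
    signals, scores, bars = generate_signals(n_signals=4, size=8)
    # Print the first signal as an 8x8 grid of 0s and 1s.
    print(signals[0].reshape(8, 8).astype(int))
```

In this formulation the bars play the role of Boolean factors and `scores` holds the factor scores, so a Boolean factor analysis method would ideally recover the individual bars from `signals` alone. Where bars overlap, the OR keeps the pixel at 1 rather than adding, which is what distinguishes this data from a linear mixture.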



Author information

Authors and Affiliations

Department of Computer Science, FEECS, VŠB – Technical University of Ostrava, 17. listopadu 15, 708 33, Ostrava-Poruba, Czech Republic
Václav Snášel & Pavel Moravec
Institute of Computer Science, Dept. of Nonlinear Systems, Academy of Sciences of the Czech Republic, Pod Vodárenskou věží 2, 182 07, Prague, Czech Republic
Dušan Húsek
Institute of Higher Nervous Activity and Neurophysiology, Russian Academy of Sciences, Butlerova 5a, 117 485, Moscow, Russia
Alexander Frolov
Department of Statistics and Probability, University of Economics, Prague, W. Churchill sq. 4, 130 67, Prague, Czech Republic
Hana Řezanková
Institute of Optical Neural Technologies, Russian Academy of Sciences, Vavilova 44, 119 333, Moscow, Russia
Pavel Polyakov


Editor information

Masumi Ishikawa, Kenji Doya, Hiroyuki Miyamoto, Takeshi Yamakawa

About this paper

Cite this paper

Snášel, V., Moravec, P., Húsek, D., Frolov, A., Řezanková, H., Polyakov, P. (2008). Pattern Discovery for High-Dimensional Binary Datasets. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds) Neural Information Processing. ICONIP 2007. Lecture Notes in Computer Science, vol 4984. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69158-7_89

DOI: https://doi.org/10.1007/978-3-540-69158-7_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69154-9
Online ISBN: 978-3-540-69158-7
eBook Packages: Computer Science, Computer Science (R0)
