Skip to main content

An Algorithm for Sample and Data Dimensionality Reduction Using Fast Simulated Annealing

  • Conference paper
Advanced Data Mining and Applications (ADMA 2011)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7120))

Included in the following conference series:

Abstract

This paper deals with dimensionality and sample length reduction applied to the tasks of exploratory data analysis. Proposed technique relies on distance preserving linear transformation of given dataset to the lower dimensionality feature space. Coefficients of feature transformation matrix are found using Fast Simulated Annealing - an algorithm inspired by physical annealing of solids. Furthermore the elimination or weighting of data elements which, as an effect of above mentioned transformation, were moved significantly from the rest of the dataset can be performed. Presented method was positively verified in routines of clustering, classification and outlier detection. It ensures proper efficiency of those procedures in compact feature space and with reduced data sample length at the same time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alba, E. (ed.): Parallel Metaheuristics. John Wiley & Sons, New York (2005)

    MATH  Google Scholar 

  2. Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (1999)

    MATH  Google Scholar 

  3. Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: A survey. ACM Computing Surveys 41, 15:1–15:58 (2009)

    Article  Google Scholar 

  4. Cox, T.F., Cox, M.A.A.: Multidimensional Scaling. Chapman & Hall, Boca Raton (2000)

    MATH  Google Scholar 

  5. Ingber, L.: Adaptive simulated annealing (ASA): Lessons learned. Control and Cybernetics 25(1), 33–54 (1996)

    MATH  Google Scholar 

  6. Jain, A.K., Duin, R.P.W., Mao, J.: Statistical Pattern Recognition: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(1), 4–37 (2000)

    Article  Google Scholar 

  7. Kirkpatrick, S., Gelatt, C.D., Vecchi, M.: Optimization by Simulated Annealing. Science 220, 671–680 (1983)

    Article  MathSciNet  MATH  Google Scholar 

  8. Kowalski, P.A., Kulczycki, P.: Data Sample Reduction for Classification of Interval Information using Neural Network Sensitivity Analysis. In: Dicheva, D., Dochev, D. (eds.) AIMSA 2010. LNCS, vol. 6304, pp. 271–272. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  9. Kulczycki, P.: Kernel Estimators in Industrial Applications. In: Prasad, B. (ed.) Soft Computing Applications in Industry, pp. 69–91. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  10. Lee, J.L., Verleysen, M.: Nonlinear Dimensionality Reduction. Springer, Heidelberg (2007)

    Book  MATH  Google Scholar 

  11. Łukasik, S.: Parallel Computing of Kernel Density Estimates with MPI. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007. LNCS, vol. 4489, pp. 726–733. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  12. van der Maaten, L.J.P.: Feature Extraction from Visual Data. PhD Thesis, Tilburg University, June 23 (2009)

    Google Scholar 

  13. Mitra, P., Murthy, C.A., Pal, S.K.: Unsupervised Feature Selection using Feature Similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(4), 301–312 (2002)

    Article  Google Scholar 

  14. Pal, S.K., Mitra, P.: Pattern Recognition Algorithms for Data Mining. Chapman and Hall, London (2004)

    Book  MATH  Google Scholar 

  15. Sait, S.M., Youssef, H.: Iterative computer algorithms with applications in engineering. IEEE Computer Society, Los Alamitos (1999)

    MATH  Google Scholar 

  16. Saxena, A., Pal, N.R., Vora, M.: Evolutionary methods for unsupervised feature selection using Sammons stress function. Fuzzy Information and Engineering 2(3), 229–247 (2010)

    Article  Google Scholar 

  17. Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman and Hall, London (1986)

    Book  MATH  Google Scholar 

  18. Szu, H., Hartley, R.: Fast simulated annealing. Physics Letters A 122(3-4), 157–162 (1987)

    Article  Google Scholar 

  19. Tan, P.-N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Addison-Wesley, Boston (2006)

    Google Scholar 

  20. UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/

  21. Verleysen, M., François, D.: The Curse of Dimensionality in Data Mining and Time Series Prediction. In: Cabestany, J., Prieto, A., Sandoval, F. (eds.) IWANN 2005. LNCS, vol. 3512, pp. 758–770. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  22. Wand, M.P., Jones, M.C.: Kernel Smoothing. Chapman and Hall, London (1995)

    Book  MATH  Google Scholar 

  23. Xu, R., Wunsch, D.C.: Clustering. Wiley, New York (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Š 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Łukasik, S., Kulczycki, P. (2011). An Algorithm for Sample and Data Dimensionality Reduction Using Fast Simulated Annealing. In: Tang, J., King, I., Chen, L., Wang, J. (eds) Advanced Data Mining and Applications. ADMA 2011. Lecture Notes in Computer Science(), vol 7120. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25853-4_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25853-4_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25852-7

  • Online ISBN: 978-3-642-25853-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics