Improving the Accuracy of Prediction of Plant Diseases Using Dimensionality Reduction-Based Ensemble Models

  • A. R. Mohamed Yousuff
  • M. Rajasekhara BabuEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1054)


In many real-world applications, different features can be obtained and how to duly utilize them in reduced dimension is a challenge. Simply concatenating them into a long vector is not appropriate because each view has its specific statistical property and physical interpretation. Many dimensionality reduction methods have been developed to identify this lower-dimensional space and map data to it, reducing the number of predictors in supervised learning problems and allowing for better visualization of data relations and clusters. However, the plethora of dimensionality reduction techniques provides a variety of nonlinear, linear, global, and local methods, and it is likely that each method captures different data features. Ensemble methods have achieved much success in supervised learning, from Random Forest to AdaBoost. Ensembles exploit diversity and balance bias, variance, and covariance to achieve these results is likely that disparate dimensionality reduction methods will enhance diversity within a dimensionality reduction-based ensemble. AdaBoost and Random Forest are popular ensemble methods which are widely used for classification of target variables. Major problem with ensembles like AdaBoost and Random Forest is that they perform worse when dimensionality of data is high. Random Forest is the predictor ensemble with a set of decision trees that grow in randomly selected subspaces of data. The proposed research work aims to improve the performance of classification tasks on diseased plants by exploring t-distributed Stochastic Neighbor Embedding (t-SNE) based Ensemble Models. The infected and healthy plant images are subjected to deep learning model to produce their corresponding image embedding. The high dimensional data with thousands of features is then reduced to a smaller number of features dataset by the state-of-the-art t-SNE algorithm. The significant feature dataset is then given as input to the ensemble models to perform the prediction task.


Dimensionality reduction t-SNE Deep learning model Ensemble models 


  1. 1.
    van Der Maaten, L., Postma, E., Van den Herik, J.: Dimensionality reduction: a comparative. J. Mach. Learn. Res. 10, 66–71 (2009)Google Scholar
  2. 2.
    van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008). Available:
  3. 3.
    Laparra, V., Malo, J., Camps-Valls, G.: Dimensionality reduction via regression in hyperspectral imagery. IEEE J. Sel. Top. Signal Process. 9(6), 1026–1036 (2015). Available:
  4. 4.
    Huang, H., Yang, M.: Dimensionality reduction of hyperspectral images with sparse discriminant embedding. IEEE Trans. Geosci. Remote Sens. 53(9), 5160–5169 (2015). Available:
  5. 5.
    An, J., Zhang, X., Jiao, L.C.: Dimensionality reduction based on group-based tensor model for hyperspectral image classification. IEEE Geosci. Remote Sens. Lett. 13(10), 1497–1501 (2016). Available:
  6. 6.
    Xie, L., Yin, M., Yin, X., Liu, Y., Yin, G.: Low-rank sparse preserving projections for dimensionality reduction. IEEE Trans. Image Process. 27(11), 5261–5274 (2018). Available:
  7. 7.
    An, J., Zhang, X., Zhou, H., Feng, J., Jiao, L.: Patch tensor-based sparse and low-rank graph for hyperspectral images dimensionality reduction. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 11(7), 2513–2527 (2018). Available:
  8. 8.
    Zheng, C., Zhao, R., Liu, F., Kong, J., Wang, J., Bi, C., Yi, Y.: Dimensionality reduction via multiple locality-constrained graph optimization. IEEE Access 6, 54,479–54,494 (2018). Available:
  9. 9.
    Chouhan, S.S., Kaul, A., Singh, U.P., Jain, S.: Bacterial foraging optimization based radial basis function neural network (BRBFNN) for identification and classification of plant leaf diseases: an automatic approach towards plant pathology. IEEE Access 6, 8852–8863 (2018). Available:
  10. 10.
    Kamble, J.K.: Plant disease detector. (UNAV) (2018). Available:
  11. 11.
    Sardogan, M., Tuncer, A., Ozen, Y.: Plant leaf disease detection and classification based on CNN with LVQ algorithm. (UNAV) (2018). Available:
  12. 12.
    Trongtorkid, C., Pramokchon, P.: Expert system for diagnosis mango diseases using leaf symptoms analysis. (UNAV) (2018). Available:
  13. 13.
    Patrick, A., Pelham, S., Culbreath, A., Holbrook, C.C., De Godoy, I.J., Li, C.: High throughput phenotyping of tomato spot wilt disease in peanuts using unmanned aerial systems and multispectral imaging. IEEE Instrum. Meas. Mag. 20(3), 4–12 (2017). Available:
  14. 14.
    Simonyan, K., Zisserman, A.: Very Deep Convolutional Networks for Large-Scale Image Recognition (2014). Available: [cs]
  15. 15.
    Zhao, X., Wu, Y., Lee, D.L., Cui, W.: iForest: interpreting random forests via visual analytics. IEEE Trans. Vis. Comput. Graph. 25(1), 407–416 (2019). Available:
  16. 16.
    Zhang, C., Cai, Q., Song, Y.: Boosting with pairwise constraints. Neurocomputing 73(4–6), 908–919 (2010). Available:

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.School of Computer Science and EngineeringVIT UniversityVelloreIndia

Personalised recommendations