Investigating Engineering Data by Probabilistic Measures

  • L. A. Bull
  • K. Worden
  • T. J. Rogers
  • E. J. Cross
  • N. Dervilis
Conference paper
Part of the Conference Proceedings of the Society for Experimental Mechanics Series book series (CPSEMS)


A critical issue for data-based engineering is the lack of descriptive labels for measured data. For many engineering systems, these labels are costly or impractical to obtain, so conventional supervised learning is not feasible. This article outlines a probabilistic framework for investigating and labelling engineering datasets. Two alternative probabilistic measures are suggested for identifying the most informative observations to investigate and annotate, with the aim of maximising the classification performance of a statistical model.
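The abstract does not name the two probabilistic measures, so the following is only a minimal sketch of the general idea of ranking unlabelled observations by how informative they appear to a probabilistic classifier. As an assumption made purely for illustration, it uses the Shannon entropy of the predicted class posterior as the measure; the function names `posterior_entropy` and `select_queries` and the example posteriors are hypothetical and are not taken from the paper.

```python
import math


def posterior_entropy(probs):
    """Shannon entropy (in nats) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_queries(posteriors, n_queries):
    """Return indices of the n_queries observations with the highest entropy.

    Assumption: high posterior entropy is used as a stand-in for
    'most informative'; the paper's own measures may differ.
    """
    ranked = sorted(
        range(len(posteriors)),
        key=lambda i: posterior_entropy(posteriors[i]),
        reverse=True,
    )
    return ranked[:n_queries]


# Hypothetical posteriors p(class | observation) for four unlabelled points.
posteriors = [
    [0.98, 0.01, 0.01],  # confidently classified
    [0.34, 0.33, 0.33],  # near-uniform, so most uncertain
    [0.70, 0.25, 0.05],
    [0.90, 0.05, 0.05],
]

# Indices of the two observations an engineer would be asked to label first.
queries = select_queries(posteriors, n_queries=2)
```

In an active-learning loop, the selected observations would be investigated and annotated, the classifier retrained on the enlarged labelled set, and the ranking repeated.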


Active learning; Guided sampling; Semi-supervised learning; Online structural health monitoring



The authors gratefully acknowledge the support of the UK Engineering and Physical Sciences Research Council (EPSRC) through Grant reference number EP/R003645/1. Further thanks are extended to Karen Holford and Rhys Pullin at Cardiff University for providing the AE data.



Copyright information

© Society for Experimental Mechanics, Inc. 2020

Authors and Affiliations

  • L. A. Bull (1)
  • K. Worden (1)
  • T. J. Rogers (1)
  • E. J. Cross (1)
  • N. Dervilis (1)
  1. Dynamics Research Group, Department of Mechanical Engineering, University of Sheffield, Sheffield, UK
