Skip to main content

“Cry Baby”: Using Spectrographic Analysis to Assess Neonatal Health Status from an Infant’s Cry

  • Chapter
  • First Online:
Advances in Speech Recognition

Abstract

Infant cry analysis is a multidisciplinary area of research incorporating­ ­pediatrics, neurology, physiology, engineering, developmental linguistics, and ­psychology. It has been proposed in the pediatric literature that the infant cry is a reflection of complex neurophysiologic functions and that analysis of the cry itself can be used to assess the status of the infant’s health. Given the diagnostic importance of infant cry, this chapter presents application of spectrographic analysis to the vocal sounds of an infant, comparing normal with abnormal infant cry. Drawing from a rich body of research on spectrographic analysis predominantly used for performance of speaker recognition, this chapter presents how such spectral features that are used to identify and verify speakers can be applied to assess the neonate’s health status, by comparing a normal to an abnormal cry. Ten distinct cry modes, viz., hyperphonation, dysphonation, inhalation, double harmonic break, trailing, vibration, weak vibration, flat, rising, and falling have been identified for normal infant cry and their spectrographic patterns were observed. This analysis was then extended to abnormal infant cry. It has been observed that the double harmonic break is more dominant for ­abnormal infant cry in cases of myalgia (muscular pain). The inhalation pattern is distinct for infants suffering from asthma or other respiratory ailments such as a cough or cold. For example, for the infant whose larynx is not well developed, the pitch harmonics are nearly absent. As such, there are no voicing or glottal vibrations in the cry signal. In addition, for infants with Hypoxic Ischemic Encephalopathy (HIE), there is an ­initial tendency of pitch harmonics­ to rise and then to be followed by a blurring of such ­harmonics. Finally, an infant cry classification system is analyzed by observing the nature of the optimal ­warping path in the Dynamic Time Warping (DTW) algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    SIDS-is a syndrome marked by the sudden death of an infant that is unexpected by history and remains unexplained after a thorough forensic autopsy, a detailed crime scene investigation, and an exploration of the medical history of the infant and family. SIDS was responsible for 0.543 deaths per 1,000 live births in the U.S. in 2005 [49,36,39,41]. According to a recent study, babies who die of SIDS have abnormalities in brain stem (the medulla oblongata) which helps in control functions like breathing (which in turn may affect infant cry), blood pressure, arousal and abnormalities in serotonin signaling. According to the National Institute of Health (NIH), which funded the study, this finding is the strongest evidence to date that the structural difference in a specific part of the brain may contribute to the risk of SIDS [30,49]. Colton and Steinschneider analyzed cry of SIDS victim and correlated the cause of SIDS with relatively lower Fo, longer duration, lower formant frequencies and greater sound pressure level throughout the spectrum [6]. Corwin et. al. observed that infants whose first cries exhibited a high first formant were more likely to die of SIDS than infants whose first cries did not have this characteristics [5]. In a groundbreaking study of 74 larynges removed at death from children who died of SIDS reported by Harrison, it was observed that SIDS can be attributed to reduction in the subglottic area (particularly around the age of 3 months), which is potentially lethal. The reduction in subglottic airway is often secondary to an increase in mucus-secreting glands caused by an upper respiratory tract infection, all of which affects infant cry [15]. These pioneering studies indicate that the infants with SIDS have significant abnormalities in their cry signal and hence changes in spectral characteristics. Moreover, imagine a situation where an infant cry analyzer is developed to increase doctors’ confidence in making decisions about slow developing abnormalities in infants from their cry beforehand. In such cases, suitable steps in the form of drugs or therapy could be given to save the life of infant. In addition, the work on cry analysis of infants whose larynx is not fully developed (e.g. laryngomalacia) is scant. This present work may constitute a crucial step in filling this gap.

References

  1. V. Apgar, “A proposal for a new method of evaluation of the newborn infant,” Curr. Res. Anesth. Analg., vol. 32, pp. 260–267, 1953.

    Google Scholar 

  2. J. F. Bosma, H. M. Truby, and J. Lind, “Cry motions of the newborn infant,” Acta Paediat. Scand. Suppl., vol. 163, pp. 61–92, 1965.

    Google Scholar 

  3. Neeharika Buddha and Hemant A. Patil, “Corpora for analysis of infant cry,” Int. Conf. on Speech Databases and Assessments, Oriental COCOSDA, Hanoi, Vietnam, pp. 43–48, Dec. 4–6, 2007.

    Google Scholar 

  4. M. Corwin, and H. Golub, “Medical applications of infant cry analysis,” A. Milunsky, E. Friedman, and L. Gluck, Advances in prenatal medicine, New York: Plenum Press, vol. 4, pp. 163–188, 1985.

    Google Scholar 

  5. M. J. Corwin, B. M. Lester, C. Sepkoski, Peucker, M. Kayne, H., H. L. Golub, “Newborn acoustic cry characteristics of infants subsequently dying of sudden infant death syndrome,” Pediatrics, vol. 96, no. 1, pp. 73–77, July 1995.

    Google Scholar 

  6. R. Colton and A. Steinschneider, “The cry characteristics of an infant who died of sudden infant death syndrome,” J. Speech, Hearing Dis., vol. 46, pp. 359–363, 1981.

    Google Scholar 

  7. K. John Cullen Jr., Nancy Fargo, Richard A. Chase, Peggy Baker, “The development of auditory feedback monitoring: I Delayed auditory feedback studies on infant cry,” J. Speech, Hearing Research, vol. 11, pp.85–93 1968.

    Google Scholar 

  8. G. Fant. Acoustic Theory of Speech Production. The Hague: Mouton, 1960.

    Google Scholar 

  9. V. Fisichelli, M. Coxe, L. Rosenfeld, A. Haber, J. Davis, and A. Karelitz, “The phonetic content of the cries of normal infants and those with brain damage,” Journal of Psychology, vol. 64, pp. 119–126, 1966.

    Article  Google Scholar 

  10. B. F. Fuller and Y. Horii, “Spectral energy distribution in four types of infant vocalizations,” J. Commun. Disord., vol. 21, pp. 251–261, 1988.

    Article  Google Scholar 

  11. L. Gray, “Signal detection and analysis of delays in neonates vocalization,” J. Acoust. Soc. Am., vol. 82, pp. 1608–1611, 1987.

    Article  Google Scholar 

  12. Susan M. Grau, Michael P. Robb, Anthony T. Cacace, “Acoustic correlates of inspiratory phonation during infant cry: Research note,” Journal of Speech and Hearing Research, vol. 38, pp. 373–381, April 1995.

    Google Scholar 

  13. H. L. Golub and M. J. Corwin, “A physioacoustic model of infant cry,” in Infant Crying: Theoretical And Research Perspective, B. M. Lester and C. F. Z. Boukydis, Eds. New York, Plenum, pp.59–81, 1985.

    Chapter  Google Scholar 

  14. G. E. Gustafson and J. A. Green, “On the importance of fundamental frequency and other acoustic features in cry perception and infant development,” Child Developmerit, vol. 60, pp. 772–80, 1989.

    Article  Google Scholar 

  15. D. Harrison, “Histologic evaluation of the larynx in sudden infant death syndrome,” Annals of Otology, Rhinology, and Laryngology, vol. 100, pp. 173–175, 1991.

    Google Scholar 

  16. O. C. Irwin, “Infant speech: Development of vowel sounds,” J. Speech Hearing Dis., vol. 13, pp. 31–34, 1948.

    Google Scholar 

  17. F. A. Ismaelli, A. Manfredi, and P. Bruscaglion, “Parametric and non-parametric estimation of speech formants: application to infant cry,” Med. Eng. Phy., vol. 18, no.8, pp. 677–691, 1996.

    Article  Google Scholar 

  18. C. C. Johnston, and D. O’Shaughnessy, “Acoustical attributes of infant pain cries: Discriminating features,” in Proc. Vth World Congress on Pain, R. Dubner, G. F. Gebhart, and M. R. Bond (Eds), Elsevier, Amsterdam, 1988.

    Google Scholar 

  19. L.G. Kersta, “Voiceprint Identification,” Nature, vol. 196, pp. 1253–1257, 1962.

    Article  Google Scholar 

  20. R. D. Kent and A. D. Murray, “Acoustic features of infant vocalic utterances at 3, 6, and 9 months,” J. Acoust. Soc. Am., vol. 72, pp. 353–365, 1982.

    Article  Google Scholar 

  21. P. Lieberman, K. S. Harris, and P. Wolff, “Newborn infant cry in relation to nonhuman ­primate vocalizations,” J. Acoust. Soc. Amer., vol. 44, pp. 365 (A), 1968.

    Google Scholar 

  22. P. Lieberman, K. S. Harris, P. Wolff and L. H. Russell, “Newborn infant cry in relation to nonhuman primate vocalizations,” J. Speech and Hearing Res., vol. 14, pp. 718–727, 1971.

    Google Scholar 

  23. J. Makhoul, “Linear prediction: A tutorial review,” Proc. IEEE, vol. 63, pp.561–580, 1975.

    Article  Google Scholar 

  24. K. Michelsson, “Cry analyses of symptomless low-birth-weight neonares and of asphyxiated newborn infants,” Acta Paediat. Scand. Suppl. vol. 216, pp. 1–45, 1971.

    Google Scholar 

  25. K. Michelsson and P. Sirvio, “Cry analysis in congenital hypothyroidism,” Folia Phoniafrica, vol. 28, pp. 40–7, 1976.

    Article  Google Scholar 

  26. K. Michelsson, P. Sirvio, and O. Wasz-Hockert, “Sound spectrographic cry analysis of infants with bacterial meningitis,” Develop. Med. arid Child Neuro., vol. 19 (b), pp. 309–15, 1977.

    Google Scholar 

  27. K. S. R. Murty, B. Yegnanarayana and M. A. Joseph, “Characterization of glottal activity from speech signals,” IEEE Signal Process. Lett., vol. 16, no. 6, pp. 469–472, June. 2009.

    Article  Google Scholar 

  28. P. E Ostwald and T. Murry, “The communicative and diagnostic significance of infant sounds,” chapter 7, Infant Crying: Theoretical and Research Perspectives. Plenum Press, New York, NY, pp.139–158, 1985.

    Chapter  Google Scholar 

  29. Hemant A. Patil, “Infant identification from their cry,” 7th Int. Conf. Advances in Pattern Recognition ICAPR, ISI Kolkata, IEEE Computer Society, pp. 107–109, Feb. 4–6, 2009.

    Google Scholar 

  30. D. S. Paterson, F. L. Trachtenberg, F. G. Thompson et al., “Multiple serotonergic brainstem abnormalities in sudden infant death syndrome,” J. Amer. Med. Ass., vol. 296, no. 17, pp.2124–2132, November 2006.

    Article  Google Scholar 

  31. R. Prescott, “Infant cry sound: Developmental features,” J. Acoust. Soc. Amer., vol. 57, pp. 1186–1191, 1975.

    Article  Google Scholar 

  32. Athanassios Protopapasa and Peter D. Eimas, “Perceptual differences in infant cries revealed by modifications of acoustic features,” J. Acoust. Soc. Am., vol.102, no.6, pp. 3723–3734, December 1997.

    Article  Google Scholar 

  33. L. Rabiner, A. Rosenberg, and S. Levinson, “Considerations in dynamic time warping algorithms for discrete word recognition,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-26, pp. 575–582, Dec. 1978.

    Article  Google Scholar 

  34. Rohilah Sahak, Wahidah Mansor, Lee Yoot Khuan, Azlee Zabidi, Farah Yasmin, “An investigation into infant cry and Apgar score using principle component analysis,” 5th Int. Colloquium on Signal Proces. and its Applications (CSPA), pp.209–214, 2009.

    Google Scholar 

  35. H. Sakoe and S. Chiba, “Dynamic programming optimization for spoken word ­recognition,” IEEE Trans. Acoustics, Speech, Signal Proc., ASSP-26, no.1, pp/43–49, February 1978.

    Article  Google Scholar 

  36. R. Stark, and S. Nathanson, “Unusual features of cry in an infant dying suddenly and unexpectedly,” J. Bosma and J. Showacre (Eds.), Development of upper respiratory anatomy and function: implications for the Sudden Infant Death Syndrome, pp.323–352. Washington, DC: US Printing Office, 1975.

    Google Scholar 

  37. L. Sheppard and J. Kaufman, “Sorting out the role of air pollutants in asthma initiation,” Epidemiology, vol. 11, no.2, pp.100–101, March 2000.

    Article  Google Scholar 

  38. Janet L. Tenold, David H. Crowell, Richard H. Jones, Thomas H. Daniel, D. Frank McPherson, Arthur N. PoPper, “Cepstral and stationarity analyses of full term and ­premature infants’ cries,” J. Aooust. Soc. Am., vol. 56, No. 3, pp. 975–980, September 1974.

    Article  Google Scholar 

  39. B. Thach, “The potential role of airway obstruction in sudden infant death syndrome,” J. Culbertson, H. Krous and R. Bendell (Eds.), Sudden Infant Death Syndrome. Baltimore: Johns Hopkins University Press, pp. 62–93, 1989.

    Google Scholar 

  40. J. Thoden, A.-L. Jarvenpaa and K. Michelsson, “Sound spectrographic cry analysis of pain cry in prematures,” chapter 5. Infant Crying: Theoretical and Research Perspectives. Plenum Press, New York, New York, 1985.

    Google Scholar 

  41. S. Tonkin, “Airway occlusion as a possible cause of SIDS,” F. Robinson (Ed.), Sudden Infant Death Syndrome (SIDS) Canada: Canadian Foundation for the Study of Sudden Infant Death, pp. 34–97, 1974.

    Google Scholar 

  42. H. M. Truby and J. Lind, “Cry sounds of a newborn infant,” Acta Pediatric Scand., Suppl., vol. 163, pp.8–59, 1965.

    Article  Google Scholar 

  43. Antonio Verduzco-Mendoza, Emilio Arch-Tirado, Carlos A. Reyes García, Jaime Leybón Ibarra, and Juan Licona Bonilla, “Qualitative and quantitative crying analysis of new born babies delivered under high risk gestation,” A. Esposito et al. (Eds.): Multimodal Signals, LNAI, Springer-Verlag Berlin Heidelberg, vol. 5398, pp. 320–327, 2009.

    Google Scholar 

  44. O. Wasz-Hockert, V. Vuorenkoski, E. Valanne, K. Michelsson, “Sound spectrographic studies­ of the cry of newborn infants,” Experientia, vol. 15, no.18, pp. 583–584, 1962.

    Article  Google Scholar 

  45. O. Wasz-Hockert, E. Valanne, V. Vuorenkoski, K. Michelsson, A. Sovijarvi, “Analysis of some types of vocalization in the newborn and in early infancy,” Ann Paediatr Fenn., vol.9, pp.1–10, 1963.

    Google Scholar 

  46. O. Wasz-Hockert, T. Partanen, V. Vuorenkoski, E. Valanne, and K. Michelsson, “The ­identification of some specific meanings in infant vocalization,” Experiencia, vol. 20, pp. 154–156, 1964.

    Article  Google Scholar 

  47. O. Wasz-Hockert, J. Lind, V. Vuorenkoski, T. Partanen, and E. Valanne, “The infant cry- A spectrographic and auditory analysis”, Spastics International Medical Publications in Association with William Heinemann Medical Books Ltd., 1968.

    Google Scholar 

  48. O Wasz-Hockert, K. Michelsson, and J. Lind, “Twenty-Five Years of Scandinavian Cry Research,” chapter 4. Infant Crying: Theoretical and Research Perspectives. Plenum Press, New York, New York, 1985.

    Google Scholar 

  49. Wikipedia: Sudden Infant Death Syndrome.

    Google Scholar 

  50. P. H. Wolff, “The natural history of crying and other vocalizations in early infancy,’’ Determinants of Infant Behavior IV, M. Foss (Eds), Methuen, London, U.K. 1969.

    Google Scholar 

  51. Q. Xie, R. K. Ward and C. A. Laszlo, Automatic assessment of infant’s levels-of-distress from the cry signals,” IEEE Trans. on Speech and Audio Processing, vol. 4, no. 4, pp. 253–265, July 1996.

    Article  Google Scholar 

  52. Q. Xie, R. K. Ward, and C. A. Laszlo, “Determining normal infants’ level-of-distress from cry sounds,” Proc. of the 1993 Canadian Conf. on Electrical and Computer Engineering, pp. 1094–1096, Vancouver, B. C., September 14–17, 1993.

    Google Scholar 

  53. Naoto Yamane and Yoko Shimura, “The acoustic characteristics of infant cries,” J. Acoust. Soc. Am., vol. 120, no. 5, pt. 2, pp. 3136, November 2006.

    Google Scholar 

  54. B. Yegnanarayana, S. R. M. Prasanna, J.M. Zachariah and C. S. Gupta, “Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system”, IEEE Trans. Speech Audio Processing, vol.13, no.4, pp.575–582, July 2005.

    Article  Google Scholar 

  55. P. Zeskind, and B. Lester, “Acoustic features and auditory perceptions of the cries of newborns with prenatal and perinatal complications,” Child Development, vol. 49, pp. 580–589, 1978.

    Article  Google Scholar 

  56. P. Zeskind and B. Lester, “Analysis of cry features in newborns with differential fetal growth,” Child Development, vol. 52, pp. 207–212, 1981.

    Article  Google Scholar 

  57. T. F. Quatieri, Discrete-Time Speech Signal Processing: Principles and Practices. Pearson Education, 2002.

    Google Scholar 

  58. Why Cry @ Baby Crying Analyzer: http://www.showeryourbaby.com/whycrbacran1.html.

Download references

Acknowledgment

The author would like to thank DA-IICT authorities for their kind support to carry out this research work. He would also like to thank Ms. Neeharika Buddha, Dr. B. V. Adinarayana (KGH, Visakhapatnam) and Prof. B. Yegnanarayana of IIIT Hyderabad for their kind help and cooperation during this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hemant A. Patil .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Patil, H.A. (2010). “Cry Baby”: Using Spectrographic Analysis to Assess Neonatal Health Status from an Infant’s Cry. In: Neustein, A. (eds) Advances in Speech Recognition. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-5951-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-1-4419-5951-5_14

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-5950-8

  • Online ISBN: 978-1-4419-5951-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics