Abstract
Infant cry analysis is a multidisciplinary area of research incorporating pediatrics, neurology, physiology, engineering, developmental linguistics, and psychology. It has been proposed in the pediatric literature that the infant cry is a reflection of complex neurophysiologic functions and that analysis of the cry itself can be used to assess the status of the infant’s health. Given the diagnostic importance of infant cry, this chapter presents application of spectrographic analysis to the vocal sounds of an infant, comparing normal with abnormal infant cry. Drawing from a rich body of research on spectrographic analysis predominantly used for performance of speaker recognition, this chapter presents how such spectral features that are used to identify and verify speakers can be applied to assess the neonate’s health status, by comparing a normal to an abnormal cry. Ten distinct cry modes, viz., hyperphonation, dysphonation, inhalation, double harmonic break, trailing, vibration, weak vibration, flat, rising, and falling have been identified for normal infant cry and their spectrographic patterns were observed. This analysis was then extended to abnormal infant cry. It has been observed that the double harmonic break is more dominant for abnormal infant cry in cases of myalgia (muscular pain). The inhalation pattern is distinct for infants suffering from asthma or other respiratory ailments such as a cough or cold. For example, for the infant whose larynx is not well developed, the pitch harmonics are nearly absent. As such, there are no voicing or glottal vibrations in the cry signal. In addition, for infants with Hypoxic Ischemic Encephalopathy (HIE), there is an initial tendency of pitch harmonics to rise and then to be followed by a blurring of such harmonics. Finally, an infant cry classification system is analyzed by observing the nature of the optimal warping path in the Dynamic Time Warping (DTW) algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
SIDS-is a syndrome marked by the sudden death of an infant that is unexpected by history and remains unexplained after a thorough forensic autopsy, a detailed crime scene investigation, and an exploration of the medical history of the infant and family. SIDS was responsible for 0.543 deaths per 1,000 live births in the U.S. in 2005 [49,36,39,41]. According to a recent study, babies who die of SIDS have abnormalities in brain stem (the medulla oblongata) which helps in control functions like breathing (which in turn may affect infant cry), blood pressure, arousal and abnormalities in serotonin signaling. According to the National Institute of Health (NIH), which funded the study, this finding is the strongest evidence to date that the structural difference in a specific part of the brain may contribute to the risk of SIDS [30,49]. Colton and Steinschneider analyzed cry of SIDS victim and correlated the cause of SIDS with relatively lower Fo, longer duration, lower formant frequencies and greater sound pressure level throughout the spectrum [6]. Corwin et. al. observed that infants whose first cries exhibited a high first formant were more likely to die of SIDS than infants whose first cries did not have this characteristics [5]. In a groundbreaking study of 74 larynges removed at death from children who died of SIDS reported by Harrison, it was observed that SIDS can be attributed to reduction in the subglottic area (particularly around the age of 3 months), which is potentially lethal. The reduction in subglottic airway is often secondary to an increase in mucus-secreting glands caused by an upper respiratory tract infection, all of which affects infant cry [15]. These pioneering studies indicate that the infants with SIDS have significant abnormalities in their cry signal and hence changes in spectral characteristics. Moreover, imagine a situation where an infant cry analyzer is developed to increase doctors’ confidence in making decisions about slow developing abnormalities in infants from their cry beforehand. In such cases, suitable steps in the form of drugs or therapy could be given to save the life of infant. In addition, the work on cry analysis of infants whose larynx is not fully developed (e.g. laryngomalacia) is scant. This present work may constitute a crucial step in filling this gap.
References
V. Apgar, “A proposal for a new method of evaluation of the newborn infant,” Curr. Res. Anesth. Analg., vol. 32, pp. 260–267, 1953.
J. F. Bosma, H. M. Truby, and J. Lind, “Cry motions of the newborn infant,” Acta Paediat. Scand. Suppl., vol. 163, pp. 61–92, 1965.
Neeharika Buddha and Hemant A. Patil, “Corpora for analysis of infant cry,” Int. Conf. on Speech Databases and Assessments, Oriental COCOSDA, Hanoi, Vietnam, pp. 43–48, Dec. 4–6, 2007.
M. Corwin, and H. Golub, “Medical applications of infant cry analysis,” A. Milunsky, E. Friedman, and L. Gluck, Advances in prenatal medicine, New York: Plenum Press, vol. 4, pp. 163–188, 1985.
M. J. Corwin, B. M. Lester, C. Sepkoski, Peucker, M. Kayne, H., H. L. Golub, “Newborn acoustic cry characteristics of infants subsequently dying of sudden infant death syndrome,” Pediatrics, vol. 96, no. 1, pp. 73–77, July 1995.
R. Colton and A. Steinschneider, “The cry characteristics of an infant who died of sudden infant death syndrome,” J. Speech, Hearing Dis., vol. 46, pp. 359–363, 1981.
K. John Cullen Jr., Nancy Fargo, Richard A. Chase, Peggy Baker, “The development of auditory feedback monitoring: I Delayed auditory feedback studies on infant cry,” J. Speech, Hearing Research, vol. 11, pp.85–93 1968.
G. Fant. Acoustic Theory of Speech Production. The Hague: Mouton, 1960.
V. Fisichelli, M. Coxe, L. Rosenfeld, A. Haber, J. Davis, and A. Karelitz, “The phonetic content of the cries of normal infants and those with brain damage,” Journal of Psychology, vol. 64, pp. 119–126, 1966.
B. F. Fuller and Y. Horii, “Spectral energy distribution in four types of infant vocalizations,” J. Commun. Disord., vol. 21, pp. 251–261, 1988.
L. Gray, “Signal detection and analysis of delays in neonates vocalization,” J. Acoust. Soc. Am., vol. 82, pp. 1608–1611, 1987.
Susan M. Grau, Michael P. Robb, Anthony T. Cacace, “Acoustic correlates of inspiratory phonation during infant cry: Research note,” Journal of Speech and Hearing Research, vol. 38, pp. 373–381, April 1995.
H. L. Golub and M. J. Corwin, “A physioacoustic model of infant cry,” in Infant Crying: Theoretical And Research Perspective, B. M. Lester and C. F. Z. Boukydis, Eds. New York, Plenum, pp.59–81, 1985.
G. E. Gustafson and J. A. Green, “On the importance of fundamental frequency and other acoustic features in cry perception and infant development,” Child Developmerit, vol. 60, pp. 772–80, 1989.
D. Harrison, “Histologic evaluation of the larynx in sudden infant death syndrome,” Annals of Otology, Rhinology, and Laryngology, vol. 100, pp. 173–175, 1991.
O. C. Irwin, “Infant speech: Development of vowel sounds,” J. Speech Hearing Dis., vol. 13, pp. 31–34, 1948.
F. A. Ismaelli, A. Manfredi, and P. Bruscaglion, “Parametric and non-parametric estimation of speech formants: application to infant cry,” Med. Eng. Phy., vol. 18, no.8, pp. 677–691, 1996.
C. C. Johnston, and D. O’Shaughnessy, “Acoustical attributes of infant pain cries: Discriminating features,” in Proc. Vth World Congress on Pain, R. Dubner, G. F. Gebhart, and M. R. Bond (Eds), Elsevier, Amsterdam, 1988.
L.G. Kersta, “Voiceprint Identification,” Nature, vol. 196, pp. 1253–1257, 1962.
R. D. Kent and A. D. Murray, “Acoustic features of infant vocalic utterances at 3, 6, and 9 months,” J. Acoust. Soc. Am., vol. 72, pp. 353–365, 1982.
P. Lieberman, K. S. Harris, and P. Wolff, “Newborn infant cry in relation to nonhuman primate vocalizations,” J. Acoust. Soc. Amer., vol. 44, pp. 365 (A), 1968.
P. Lieberman, K. S. Harris, P. Wolff and L. H. Russell, “Newborn infant cry in relation to nonhuman primate vocalizations,” J. Speech and Hearing Res., vol. 14, pp. 718–727, 1971.
J. Makhoul, “Linear prediction: A tutorial review,” Proc. IEEE, vol. 63, pp.561–580, 1975.
K. Michelsson, “Cry analyses of symptomless low-birth-weight neonares and of asphyxiated newborn infants,” Acta Paediat. Scand. Suppl. vol. 216, pp. 1–45, 1971.
K. Michelsson and P. Sirvio, “Cry analysis in congenital hypothyroidism,” Folia Phoniafrica, vol. 28, pp. 40–7, 1976.
K. Michelsson, P. Sirvio, and O. Wasz-Hockert, “Sound spectrographic cry analysis of infants with bacterial meningitis,” Develop. Med. arid Child Neuro., vol. 19 (b), pp. 309–15, 1977.
K. S. R. Murty, B. Yegnanarayana and M. A. Joseph, “Characterization of glottal activity from speech signals,” IEEE Signal Process. Lett., vol. 16, no. 6, pp. 469–472, June. 2009.
P. E Ostwald and T. Murry, “The communicative and diagnostic significance of infant sounds,” chapter 7, Infant Crying: Theoretical and Research Perspectives. Plenum Press, New York, NY, pp.139–158, 1985.
Hemant A. Patil, “Infant identification from their cry,” 7th Int. Conf. Advances in Pattern Recognition ICAPR, ISI Kolkata, IEEE Computer Society, pp. 107–109, Feb. 4–6, 2009.
D. S. Paterson, F. L. Trachtenberg, F. G. Thompson et al., “Multiple serotonergic brainstem abnormalities in sudden infant death syndrome,” J. Amer. Med. Ass., vol. 296, no. 17, pp.2124–2132, November 2006.
R. Prescott, “Infant cry sound: Developmental features,” J. Acoust. Soc. Amer., vol. 57, pp. 1186–1191, 1975.
Athanassios Protopapasa and Peter D. Eimas, “Perceptual differences in infant cries revealed by modifications of acoustic features,” J. Acoust. Soc. Am., vol.102, no.6, pp. 3723–3734, December 1997.
L. Rabiner, A. Rosenberg, and S. Levinson, “Considerations in dynamic time warping algorithms for discrete word recognition,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-26, pp. 575–582, Dec. 1978.
Rohilah Sahak, Wahidah Mansor, Lee Yoot Khuan, Azlee Zabidi, Farah Yasmin, “An investigation into infant cry and Apgar score using principle component analysis,” 5th Int. Colloquium on Signal Proces. and its Applications (CSPA), pp.209–214, 2009.
H. Sakoe and S. Chiba, “Dynamic programming optimization for spoken word recognition,” IEEE Trans. Acoustics, Speech, Signal Proc., ASSP-26, no.1, pp/43–49, February 1978.
R. Stark, and S. Nathanson, “Unusual features of cry in an infant dying suddenly and unexpectedly,” J. Bosma and J. Showacre (Eds.), Development of upper respiratory anatomy and function: implications for the Sudden Infant Death Syndrome, pp.323–352. Washington, DC: US Printing Office, 1975.
L. Sheppard and J. Kaufman, “Sorting out the role of air pollutants in asthma initiation,” Epidemiology, vol. 11, no.2, pp.100–101, March 2000.
Janet L. Tenold, David H. Crowell, Richard H. Jones, Thomas H. Daniel, D. Frank McPherson, Arthur N. PoPper, “Cepstral and stationarity analyses of full term and premature infants’ cries,” J. Aooust. Soc. Am., vol. 56, No. 3, pp. 975–980, September 1974.
B. Thach, “The potential role of airway obstruction in sudden infant death syndrome,” J. Culbertson, H. Krous and R. Bendell (Eds.), Sudden Infant Death Syndrome. Baltimore: Johns Hopkins University Press, pp. 62–93, 1989.
J. Thoden, A.-L. Jarvenpaa and K. Michelsson, “Sound spectrographic cry analysis of pain cry in prematures,” chapter 5. Infant Crying: Theoretical and Research Perspectives. Plenum Press, New York, New York, 1985.
S. Tonkin, “Airway occlusion as a possible cause of SIDS,” F. Robinson (Ed.), Sudden Infant Death Syndrome (SIDS) Canada: Canadian Foundation for the Study of Sudden Infant Death, pp. 34–97, 1974.
H. M. Truby and J. Lind, “Cry sounds of a newborn infant,” Acta Pediatric Scand., Suppl., vol. 163, pp.8–59, 1965.
Antonio Verduzco-Mendoza, Emilio Arch-Tirado, Carlos A. Reyes García, Jaime Leybón Ibarra, and Juan Licona Bonilla, “Qualitative and quantitative crying analysis of new born babies delivered under high risk gestation,” A. Esposito et al. (Eds.): Multimodal Signals, LNAI, Springer-Verlag Berlin Heidelberg, vol. 5398, pp. 320–327, 2009.
O. Wasz-Hockert, V. Vuorenkoski, E. Valanne, K. Michelsson, “Sound spectrographic studies of the cry of newborn infants,” Experientia, vol. 15, no.18, pp. 583–584, 1962.
O. Wasz-Hockert, E. Valanne, V. Vuorenkoski, K. Michelsson, A. Sovijarvi, “Analysis of some types of vocalization in the newborn and in early infancy,” Ann Paediatr Fenn., vol.9, pp.1–10, 1963.
O. Wasz-Hockert, T. Partanen, V. Vuorenkoski, E. Valanne, and K. Michelsson, “The identification of some specific meanings in infant vocalization,” Experiencia, vol. 20, pp. 154–156, 1964.
O. Wasz-Hockert, J. Lind, V. Vuorenkoski, T. Partanen, and E. Valanne, “The infant cry- A spectrographic and auditory analysis”, Spastics International Medical Publications in Association with William Heinemann Medical Books Ltd., 1968.
O Wasz-Hockert, K. Michelsson, and J. Lind, “Twenty-Five Years of Scandinavian Cry Research,” chapter 4. Infant Crying: Theoretical and Research Perspectives. Plenum Press, New York, New York, 1985.
Wikipedia: Sudden Infant Death Syndrome.
P. H. Wolff, “The natural history of crying and other vocalizations in early infancy,’’ Determinants of Infant Behavior IV, M. Foss (Eds), Methuen, London, U.K. 1969.
Q. Xie, R. K. Ward and C. A. Laszlo, “Automatic assessment of infant’s levels-of-distress from the cry signals,” IEEE Trans. on Speech and Audio Processing, vol. 4, no. 4, pp. 253–265, July 1996.
Q. Xie, R. K. Ward, and C. A. Laszlo, “Determining normal infants’ level-of-distress from cry sounds,” Proc. of the 1993 Canadian Conf. on Electrical and Computer Engineering, pp. 1094–1096, Vancouver, B. C., September 14–17, 1993.
Naoto Yamane and Yoko Shimura, “The acoustic characteristics of infant cries,” J. Acoust. Soc. Am., vol. 120, no. 5, pt. 2, pp. 3136, November 2006.
B. Yegnanarayana, S. R. M. Prasanna, J.M. Zachariah and C. S. Gupta, “Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system”, IEEE Trans. Speech Audio Processing, vol.13, no.4, pp.575–582, July 2005.
P. Zeskind, and B. Lester, “Acoustic features and auditory perceptions of the cries of newborns with prenatal and perinatal complications,” Child Development, vol. 49, pp. 580–589, 1978.
P. Zeskind and B. Lester, “Analysis of cry features in newborns with differential fetal growth,” Child Development, vol. 52, pp. 207–212, 1981.
T. F. Quatieri, Discrete-Time Speech Signal Processing: Principles and Practices. Pearson Education, 2002.
Why Cry @ Baby Crying Analyzer: http://www.showeryourbaby.com/whycrbacran1.html.
Acknowledgment
The author would like to thank DA-IICT authorities for their kind support to carry out this research work. He would also like to thank Ms. Neeharika Buddha, Dr. B. V. Adinarayana (KGH, Visakhapatnam) and Prof. B. Yegnanarayana of IIIT Hyderabad for their kind help and cooperation during this work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Patil, H.A. (2010). “Cry Baby”: Using Spectrographic Analysis to Assess Neonatal Health Status from an Infant’s Cry. In: Neustein, A. (eds) Advances in Speech Recognition. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-5951-5_14
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5951-5_14
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5950-8
Online ISBN: 978-1-4419-5951-5
eBook Packages: EngineeringEngineering (R0)