Abstract
Automatic bird species classification and identification are issues that have aroused interest in recent years. The main goals involve more exhaustive environmental monitoring and natural resources managing. One of the more relevant characteristics of calling birds is the vocalisation because this allows to recognise species or identify new ones, to know its natural history and macro-systematic relations, among others. In this work, some spectral-based features and extreme learning machines (ELM) are used to perform bird species classification. The experiments were carried on using 25 species of the family Furnariidae that inhabit the Paranaense Littoral region of Argentina (South America) and were validated in a cross-validation scheme. The results show that ELM classifier obtains high classification rates, more than 90% in accuracy, and the proposed features overperform the baseline features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
The internet bird collection. Ref. Rev. 26(8), 42–43 (2012)
Albornoz, E.M., Milone, D.H., Rufiner, H.L.: Spoken emotion recognition using hierarchical classifiers. Comput. Speech Lang. 25(3), 556–570 (2011)
Albornoz, E.M., Milone, D.H., Rufiner, H.L.: Feature extraction based on bio-inspired model for robust emotion recognition. Soft Comput. 21, 5145–5158 (2017)
Albornoz, E.M., Vignolo, L.D., Sarquis, J.A., Leon, E.: Automatic classification of furnariidae species from the paranaense littoral region using speech-related features and machine learning. Ecol. Inform. 38, 39–49 (2017)
Areta, J.I., Pearman, M.: Species limits and clinal variation in a widespread high andean furnariid: the buff-breasted earthcreeper (upucerthia validirostris). Condor 115(1), 131–142 (2013)
Ben-Israel, A., Greville, T.N.E.: Generalized Inverses: Theory and Applications, 2nd edn. Springer, New York (2001). https://doi.org/10.1007/b97366
Betts, M., Mitchell, D., Diamond, A., Bêty, J.: Uneven rates of landscape change as a source of bias in roadside wildlife surveys. J. Wildlife Manag. 71(7), 2266–2273 (2007)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Burkart, R., Bárbaro, N., Sánchez, R., Gómez, D.: Eco-Regiones de la Argentina. Administración de Parques Nacionales (APN). Secretaría de Recursos Naturales y Desarrollo Sostenible, Presidencia de la Nación Argentina (1999)
Byers, B.E.: Geographic variation of song form within and among chestnut-sided warbler populations. The Auk, pp. 288–299 (1996)
Chi, T., Ru, P., Shamma, S.A.: Multiresolution spectrotemporal analysis of complex sounds. J. Acoust. Soc. Am. 118(2), 887–906 (2005)
Contreras, J.R., Agnolin, F., Davies, Y.E., Godoy, I., Giacchino, A., Ríos., E.E.: Atlas ornitogeográfico de la provincia de Formosa. Vazquez Mazzini (2014)
Dufour, O., Artieres, T., Glotin, H., Giraudet, P.: Clusterized mel filter cepstral coefficients and support vector machines for bird song identification. In: Soundscape Semiotics - Localization and Categorization. InTech Open Book (2014)
Eyben, F., Weninger, F., Gross, F., Schuller, B.: Recent developments in openSMILE, the Munich open-source multimedia feature extractor. In: 21st ACM International Conference on Multimedia, pp. 835–838. Barcelona, Spain, October 2013
Fagerlund, S.: Bird species recognition using support vector machines. EURASIP J. Appl. Sig. Process. 2007(1), 64–64 (2007)
Ganchev, T.D., Jahn, O., Marques, M.I., de Figueiredo, J.M., Schuchmann, K.L.: Automated acoustic detection of vanellus chilensis lampronotus. Expert Syst. Appl. 42(15–16), 6098–6111 (2015)
Giannoulis, D., Benetos, E., Stowell, D., Rossignol, M., Lagrange, M., Plumbley, M.D.: Detection and classification of acoustic scenes and events: an IEEE AASP challenge. In: Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
Gütlein, M., Frank, E., Hall, M., Karwath, A.: Large-scale attribute selection using wrappers. In: IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2009, pp. 332–339. IEEE (2009)
Harris, C.M.: Absorption of sound in air versus humidity and temperature. J. Acoust. Soc. Am. 40(1), 148–159 (1966)
Hesler, N., Mundry, R., Dabelsteen, T.: Does song repertoire size in common blackbirds play a role in an intra-sexual context? J. Ornithol. 152(3), 591–601 (2011)
Huang, G., Huang, G.B., Song, S., You, K.: Trends in extreme learning machines: a review. Neural Netw. 61, 32–48 (2015)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2006). https://doi.org/10.1016/j.neucom.2005.12.126
ICML International Conference on Proceedings of 1st Workshop on Machine Learning for Bioacoustics - ICML4B (2013). http://sabiod.univ-tln.fr
Joly, A., et al.: LifeCLEF 2014: multimedia life species identification challenges. In: Kanoulas, E., et al. (eds.) CLEF 2014. LNCS, vol. 8685, pp. 229–249. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11382-1_20
Keen, S., Ross, J.C., Griffiths, E.T., Lanzone, M., Farnsworth, A.: A comparison of similarity-based approaches in the classification of flight calls of four species of north american wood-warblers (parulidae). Ecol. Inform. 21, 25–33 (2014)
Laje, R., Mindlin, G.B.: Highly structured duets in the song of the south American Hornero. Phys. Rev. Lett. 91(25), 258104 (2003)
Leon, E.J., et al.: Song structure of the golden-billed saltator (saltator aurantiirostris) in the middle parana river floodplain. Bioacoustics 24(2), 145–152 (2015)
Louette, M., Bijnens, L., Upoki Agenong’a, D., Fotso, R.: The utility of birds as bioindicators: case studies in equatorial africa. Belgian J. Zool. 125(1), 157–165 (1995)
Marler, P.: Three models of song learning: evidence from behavior. J. Neurobiol. 33(5), 501–516 (1997)
Narosky, T., Yzurieta, D.: Aves de Argentina y Uruguay-Birds of Argentina & Uruguay: Guía de Identificación Edición Total-A Field Guide Total Edition. Buenos Aires, 16 edn (2010)
Päckert, M., Martens, J., Kosuch, J., Nazarenko, A.A., Veith, M.: Phylogenetic signal in the song of crests and kinglets (Aves: Regulus). Evolution 57(3), 616–629 (2003)
Payne, R.B.: Song traditions in indigo buntings: origin, improvisation, dispersal, and extinction in cultural evolution. Ecol. Evol. Acoust. Commun. Birds 198–220 (1996)
Planqué, B., Vellinga, W.P.: Xeno-cano.org. http://www.xeno-canto.org. Accessed 10 July 2015
Potamitis, I.: Unsupervised dictionary extraction of bird vocalisations and new tools on assessing and visualising bird activity. Ecol. Inform. 26, Part 3, 6–17 (2015)
Potamitis, I., Ntalampiras, S., Jahn, O., Riede, K.: Automatic bird sound detection in long real-field recordings: applications and tools. Appl. Acoust. 80, 1–9 (2014)
Ptacek, L., Machlica, L., Linhart, P., Jaska, P., Muller, L.: Automatic recognition of bird individuals on an open set using as-is recordings. Bioacoustics 25(1), 1–19 (2015)
Raposo, M.A., Höfling, E.: Overestimation of vocal characters in suboscine taxonomy (Aves: Passeriformes: Tyranni): causes and implications. Lundiana 4(1), 35–42 (2003)
Roch, M.A., Soldevilla, M.S., Burtenshaw, J.C., Henderson, E.E., Hildebrand, J.A.: Gaussian mixture model classification of odontocetes in the Southern California bight and the Gulf of California. J. Acoust. Soc. Am. 121(3), 1737–1748 (2007)
Schuller, B., et al.: The INTERSPEECH 2013 Computational Paralinguistics Challenge: Social Signals, Conflict, Emotion, Autism. Proceedings of Interspeech, ISCA, pp. 148–152 (2013)
Stowell, D., Plumbley, M.D.: Segregating event streams and noise with a Markov renewal process model. J. Mach. Learn. Res. 14, 1891–1916 (2013)
Towsey, M., Wimmer, J., Williamson, I., Roe, P.: The use of acoustic indices to determine avian species richness in audio-recordings of the environment. Ecol. Inform. 21, 110–119 (2014)
Ventura, T.M., et al.: Audio parameterization with robust frame selection for improved bird identification. Expert Syst. Appl. 42(22), 8463–8471 (2015)
Weninger, F., Eyben, F., Schuller, B.W., Mortillaro, M., Scherer, K.R.: On the acoustics of emotion in audio: what speech, music, and sound have in common. Front. Emot. Sci. 4(292), 1–12 (2013)
Woolley, S.M., Fremouw, T.E., Hsu, A., Theunissen, F.E.: Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds. Nature Neurosc. 8(10), 1371–1379 (2005)
Yang, X., Wang, K., Shamma, S.A.: Auditory representations of acoustic signals. IEEE Trans. Inf. Theory 38(2), 824–839 (1992)
Zimmer, K.J., Whittaker, A.: The rufous cacholote (Furnariidae: Pseudoseisura) is two species. Condor 102(2), 409–422 (2000)
Zollinger, S.A., Brumm, H.: Why birds sing loud songs and why they sometimes don’t. Anim. Behav. 105, 289–295 (2015)
Acknowledgements
The authors wish to thank to Agencia Nacional de Promoción Científica y Tecnológica (ANPCyT)(with PICT-2015-977), Universidad Nacional del Litoral (with CAID-PJ-50020150100055LI and CAID-PJ-50020150100059LI) and Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), from Argentina, for their support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Albornoz, E.M., Vignolo, L.D., Sarquis, J.A., Martínez, C.E. (2018). Furnariidae Species Classification Using Extreme Learning Machines and Spectral Information. In: Simari, G., Fermé, E., Gutiérrez Segura, F., Rodríguez Melquiades, J. (eds) Advances in Artificial Intelligence - IBERAMIA 2018. IBERAMIA 2018. Lecture Notes in Computer Science(), vol 11238. Springer, Cham. https://doi.org/10.1007/978-3-030-03928-8_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-03928-8_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03927-1
Online ISBN: 978-3-030-03928-8
eBook Packages: Computer ScienceComputer Science (R0)