Gender Recognition Inclusive with Transgender from Speech Classification

  • Ghazaala YasminEmail author
  • Omkar Mullick
  • Arijit Ghosal
  • Asit K. Das
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 755)


Automatic gender classification system has prompted a pertinent of increasing amount of applications, particularly the rise of social platforms and criminal investigation. Focus of substantial past researches was limited towards the discrimination of male and female gender only. Recently transgender has achieved legal recognition. So, any gender classification system should consider this third gender also. But unfortunately there is a lack of good gender classification system which can discriminate all the three types of gender well. This proposed work uses judiciously chosen acoustic features for classification of three classes of genders from their solo voice. The proposed system has been pursued with the sampled audio data extracted from audio signal. From the sampled data, acoustic features like tempo, pitch and spectral flux have been extracted using the idea of pattern recognition. The extracted feature set has been served for classification to predict the gender of a given unknown voice.


Pitch Tempo Spectral flux Speech recognition Gender classification 



“This chapter does not contain any studies with human participants or animals performed by any of the authors.”


  1. 1.
    Ali, Md.S., Islam, Md.S., Hossain, Md.A.: Gender recognition system using speech signal. Int. J. Comput. Sci. Eng. Inf. Technol. (IJCSEIT) 2.1, 1–9 (2012)Google Scholar
  2. 2.
    Alías, F., Socoró, J. C., Sevillano, X.: A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds. Appl. Sci. 6.5, 143 (2016)Google Scholar
  3. 3.
    Subramanian, H., Rao, P., Roy, S.D.: Audio signal classification. In: EE Dept, IIT Bombay, pp. 1–5 (2004)Google Scholar
  4. 4.
    Bach, J.H., Anemüller, J., Kollmeier, B.: Robust speech detection in real acoustic backgrounds with perceptually motivated features. Speech Commun. 53(5), 690–706 (2011)CrossRefGoogle Scholar
  5. 5.
    Richard, G., Sundaram, S., Narayanan, S.: An overview on perceptually motivated audio indexing and classification. Proc. IEEE 101(9), 1939–1954 (2013)CrossRefGoogle Scholar
  6. 6.
    Harb, H., Chen, L.: Gender identification using a general audio classifier. In: Proceedings of 2003 International Conference on Multimedia and Expo, 2003. ICME’03, vol. 2, pp. II–733. IEEE (2003)Google Scholar
  7. 7.
    Ghosal, A., Dutta S.: Automatic male-female voice discrimination. In: Issues and Challenges in Intelligent Computing Techniques (ICICT), pp. 731–735. IEEE (2014)Google Scholar
  8. 8.
    Pahwa, A., Aggarwal, G.: Speech feature extraction for gender recognition. Int. J. Image Graph. Signal Process. 8(9), 17–25 (2016)CrossRefGoogle Scholar
  9. 9.
    Kumar, N., et al.: Robust multichannel gender classification from speech in movie audio. Interspeech 2016, 2233–2237 (2016)CrossRefGoogle Scholar
  10. 10.
    Jabid, T., Kabir, Md.H., Chae, O.: Gender classification using local directional pattern (LDP). In: 2010 20th International Conference on Pattern Recognition (ICPR), pp. 2162–2165. IEEE (2010)Google Scholar
  11. 11.
    Lartillot, O., Toiviainen, P., Eerola, T.: A matlab toolbox for music information retrieval. In: Data Analysis, Machine Learning and Applications, pp. 261–268. Springer, Berlin, Heidelberg (2008)Google Scholar
  12. 12.
    Müller, M., Ewert, S.: Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR) (2012)Google Scholar
  13. 13.
    Grosche, P., Müller, M.: Extracting predominant local pulse information from music recordings. IEEE Trans. Audio Speech Lang. Process. 19(6), 1688–1701 (2011)CrossRefGoogle Scholar
  14. 14.
    Srivastava, S.: Weka: a tool for data preprocessing, classification, ensemble, clustering and association rule mining. Int. J. Comput. Appl. 88, 10 (2014)Google Scholar
  15. 15.
    Malhi, A., Gao, R.X.: PCA-based feature selection scheme for machine defect classification. IEEE Trans. Instrum. Meas. 53(6), 1517–1525 (2004)CrossRefGoogle Scholar
  16. 16.
    Grosche, P., Müller, M.: Tempogram toolbox: matlab implementations for tempo and pulse analysis of music recordings. In: Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR), Miami, FL, USA (2011)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • Ghazaala Yasmin
    • 1
    Email author
  • Omkar Mullick
    • 2
  • Arijit Ghosal
    • 3
  • Asit K. Das
    • 4
  1. 1.Department of Computer Science & EngineeringSt. Thomas’ College of Engineering and TechnologyKolkataIndia
  2. 2.Department of Electronics and Communication EngineeringSt. Thomas’ College of Engineering and TechnologyKolkataIndia
  3. 3.Department of Information TechnologySt. Thomas’ College of Engineering and TechnologyKolkataIndia
  4. 4.Department of Computer Science and TechnologyIndian Institute of Engineering Science and TechnologyShibpur, HowrahIndia

Personalised recommendations