Home
International Journal of Science and Research Archive
International, Peer reviewed, Open access Journal ISSN Approved Journal No. 2582-8185

Main navigation

  • Home
  • Past Issues

Efficacy of Artificial Intelligence for gender classification in speech signals

Breadcrumb

  • Home
  • Efficacy of Artificial Intelligence for gender classification in speech signals

Olumide Olayode Ajayi 1, *, Ayo Isaac Oyedeji 2, Olusogo Julius Adetunji 2 and Janet Olubunmi Jooda 3

1 Department of Electrical and Electronics Engineering, Faculty of Engineering, Adeleke University, Ede, Nigeria. 

2 Department of Computer Engineering, Faculty of Engineering, Olabisi Onabanjo University, Ago Iwoye, Nigeria.

3 Department of Computer Engineering, Faculty of Engineering, Redeemer’s University, Ede, Nigeria.

Research Article

International Journal of Science and Research Archive, 2025, 16(01), 1818-1829

Article DOI: 10.30574/ijsra.2025.16.1.2238

DOI url: https://doi.org/10.30574/ijsra.2025.16.1.2238

Received on 12 June 2025; revised on 24 July 2025; accepted on 26 July 2025

The classification or recognition of human voices are required for different applications such as speech emotion recognition, medicals, communications software, and security. However, gender classification of speech signals is complex aspect of speech recognition systems; thereby requiring robust signal processing strategies. This paper investigates the efficacy of Artificial Intelligence (AI) for gender classification in speech signals. Two different AI-based gender voice classifiers namely K-Nearest Neighbor (K-NN) and Long Short-Term Memory (LSTM) were developed. First, speech signals were recorded from different male and female speakers at a sampling rate of 48 kHz. Each of the raw speech signals was filtered and the useful portion of the signal was segmented. The Mel Frequency Cepstral Coefficient (MFCC) and Mel Spectrum (MS) features were extracted from each signal via framing, hamming window, and FFT. An equal number of observations each for the male and female classes were generated. The total 2000 observations were partitioned into 80% for training and 20% for testing. The training dataset was used to train both the K-NN and LSTM classifiers.  The results obtained from testing with the testing dataset showed that the K-NN classifier gave precision, recall, accuracy and F1-score values of 0.9852, 0.9850, 0.9925 and 0.9851, respectively, whereas the LSTM classifier gave 0.9132, 0.9091, 0.9525, and 0.9091, respectively. The classifiers achieve more than 0.95 (or 95%) classification accuracy; thereby demonstrating the efficacy of the AI strategies in distinguishing between a male voice and a female voice.

Artificial Intelligence (AI); Speech Signal; Gender Voice; Deep Learning (Dl); Machine Learning (Ml)

https://journalijsra.com/sites/default/files/fulltext_pdf/IJSRA-2025-2238.pdf

Preview Article PDF

Olumide Olayode Ajayi, Ayo Isaac Oyedeji, Olusogo Julius Adetunji and Janet Olubunmi Jooda. Efficacy of Artificial Intelligence for gender classification in speech signals. International Journal of Science and Research Archive, 2025, 16(01), 1818-1829. Article DOI: https://doi.org/10.30574/ijsra.2025.16.1.2238

Copyright © 2025 Author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution Liscense 4.0

Footer menu

  • Contact

Copyright © 2026 International Journal of Science and Research Archive - All rights reserved

Developed & Designed by VS Infosolution