Bilingual Speech Recognition On the Edge Using Machine Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Remarkable advancements in speech recognition systems have led to significant growth in assistive technology for individuals with disabilities. This paper signifies a collaborative effort to develop an embedded speech recognition system equipped with Machine Learning (ML) capabilities. The low-cost standalone speech recognition system is designed to assist individuals with disabilities by enabling them to operate their household appliances effortlessly using simple voice commands. One distinctive feature of the system is its ability to understand both English and Arabic, specifically the Emirati dialect. The development process involves the acquisition of datasets in both Arabic and English and the training of various ML models which will then be integrated into a microcontroller. Substantial testing is conducted to select the best ML model, ensuring high system accuracy. The model with 3 convolutional layers which is trained for 30 epochs is determined to be the optimal choice among the models, all of which demonstrated excellent accuracy, precision, and recall scores, exceeding 97%.

Original languageBritish English
Title of host publication2024 7th International Conference on Signal Processing and Information Security, ICSPIS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350368673
DOIs
StatePublished - 2024
Event7th International Conference on Signal Processing and Information Security, ICSPIS 2024 - Dubai, United Arab Emirates
Duration: 12 Nov 202414 Nov 2024

Publication series

Name2024 7th International Conference on Signal Processing and Information Security, ICSPIS 2024

Conference

Conference7th International Conference on Signal Processing and Information Security, ICSPIS 2024
Country/TerritoryUnited Arab Emirates
CityDubai
Period12/11/2414/11/24

Keywords

  • CNN
  • Image Recognition
  • Machine Learning
  • Spectrograms
  • Speech Recognition

Fingerprint

Dive into the research topics of 'Bilingual Speech Recognition On the Edge Using Machine Learning'. Together they form a unique fingerprint.

Cite this