AI-Based Real-Time Speech-to-Sign Language Translation System for Assistive Communication

Siddhu, Bhukya

doi:https://doi.org/10.55041/ijcope.v2i5.279

Volume 02, Issue 05

Published on: May 2026

AI-BASED REAL-TIME SPEECH-TO-SIGN LANGUAGE TRANSLATION SYSTEM FOR ASSISTIVE COMMUNICATION

Bhukya Siddhu

Y. Dayanand Kumar

Department of Computer Science & AI Central University of Andhra Pradesh

DOI:https://doi.org/10.55041/ijcope.v2i5.279

Article Status

Plagiarism Passed Peer Reviewed Open Access

Available Documents

Download PDF Review Report

Abstract

This paper presents the design and development of an Artificial Intelligence (AI)-based, real-time Speech-to- Indian Sign Language (ISL) Translation System that bridges the communication gap between the hearing population and the deaf and hard-of-hearing community. The proposed system captures spoken audio via a standard microphone, converts it to text using cloud-based Automatic Speech Recognition (ASR), applies Natural Language Processing (NLP) techniques including tokenization, stop-word removal, and sentence sim- plification, and maps the resulting tokens to pre-recorded ISL GIF animations for sequential visual display. Implemented as a software-only Flask web application, the system requires no specialized hardware such as sensor gloves or motion-capture devices, enabling deployment on any standard browser-equipped device. A modular pipeline architecture ensures that individual components can be independently upgraded without disrupting the overall system. Experimental evaluation demonstrates high gesture-mapping accuracy for common vocabulary in controlled conditions, with near-real-time response latency suitable for conversational use. Out-of-vocabulary words are handled via a finger-spelling fallback mechanism, ensuring uninterrupted communication. Results confirm that an accessible, cost-effective, and scalable speech-to-sign translation tool can be realized using widely available web technologies and AI-driven NLP, address- ing limitations of existing hardware-dependent approaches and advancing social inclusion for the deaf community.

Index Terms—Automatic Speech Recognition (ASR), Indian Sign Language (ISL), Natural Language Processing (NLP), As- sistive Communication Technology, GIF-Based Gesture Mapping, Real-Time Translation, Flask Web Application.

How to Cite this Paper

Siddhu, B. (2026). AI-Based Real-Time Speech-to-Sign Language Translation System for Assistive Communication. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(05). https://doi.org/10.55041/ijcope.v2i5.279

Siddhu, Bhukya. "AI-Based Real-Time Speech-to-Sign Language Translation System for Assistive Communication." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i5.279.

Siddhu, Bhukya. "AI-Based Real-Time Speech-to-Sign Language Translation System for Assistive Communication." International Journal of Creative and Open Research in Engineering and Management 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i5.279.

Search & Index

References

[1]     World Health Organization, “World report on hearing,” WHO Press, Geneva, Switzerland, 2021.

[2]     J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” in Proc. NAACL-HLT, Minneapolis, MN, 2019, pp. 4171–4186.

[3]     S. Ko, J. Kim, and H. Lee, “Speech-to-sign language translation using deep learning,” Int. J. Artif. Intell. Res., vol. 8, no. 2, pp. 112–120, 2019.

[4]     A. Graves, A. R. Mohamed, and G. E. Hinton, “Speech recognition with deep recurrent neural networks,” in Proc. IEEE ICASSP, Vancouver, BC, 2013, pp. 6645–6649.

[5]     G. E. Hinton, L. Deng, D. Yu, G. E. Dahl et al., “Deep neural networks for acoustic modeling in speech recognition,” IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82–97, 2012.

[6]     D. Jurafsky and J. H. Martin, Speech and Language Processing, 3rd ed. Prentice Hall, 2023.

[7]     S. Bird, E. Loper, and E. Klein, Natural Language Processing with Python. O’Reilly Media, 2009.

[8]     N. C. Camgoz, S. Hadfield, O. Koller, and R. Bowden, “Neural sign language translation,” in Proc. IEEE CVPR, Salt Lake City, UT, 2018,7784–7793.

[9]     A. Zhang, “SpeechRecognition: A library for performing speech recog- nition,” GitHub, 2022. [Online]. Available: https://github.com/Uberi/ speech recognition

[10]    O. Koller, H. Ney, and R. Bowden, “Continuous sign language recogni- tion: Towards large vocabulary systems,” Comput. Vis. Image Underst., vol. 141, pp. 108–125, 2015.

Ethical Compliance & Review Process

•All submissions are screened under plagiarism detection.
•Review follows editorial policy.
•Authors retain copyright.
•Peer Review Type: Double-Blind Peer Review
•Published on: May 09 2026

CCBYNC

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

View License

Back to Volume 02, Issue 05 View All Issues Next Article

← Previous Article

AI-Based Micro Credit Assessment System

Next Article →

AI-Based Smart Animal Detection and Crop Protection System Using Sensors and Sound Alert Mechanism