Scalable Retrieval-Augmented Generation for Context-Aware Educational Assistants: A Case Study on the NoteLeech AI Platform

Yadav, Roopal; Saxena, Kshitij; Jain, Rishabh; Dhakar, Suraj Singh; Raghuvanshi, Adarsh

doi:https://doi.org/10.55041/ijcope.v2i5.676

Volume 02, Issue 05

Published on: May 2026

SCALABLE RETRIEVAL-AUGMENTED GENERATION FOR CONTEXT-AWARE EDUCATIONAL ASSISTANTS: A CASE STUDY ON THE NOTELEECH AI PLATFORM

Roopal Yadav Kshitij Saxena Rishabh Jain Suraj Singh Dhakar Adarsh Raghuvanshi

Department of Information Technology, Indore Institute of Science and Technology, Indore, India

DOI:https://doi.org/10.55041/ijcope.v2i5.676

Article Status

Plagiarism Passed Peer Reviewed Open Access

Available Documents

Download PDF Review Report

Abstract

The rapid proliferation of digital educational re-sources has created a critical information retrieval challenge for students and researchers. Standard keyword-based search mechanisms are increasingly inadequate for navigating dense, highly technical academic corpora, while raw Large Language Models (LLMs) suffer from severe knowledge cutoffs and hal-lucinatory tendencies when queried on specific, localized study materials. This paper introduces the architecture and empirical evaluation of the NoteLeech AI platform, a highly optimized, cross-platform Retrieval-Augmented Generation (RAG) system engineered specifically for academic context extraction and synthesis. By integrating a multi-stage ingestion pipeline with hierarchical semantic chunking and a quantized dense vector database, NoteLeech AI dynamically bridges the gap between unstructured academic notes and generative AI. We propose a hybrid retrieval mechanism that combines Hierarchical Naviga-ble Small World (HNSW) dense vector search with BM25 sparse keyword matching, deployed alongside a lightweight Llama-3-8B generative endpoint. The system architecture is uniquely tailored to process highly technical datasets, including complex engineering mathematics and computer science syllabi. Extensive experimental evaluations utilizing a specialized corpus of GATE CSE preparation materials demonstrate that the NoteLeech RAG pipeline achieves a Recall@5 of 92.4% while restricting end-to-end question-answering latency to under 450ms. Furthermore, we present comprehensive ablation studies on semantic chunk sizes and embedding dimensionality, proving that hybrid retrieval drastically reduces hallucination rates by over 87% compared to zero-shot LLM baselines. The findings establish that optimally tuned RAG architectures provide a robust, highly accurate, and scalable solution for personalized educational technologies, fundamentally altering how students interact with unstructured knowledge bases.

Index Terms—Retrieval-Augmented Generation, NoteLeech AI, Large Language Models, Vector Databases, Educational Technology, Natural Language Processing, Semantic Search.

How to Cite this Paper

Yadav, R., Saxena, K., Jain, R., Dhakar, S. S. & Raghuvanshi, A. (2026). Scalable Retrieval-Augmented Generation for Context-Aware Educational Assistants: A Case Study on the NoteLeech AI Platform. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(05). https://doi.org/10.55041/ijcope.v2i5.676

Yadav, Roopal, et al.. "Scalable Retrieval-Augmented Generation for Context-Aware Educational Assistants: A Case Study on the NoteLeech AI Platform." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i5.676.

Yadav, Roopal,Kshitij Saxena,Rishabh Jain,Suraj Dhakar, and Adarsh Raghuvanshi. "Scalable Retrieval-Augmented Generation for Context-Aware Educational Assistants: A Case Study on the NoteLeech AI Platform." International Journal of Creative and Open Research in Engineering and Management 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i5.676.

Search & Index

References

Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Ku¨ttler, M. Lewis, W. Yih, T. Rockta¨schel, and S. Riedel, “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks,” in Ad-vances in Neural Information Processing Systems, vol. 33, pp. 9459–9474, 2020.

A. Malkov and D. A. Yashunin, “Efficient and robust approxi-mate nearest neighbor search using hierarchical navigable small world graphs,” IEEE Transactions on Pattern Analysis and Machine Intelli-gence, vol. 42, no. 4, pp. 824–836, April 2020.

Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez,

Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances in Neural Information Processing Systems, vol. 30, 2017.

Devlin, M. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019, pp. 4171–4186.

Wang, F. Wei, L. Dong, H. Bao, N. Yang, and M. Zhou, “MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers,” in Advances in Neural Information Processing Systems, vol. 33, pp. 5776–5788, 2020.

Robertson, S. Zaragoza, and M. Taylor, “Simple BM25 extension to multiple weighted fields,” in Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, 2004, pp. 42–49.

Ethical Compliance & Review Process

•All submissions are screened under plagiarism detection.
•Review follows editorial policy.
•Authors retain copyright.
•Peer Review Type: Double-Blind Peer Review
•Published on: May 22 2026

CCBYNC

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

View License

Back to Volume 02, Issue 05 View All Issues Next Article

← Previous Article

Sales Forecasting Using Machine Learning

Next Article →

Scheme Villa: A Zone-Based B2B Supply Chain Management Platform