Doc Assist: Intelligent Document Processing Assistance for Enhanced Accessibility

Authors

  • Chandana Kasibatta UG, CSE (AI&ML) Engineering, Sphoorthy Engineering College, JNTUH, Hyderabad, Telangana, India Author
  • Shreyas Suvarna UG, CSE (AI&ML) Engineering, Sphoorthy Engineering College, JNTUH, Hyderabad, Telangana, India Author
  • Nikhil Chakravarthula UG, CSE (AI&ML) Engineering, Sphoorthy Engineering College, JNTUH, Hyderabad, Telangana, India Author
  • Tagore satya narayana UG, CSE (AI&ML) Engineering, Sphoorthy Engineering College, JNTUH, Hyderabad, Telangana, India Author
  • Mohd Ayaz Uddin Assistant Professor, Department of Computer Science & Engineering (AI&ML), Sphoorthy Engineering College, JNTUH, Hyderabad, Telangana, India Author
  • Dr. M. Ramesh Professor & Head of the Department, Department of Computer Science & Engineering (AI&ML), Sphoorthy Engineering College, JNTUH, Hyderabad, Telangana, India Author

DOI:

https://doi.org/10.47392/IRJAEH.2025.0324

Keywords:

OCR, BM25, Semantic Search, Word Embedding’s, Document Retrieval

Abstract

This project presents a desktop assistant designed to retrieve information from non-machine-readable documents, such as scanned images and PDFs. Using Tesseract OCR, the system extracts text, and BM25 is employed for effective document ranking based on user-provided keywords. Additionally, word embeddings are integrated to improve semantic search accuracy. The application is built with Tkinter, offering an intuitive, offline experience. The system's architecture is optimized for quick document retrieval, ensuring minimal resource consumption while maintaining relevance. This documentation covers the design, implementation, and challenges encountered during development.

Downloads

Download data is not yet available.

Downloads

Published

2025-05-13

How to Cite

Doc Assist: Intelligent Document Processing Assistance for Enhanced Accessibility. (2025). International Research Journal on Advanced Engineering Hub (IRJAEH), 3(05), 2210-2214. https://doi.org/10.47392/IRJAEH.2025.0324

Similar Articles

21-30 of 72

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)