AI-Driven Data Quality Assurance in Multi-Cloud Data Warehousing Environments

Aneeshkumar Perukilakattunirappel Sundareswaran; Swamy Sai Krishna Kireeti Athamakuri; Khushmeet Singh; Rajeev Kumar Sharma

doi:10.47392/IRJAEH.2025.0462

Authors

Aneeshkumar Perukilakattunirappel Sundareswaran Cochin University of Science and Technology, Cochin, Kerala, India Author
Swamy Sai Krishna Kireeti Athamakuri Andhra University, Visakhapatnam, Andhra Pradesh, India Author
Khushmeet Singh Dr. A.P.J. Abdul Kalam Technical University, Naya Khera, Jankipuram, Lucknow, Uttar Pradesh, India Author
Rajeev Kumar Sharma Western Governors University, Millcreek, UT. Author

DOI:

https://doi.org/10.47392/IRJAEH.2025.0462

Keywords:

Multi-cloud data warehousing, Natural language processing (NLP), AI-assisted data quality framework

Abstract

Multi-cloud data warehousing has emerged as a critical enabler for organizations seeking enhanced agility, scalability, and resilience in today’s rapidly evolving data-driven and cloud-native environments. Being subjected to various cloud platforms makes inconsistencies, latency, duplication, and governance imbalances harder to maintain and oversee, which is considered a significant problem today. This study aims to keep data quality across the cloud by developing an AI-driven data quality strategy. This framework employs a machine learning model that identifies, categorizes, and corrects data quality issues in cloud-based systems. This article implements a supervised learning model that relies on datasets from industry-specific cloud repositories to monitor data anomaly and data integrity infringement. Also, metadata and data lineage can be analyzed using NLP, enabling better traceability. Having executed the framework on AWS Redshift and Google BigQuery, the systems display effectiveness in scale, precision, and operational performance. The evidence indicates a 30% increase in anomaly detection accuracy with a reduction of 45% in overall time spent during the process. Like the prior models, this improves quality data management more anticipatively by using evolving data patterns. In addition, the AI-powered DQA solution proposed in this work considerably enhances data trustworthiness in multi-cloud data warehousing environments.

Downloads

Download data is not yet available.

AI-Driven Data Quality Assurance in Multi-Cloud Data Warehousing Environments

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

How to Cite

Language

Information

Make a Submission