Ensuring BI Reporting Accuracy Using AI-Based Back-Tracing of Metrics to ETL Lineage and Data Marts

Main Article Content

Pramod Raja Konda

Abstract

Business Intelligence (BI) systems rely heavily on the accuracy and consistency of the data processed through complex ETL pipelines and stored in data marts. However, modern BI ecosystems face challenges such as inconsistent metric definitions, undocumented transformation logic, repeated data duplication, and broken lineage across ETL workflows. These issues lead to inaccurate dashboards, misleading KPIs, and poor decision-making. This research proposes an AI-based back-tracing framework that automatically maps BI metrics to their underlying ETL lineage, source tables, and data mart structures. The framework utilizes natural language processing (NLP), graph-based lineage reconstruction, metadata mining, and anomaly detection to validate the correctness of metrics and identify inconsistencies. A real-world case study from a retail analytics environment demonstrates the efficacy of the model, supported by a table and a graphical representation of field-to-metric lineage. Results show significant improvements in reporting accuracy, automated error detection, and transparency of metric definitions

Article Details

How to Cite
Ensuring BI Reporting Accuracy Using AI-Based Back-Tracing of Metrics to ETL Lineage and Data Marts. (2023). International Machine Learning Journal and Computer Engineering, 6(6). https://mljce.in/index.php/Imljce/article/view/69
Section
Articles

How to Cite

Ensuring BI Reporting Accuracy Using AI-Based Back-Tracing of Metrics to ETL Lineage and Data Marts. (2023). International Machine Learning Journal and Computer Engineering, 6(6). https://mljce.in/index.php/Imljce/article/view/69

References

Batini, C., & Scannapieco, M. (2016). Data and Information Quality: Dimensions, Principles and Techniques. Springer.

Bernstein, P. A., & Rahm, E. (2011). Data integration in the cloud: Challenges and opportunities. ACM Data Engineering Bulletin, 34(1), 3–13.

Doan, A., Halevy, A., & Ives, Z. (2012). Principles of Data Integration. Morgan Kaufmann.

IBM. (2014). Modernizing Legacy Systems for Cloud Integration. IBM Redbooks.

Jian, S., & Li, W. (2018). Machine learning approaches for schema alignment. IEEE Access, 6, 42045–42056.

Mullins, C. (2013). Database Administration: The Complete Guide to Practices and Procedures. Addison-Wesley.

Rahm, E., & Do, H. (2000). Data cleaning: Problems and current approaches. IEEE Data Engineering Bulletin, 23(4), 3–13.

Zhu, Q., & Chen, H. (2016). Intelligent ETL frameworks using semantic reasoning. Expert Systems with Applications, 55, 56–67