International Journal of Leading Research Publication

E-ISSN: 2582-8010     Impact Factor: 9.56

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 6 Issue 6 June 2025 Submit your research before last 3 days of to publish your research paper in the issue of June.

AI-driven ETL Optimization for Security and Performance Tuning in Big Data Architectures

Author(s) Shiva Kumar Vuppala
Country United States
Abstract In today’s era of big data, Extract, Transform, and Load (ETL) pipelines form the backbone of enterprise data processing. Traditional ETL systems, while foundational, often suffer from major limitations such as poor scalability, vulnerability to security breaches, slow transformation speeds, and heavy reliance on manual tuning and monitoring. These inefficiencies lead to increased human errors, delayed analytics, and difficulties in maintaining compliance with stringent security standards like PCI 4.0 and GDPR. Furthermore, conventional anomaly detection techniques used within ETL pipelines often fail to accurately identify complex, time-dependent irregularities in large and heterogeneous datasets. To overcome these challenges, this work proposes an AI-driven ETL optimization framework that enhances performance, security, and compliance capabilities. The methodology integrates Attention-LSTM networks for real-time anomaly detection, enabling the system to dynamically focus on critical sequence patterns for higher detection accuracy. Secure data extraction is enforced using TLS 1.3 encryption protocols, ensuring data confidentiality during transfer, while intelligent data transformation is achieved through Random Forest algorithms to automate and optimize transformation operations efficiently. Role-Based Access Control (RBAC) mechanisms are used to strengthen secure data loading, and the final deployment is seamlessly integrated with AWS Glue for scalable orchestration. Extensive experimental results demonstrate that the proposed AI-enhanced ETL pipeline significantly outperforms traditional methods in processing speed, anomaly detection accuracy, data throughput, and security compliance, while simultaneously reducing the risk of human error. This research highlights the transformative potential of AI technologies in modernizing ETL architectures, paving the way for more resilient and intelligent big data systems in enterprise environments.
Keywords AI-driven ETL, Anomaly Detection, Attention, Secure Data Extraction, Data Transformation, Compliance Automation
Field Computer > Data / Information
Published In Volume 6, Issue 5, May 2025
Published On 2025-05-10
Cite This AI-driven ETL Optimization for Security and Performance Tuning in Big Data Architectures - Shiva Kumar Vuppala - IJLRP Volume 6, Issue 5, May 2025.

Share this