An end-to-end ETL data pipeline that uses PySpark parallel processing to handle about 25 million rows of data from a SaaS application, with Apache Airflow as the orchestration tool and various data warehouse technologies. Apache Superset connects to the data warehouse (DWH) to generate BI dashboards for weekly reports.
🏗️ High-impact Power BI projects showcasing advanced ETL processes and complex DAX modeling. Focused on building scalable, high-performance dashboards from fragmented datasets. Covers Star Schema, Time Intelligence, and Geospatial Analytics.