This project implements an end-to-end flight data engineering pipeline that uses Apache Airflow for orchestration and Snowflake as the data warehouse. It follows the medallion architecture, with bronze, silver, and gold data layers: the bronze layer ingests raw flight data from JSON files, the silver layer cleanses and transforms that data, and the gold layer aggregates it into analytics-ready insights. Each layer is implemented as a Python script and run by scheduled Airflow DAGs, and the whole stack is deployed in containers with Docker Compose. The pipeline is designed for scalable, reliable processing from ingestion to warehousing, and supports flight operations analytics that are refreshed on each scheduled run.
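
The flow through the three layers can be illustrated with a minimal, standalone Python sketch. It is not the project's actual code: the record fields (`flight_id`, `origin`, `dest`, `delay_min`), the function names, and the cleansing rules are all assumptions made for illustration, and plain in-memory lists stand in for the Snowflake tables and Airflow tasks.

```python
import json
from collections import defaultdict

# Hypothetical raw input; the field names are assumptions, not the project's schema.
RAW_JSON = """
[
  {"flight_id": "AA100", "origin": "jfk", "dest": "LAX", "delay_min": "12"},
  {"flight_id": "AA101", "origin": "JFK", "dest": "ORD", "delay_min": null},
  {"flight_id": "AA100", "origin": "jfk", "dest": "LAX", "delay_min": "12"},
  {"flight_id": "UA200", "origin": "SFO ", "dest": "LAX", "delay_min": "30"}
]
"""


def bronze_ingest(raw_text):
    """Bronze: parse the raw JSON and keep every record unchanged."""
    return json.loads(raw_text)


def silver_clean(records):
    """Silver: drop rows with no delay value, drop repeated flight IDs,
    and normalize airport codes and numeric types."""
    seen = set()
    cleaned = []
    for rec in records:
        if rec.get("delay_min") is None or rec["flight_id"] in seen:
            continue
        seen.add(rec["flight_id"])
        cleaned.append({
            "flight_id": rec["flight_id"],
            "origin": rec["origin"].strip().upper(),
            "dest": rec["dest"].strip().upper(),
            "delay_min": int(rec["delay_min"]),
        })
    return cleaned


def gold_aggregate(records):
    """Gold: compute the average delay in minutes per origin airport."""
    delays = defaultdict(list)
    for rec in records:
        delays[rec["origin"]].append(rec["delay_min"])
    return {origin: sum(vals) / len(vals) for origin, vals in delays.items()}


if __name__ == "__main__":
    bronze = bronze_ingest(RAW_JSON)
    silver = silver_clean(bronze)
    gold = gold_aggregate(silver)
    print(gold)
```

In a deployment like the one described, each of these functions would presumably correspond to one Airflow task that reads from and writes to a layer-specific Snowflake table, with task dependencies enforcing the bronze, silver, gold order.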

