
Data Engineering
Build robust data pipelines and infrastructure at scale. Learn cloud platforms, ETL processes, and modern data architecture.
This 6-month intensive programme equips you with the skills to design, build, and maintain the data infrastructure that powers modern organisations. Classes meet twice a week for one hour each session. You will work with cloud platforms (AWS, GCP), learn to build ETL pipelines, manage data warehouses, and implement data quality frameworks. By the end, you will have hands-on experience with the tools and technologies used by leading data engineering teams worldwide.
Curriculum
Data Engineering Fundamentals
The role of data engineering in modern organisations
Python Programming
Python for data engineering, scripting, and automation
SQL & Database Design
Advanced SQL, schema design, and performance optimisation
ETL Pipeline Development
Building extract, transform, load pipelines with Apache Airflow
Cloud Data Platforms
AWS (S3, Redshift, Glue) and GCP (BigQuery, Dataflow)
Data Warehousing
Dimensional modelling, star schemas, and data marts
Stream Processing
Real-time data processing with Apache Kafka
Data Quality & Governance
Testing, monitoring, and ensuring data reliability
Docker & Infrastructure
Containerisation and infrastructure as code
Capstone Project
Build a complete data platform from scratch