Data Engineer-I @ Ipsos
- Designed and maintained ETL pipelines processing 5M+ records from global market research surveys using Python, SQL, and Apache Spark
- Implemented dbt models to standardise data transformations, improving documentation, testing, and repeatability
- Automated reporting workflows, reducing manual data preparation time by 40%
- Built data validation frameworks across 15+ datasets, reducing downstream errors by 25%
- Supported migration of on-premises infrastructure to AWS (S3, Lambda, EC2)
- Developed reusable Spark jobs and Python scripts adopted as the team's core toolkit