Travel Data Analytics Project - Singapore
2021
As a data engineer, I spearheaded the development and optimization of an ETL pipeline using AWS Glue, achieving a 50% reduction in runtime and operational costs.
Spearheaded the development and continuous optimization of an ETL pipeline using AWS Glue, resulting in a 50% reduction in both runtime and operational costs.
Played a pivotal role in crafting and maintaining intricate Airflow DAGs, ensuring a reliable and robust workflow orchestration system to support complex data processes.
Led the development and ongoing maintenance of data marts within Redshift, facilitating the creation of insightful dashboards and business intelligence reports using DBT, enabling data-driven decision-making.
Processed real-time streaming customer behavior data from the website backend using DynamoDB and Kafka, enabling rapid insights into user actions and preferences, ultimately enhancing customer experience.
Spearheaded the creation of interactive dashboards to analyze customer behavior, utilizing key metrics such as CTR/CVR across all web pages, leveraging tools like Metabase and QuickSight, fostering data-driven strategies to improve user engagement.
Implemented a robust notification system, enabling timely alerts and communication on pending bookings, data ingestion failures, and more, through email and Slack channels, contributing to seamless data operations and issue resolution.
Drove research and implementation efforts for prompt engineering with ChatGPT, resulting in a notable 15-20% reduction in development time.
Technologies: AWS Glue, Redshift, Postgres on AWS RDS, DynamoDB, AWS Elastic Container, Airflow, Python, Faust, Git, QuickSight, Metabase, DBT