Chotot, Mudah, and Carousell Infrastructure Management and Optimization
Managed the infrastructure for Chotot, Mudah, and Carousell across GCP, AWS, and on-premise data centers. Deployed and managed a Kubernetes cluster with over 20 nodes, supporting 200 microservices and cron jobs, and maintaining almost 3000 pods while handling 300 million requests per month. Acted as cloud architect for Chotot and Mudah, aiming to integrate the cloud infrastructures of both companies. Defined an Infrastructure-as-Code framework for Chotot and Mudah using GCP and AWS. Executed a cloud migration project that involved the migration of approximately 200 microservices, over 30 databases, caches, and message queues from the Vietnam data center to Google Cloud Platform. Developed a chat platform for Chotot, Mudah, and Carousell, which replaced Twilio chat and significantly reduced costs. Established SLI and SLO in collaboration with product owners and other teams. Defined and implemented an incident management process while participating in the on-call rotation. Created a developer portal to handle CI/CD processes, replacing existing GitHub Action or Travis CI setups and serving as a GitOps tool akin to ArgoCD for managing and deploying resources on Kubernetes. Standardized CI/CD flows and built a new CI/CD system for Chotot and Mudah, integrating it as part of the developer portal initiative. Developed a version auto-generation tool based on business requirements. Established a secret management system for Chotot and Mudah. Provided database performance troubleshooting and analysis. Developed a simplified incident management tool akin to Opsgenie. Collaborated with the security team to implement a zero-trust solution for Chotot and Carousell. Consulted with the data team on building a data warehouse and data processing pipelines. Supported the software team in examining and selecting technical stacks for new services. Assisted Mudah SRE in applying GitOps for Ansible, application configuration, and Terraform. Collaborated with Mudah SRE in developing a new monitoring solution capable of storing and querying monitor data for three years. Provided technical consultation to Engineering Managers and the Head of Tech. Conducted cloud cost optimization. Responsible for recruiting and training new team members.
Working with the infrastructure of Chotot, Mudah, and Carousell
Deploy and manage Kubernetes clusters
Cloud architect for Chotot and Mudah
Define Infrastructure as Code framework
Work on cloud migration projects
Build chat platforms for Chotot, Mudah, and Carousell
Collaborate with PO and other teams to establish SLI and SLO
Define incident management process
Develop developer portal
Standardize CI/CD flow and build new CI/CD systems
Develop a tool to auto-generate versions
Build secret management systems
Troubleshoot and analyze database performance
Develop incident management tools
Work with security team to apply zero trust solutions
Consult and work with data team to build data warehouse
Consult and support software team in choosing technical stack
Work with Mudah SRE to apply GitOps
Work with Mudah SRE to build new monitoring solutions
Consult technical solutions for Engineering Manager and Head of Tech
Examine and optimize cloud costs
Recruit and train new members
Technologies: GCP, AWS, Kubernetes, GitOps, ArgoCD, CI/CD, Ansible, Terraform, Prometheus, Grafana, Datadog, Vault, Consul, Kafka, Kubernetes, GitOps, Nginx, HA proxy, Kong, Traefik, Istio, Linkerd, Hadoop, Yarn, Airflow, Spark, Presto, Jupyter, Ceph, Minio, OpsGenie, PagerDuty