Yasitha Sandeep Dhanajaya


20/A, Katuwawala,11

Summary

Dynamic Senior DevOps / Site Reliability Engineer with a proven track record in optimizing infrastructure, ensuring system reliability, and driving efficient project launches. Proficient in leveraging Datadog, Terraform, Jenkins CI/CD, and GitHub to enhance observability, automate deployments, and maintain proactive monitoring. Experienced in AWS, Google Cloud Platform, VMware vCenter, GKE Enterprise, Ansible, GitLab / GitHub, and Kubernetes, with a strong focus on performance optimization and system resilience. Skilled in managing CI/CD pipelines, integrating monitoring solutions, and implementing chaos engineering with Litmus to improve system robustness. Adept at incident management through ServiceNow, collaborating with development teams to resolve issues, and maintaining comprehensive Confluence documentation and runbooks. Committed to staying at the forefront of cloud infrastructure, automation, security, and reliability best practices, with excellent problem-solving, communication, and collaboration skills.

Overview

8 years of professional experience
2 certifications

Work History

Senior DevOps / Site Reliability Engineer

Sysco Labs Sri Lanka
Colombo, Sri Lanka
07.2024 - Current
  • Monitoring & Observability: Managed proactive monitoring through Datadog, improving analytical insights to foresee and mitigate potential service degradation, ensuring site reliability.
  • Infrastructure as Code (IaC): Created and managed Datadog monitors using Terraform, regularly auditing configurations to ensure accuracy and effectiveness.
  • CI/CD & Automation: Deployed monitoring changes via Jenkins CI/CD pipelines and integrated Datadog with automated deployment workflows.
  • Version Control & Documentation: Managed code repositories in GitHub and maintained detailed documentation, including Confluence runbooks, for operational transparency.
  • Incident Management & Support: Provided L2 support for customer-reported incidents on ServiceNow, collaborating with development teams to troubleshoot and resolve issues.
  • Release Management: Oversaw major and minor release deployments through Jenkins CI/CD pipelines, ensuring smooth and reliable software delivery.
  • Performance & Chaos Engineering: Leveraged Datadog insights for performance testing and implemented chaos engineering practices using the Litmus tool to improve system resilience.

Senior Sysops Engineer

Circles
Colombo, Sri Lanka
09.2022 - 07.2024
  • Maintained uninterrupted service by ensuring zero downtime in multiple production environments deployed on AWS and GCP.
  • Took a proactive approach to monitoring through the use of Grafana, New Relic, OpsGenie, and the ELK stack, which provided real-time insight into potential issues.
  • Empowered dev teams with controlled access to the environments through FreeIPA (an open-source identity management solution sponsored by Red Hat) and JumpCloud to increase efficiency and security.
  • Conducted timely vulnerability assessment scans and security patch updates to ensure that the production environments were always up-to-date with the latest security protocols, and aligned with the CIS Benchmark.
  • Created and deployed new infrastructure based on requirements to support the growth of the business.
  • Participated in an on-call roster to provide 24-hour support when necessary, to ensure that any issues were promptly resolved.
  • Supported the successful launch of new projects, primarily on Google Cloud Platform, and worked on an on-premises project using VMware vCenter.
  • Gained experience with Google Anthos, now known as GKE Enterprise, which integrates cloud services with on-premises Kubernetes infrastructure into a unified system.
  • Used Infrastructure as Code tools such as Terraform, Pulumi, and Ansible to deploy infrastructure and services at scale, streamlining the deployment process and significantly reducing deployment times.
  • Managed source code through GitLab, leveraging its robust capabilities to oversee code changes, merge requests, and more.
  • Integrated GitLab pipelines for seamless CI/CD integrations, allowing for quick testing, deployment, and monitoring of new services.
  • Provisioned Kubernetes clusters on both Google Cloud Platform's GKE and on-premises infrastructure, using Google Anthos to create a hybrid system that integrates cloud services with on-premises infrastructure.
  • Utilized cutting-edge technologies, like Helm charts, to streamline the deployment process of microservices within Kubernetes clusters, ensuring a fast and efficient deployment process.
  • Facilitated the integration of FreeIPA with other services as an open-source LDAP authentication mode to increase efficiency and ensure security.
  • Gained valuable experience in a variety of essential networking tasks, including DNS management, SSL certificate management, L4 and L7 load balancer management, firewall management, and IPsec and SSL VPN setup and management.
  • Generated engineering documentation and oversaw design projects.

Systems Engineer

Rezgateway
Colombo, Sri Lanka
05.2021 - 04.2022
  • As a Systems Engineer for the company's primary solution provider, I was actively involved in the following areas:
  • Systems Engineer / DevOps skills
  • AWS service installations and server migration
  • Hands-on experience with AWS services such as EC2, ECS (Elastic Container Service), S3, VPC, Route 53, and Cloud9
  • Maintaining AWS security group policies
  • Server and service installations in the Rackspace cloud environment
  • Managing Rackspace cloud security policies
  • Experience with Jenkins CI/CD delivery pipelines
  • Hands-on experience with ELK stack technologies
  • Experience with HAProxy load balancing
  • Worked on an R&D project provisioning a centralized developer environment (AWS Cloud9, Visual Studio Code, Eclipse Che)
  • Experience with containerization technologies such as Docker and Kubernetes

Junior Network Administrator

Rezgateway
Colombo, Sri Lanka
09.2018 - 12.2019
  • Provided first- and second-level support; configured, maintained, and troubleshot application and database servers (managed/unmanaged) on Windows and Linux.
  • Monitored and responded to alerts from Nagios, New Relic, and Triometric system monitoring tools, and performed system quality checks.
  • Served as first contact for client inquiries (USA, UK, South Africa, Colombia, and Sri Lanka), working under the given SLAs.
  • Communicated with and provided support for the development team.
  • Performed and coordinated emergency and scheduled maintenance tasks on the LAN and WAN based on standard procedures.

Network Administrator, LAN Administrator

Gateway Inc
Colombo, Sri Lanka
01.2019 - 04.2019
  • Maintaining and troubleshooting the LAN infrastructure
  • Firewall management
  • AWS service installations and server migration
  • Maintaining AWS security group policies
  • JBoss, Apache2, Tomcat, and PostgreSQL service installation
  • SSL certificate installation and maintenance
  • Hands-on experience with the Ansible and Terraform automation tools
  • Nagios service installation and client deployment
  • Proactive monitoring with Nagios, Consul, Grafana, etc.

Technical Support Executive

Rezgateway, Reservations Gateway
Colombo, Sri Lanka
08.2017 - 09.2018

  • Linux (Ubuntu) operating system installation
  • Linux (Ubuntu) operating system troubleshooting
  • Windows operating system installation
  • Windows operating system troubleshooting
  • Apple iMac troubleshooting
  • PC hardware troubleshooting
  • Network troubleshooting

Education

Bachelor of Science - Computer Science

Wrexham Glyndŵr University
United Kingdom
05.2022

BTEC Higher National Diploma - Computer Science & Networking

Esoft Metro Campus
Colombo
01.2020

Skills

  • Amazon Web Services (AWS)
  • Google Cloud Platform (GCP)
  • GKE Enterprise
  • CI/CD pipelines
  • Infrastructure as code
  • Automation tools
  • Documentation management
  • Network troubleshooting
  • Analytical thinking
  • System troubleshooting
  • Monitoring systems
  • VMware
  • Kubernetes
  • Ansible
  • Terraform
  • Oracle
  • RHEL
  • LDAP

Certification

  • CKA: Certified Kubernetes Administrator
  • Datadog 101: Site Reliability Engineer
  • AWS Certified Solutions Architect – Associate
  • Perform Foundational Infrastructure Tasks in Google Cloud - Google Cloud Skills
  • Reliable Google Cloud Infrastructure: Design and Process - Google Cloud Skills
  • Architecting with Google Kubernetes Engine: Foundations - Google Cloud Skills
  • Google Cloud Fundamentals: Core Infrastructure - Google Cloud Skills Boost
  • Architecting with Google Kubernetes Engine: Workloads - Google Cloud Skills
  • Developing a Google SRE Culture - Google Cloud Skills Boost
  • Red Hat Certified System Administrator (RHCSA) - Red Hat
  • Certified OKR Professional - Profit.co

Languages

  • English, Fluent
  • Sinhala, Native speaker

Affiliations

  • Represent the Sysco Labs Mercantile Cricket Team for the 2025 season.
