Summary

Overview

Work History

Education

Skills

Certification

Languages

Affiliations

Timeline

Yasitha Sandeep Dhanajaya

20/A, Katuwawala,11

Summary

Dynamic Senior DevOps / Site Reliability Engineer with a proven track record in optimizing infrastructure, ensuring system reliability, and driving efficient project launches. Proficient in leveraging Datadog, Terraform, Jenkins CI/CD, and GitHub to enhance observability, automate deployments, and maintain proactive monitoring. Experienced in AWS, Google Cloud Platform, VMware vCenter, GKE Enterprise, Ansible, GitLab / GitHub, and Kubernetes, with a strong focus on performance optimization and system resilience. Skilled in managing CI/CD pipelines, integrating monitoring solutions, and implementing Chaos Engineering with Litmus to improve system robustness. Adept at incident management through ServiceNow, collaborating with development teams to resolve issues, and maintaining comprehensive Confluence documentation and runbook. Committed to staying at the forefront of cloud infrastructure, automation, security, and reliability best practices, with excellent problem-solving, communication, and collaboration skills.

Overview

years of professional experience

Certification

Work History

Senior DevOps /Site Reliability Engineer

Sysco Labs Srilanka

Colombo, SriLanka

07.2024 - Current

Monitoring & Observability: Managed proactive monitoring through Datadog, improving analytical insights to foresee and mitigate potential service degradation, ensuring site reliability.
Infrastructure as Code (IaC): Created and managed Datadog monitors using Terraform, regularly auditing configurations to ensure accuracy and effectiveness.
CI/CD & Automation: Deployed monitoring changes via Jenkins CI/CD pipelines and integrated Datadog with automated deployment workflows.
Version Control & Documentation: Managed code repositories in GitHub and maintained detailed documentation, including Confluence runbooks, for operational transparency.
Incident Management & Support: Provided L2 support for customer-reported incidents on ServiceNow, collaborating with development teams to troubleshoot and resolve issues.
Release Management: Oversaw major and minor release deployments through Jenkins CI/CD pipelines, ensuring smooth and reliable software delivery.
Performance & Chaos Engineering: Leveraged Datadog insights for performance testing and implemented chaos engineering practices using the Litmus tool to improve system resilience.

Senior Sysops Engineer

Circles

Colombo, SriLanka

09.2022 - 07.2024

Maintained uninterrupted service by ensuring zero downtime in multiple production environments deployed on AWS and GCP.
Took a proactive approach to monitoring through the use of Grafana, New Relic, OpsGenie, and the ELK stack, which provided real-time insight into potential issues.
Empowered dev teams with controlled access to the environments through FreeIPA (an OpenLDAP solution from Red Hat) and JumpCloud to increase efficiency and security.
Conducted timely vulnerability assessment scans and security patch updates to ensure that the production environments were always up-to-date with the latest security protocols, and aligned with the CIS Benchmark.
Created and deployed new infrastructure based on requirements to support the growth of the business.
Participated in an on-call roster to provide 24-hour support when necessary, to ensure that any issues were promptly resolved.
Supported the successful launch of new projects, primarily on the cutting-edge Google Cloud Platform, as well as worked on an on-premises project using VMware vCenter.
Gained valuable experience working with Google Anthos presently know as GKE Enterprise, which allowed for the seamless integration of cloud services with on-premises Kubernetes infrastructure, creating a unified system.
Utilized automation languages like Terraform, Pulumi, and Ansible to rapidly deploy mass infrastructures and services through Infrastructure as Code streamlines the deployment process and significantly reduces deployment times.
Managed source code through GitLab, leveraging its robust capabilities to oversee code changes, merge requests, and more.
Integrated GitLab pipelines for seamless CI/CD integrations, allowing for quick testing, deployment, and monitoring of new services.
Provisioned Kubernetes clusters with ease on both Google Cloud Platform's GKE and on-premises infrastructures, utilizing Google Anthos creates a hybrid system that seamlessly integrates both cloud services and on-premises infrastructure.
Utilized cutting-edge technologies, like Helm charts, to streamline the deployment process of microservices within Kubernetes clusters, ensuring a fast and efficient deployment process.
Facilitated the integration of FreeIPA to other services as an open source LDAP auth mode is designed to increase efficiency and ensure security.
Gained valuable experience in a variety of essential networking tasks, including DNS management, SSL certificate management, L4 and L7 load balancer management, firewall management, and IPsec and SSL VPN setup and management.
Generated engineering documentation and oversaw design projects.

Systems Engineer

Rezgateway

Colombo, SriLanka

05.2021 - 04.2022

As a System Engineer for the company's primary solution provider I have actively involved in the following areas
System Engineer /DevOps Skills
AWS service installations and server Migration
Hands on Experience in AWS services like ec2 ECS (Elastic Container Service) S3 VPC Route 53 Cloud9
Maintaining AWS security group policies
Server and service installations on Rackspace cloud environment
Managing Rackspace cloud security policies
Experience in Jenkins CI-CD Delivery pipelines
Hands on Experience in ELK stack technologies
Experience in HA Proxy Load balancing
Worked on RND project in provisioning of Centralized Developer environment; AWS Cloud9, Visual Studio Code, Eclipse Che
Experience in Containerization technologies such as Docker and
Kubernetes.

Junior Network Administrator

Rezgateway

Colombo, SriLanka

09.2018 - 12.2019

Provide first and second level support, configure. Maintain and troubleshoot application and database servers (Managed/Unmanaged).
Windows/Linux servers: Monitor and respond to Nagios.
Newrelic
Triometric system monitoring tools perform system quality checks.
Function as 1st contact for client inquiries (USA, UK, South Africa, Colombia and Sri Lanka) and work under given SLAs Communicate and provide support for the development team Performing and coordinating emergency and schedule maintenance task of the LAN & WAN Based on the standard procedures.

Network Administrator, LAN Administrator

Gateway Inc

Colombo, SriLanka

01.2019 - 04.2019

Maintaining and Troubleshooting the LAN infrastructure
Firewall Management
AWS service installations and server Migration
Maintaining AWS security group policies
Jboss
Apache2
Tomcat and postgres service installation
SSL certificate installations and maintenance
Hands on Experience in Ansible
Terraform automation Languages
Nagios Service Installation and Client deployment
Proactive Monitoring with Nagios, Consul, Grafana etc.

Technical Support Executive

Rezgateway, Reservations Gateway

Colombo, SriLanka

08.2017 - 09.2018

• Linux (Ubuntu) Operating System Installation
• Linux (Ubuntu) Operating System Troubleshooting
• Windows Operating Systems Installation
• Windows Operating Systems Troubleshooting
• Apple iMac Troubleshooting
• PC Hardware Troubleshooting
• Network Troubleshooting

Education

Bachelor of Science - Computer Science

Wrexham Glyndŵr University

United Kingdom

05.2022

BTEC Higher National Diploma - Computer Science & Networking

Esoft Metro Campus

Colombo

01.2020

Skills

Amazon Web Services (AWS)
Google Cloud Platform (GCP)
GKE Enterprise
CI/CD pipelines
Infrastructure as code
Automation tools
Documentation management
Network troubleshooting
Analytical thinking

System troubleshooting
Monitoring systems
VMware
Kubernetes
Ansible
Terraform
Oracle
RHEL
LDAP

Certification

CKA: Certified Kubernetes Administrator
Datadog 101: Site Reliability Engineer
AWS Certified Solutions Architect – Associate
Architecting with Google Kubernetes Engine: Workloads
Architecting with Google Kubernetes Engine: Foundations
Perform Foundational Infrastructure Tasks in Google Cloud - Google Cloud Skills
Reliable Google Cloud Infrastructure: Design and Process - Google Cloud Skills
Architecting with Google Kubernetes Engine: Foundations - Google Cloud Skills
Google Cloud Fundamentals: Core Infrastructure - Google Cloud Skills Boost
Architecting with Google Kubernetes Engine: Workloads - Google Cloud Skills
Developing a Google SRE Culture - Google Cloud Skills Boost
Red Hat Certified System Administrator (RHCSA) - Red Hat
Certified OKR Professional - Profit.co

Languages

English, Fluent
Sinhala, Native speaker

Affiliations

Represent Sysco Labs Mercantile Cricket Team for 2025 season.

Timeline

Senior DevOps /Site Reliability Engineer

Sysco Labs Srilanka

07.2024 - Current

Senior Sysops Engineer

Circles

09.2022 - 07.2024

Systems Engineer

Rezgateway

05.2021 - 04.2022

Network Administrator, LAN Administrator

Gateway Inc

01.2019 - 04.2019

Junior Network Administrator

Rezgateway

09.2018 - 12.2019

Technical Support Executive

Rezgateway, Reservations Gateway

08.2017 - 09.2018

Bachelor of Science - Computer Science

Wrexham Glyndŵr University

BTEC Higher National Diploma - Computer Science & Networking

Esoft Metro Campus