Kumar Saurabh

Singapore

Summary

DevOps Engineer with 10+ years of IT experience including 8+ years of specialized DevOps expertise in designing, automating, and optimizing mission-critical deployments in AWS and GCP. Skilled in architecting scalable, highly available, and fault-tolerant cloud infrastructures, leveraging AWS services such as EC2, VPC, Transit Gateway, Site-to-Site VPN, Route53, S3, RDS, CloudFormation, CloudWatch, SQS, and IAM. Experienced in managing multi-region, cross-continent deployments with a strong focus on security, performance, and cost optimization.

Hands-on expertise in Infrastructure as Code (Terraform, Ansible, CloudFormation) and CI/CD automation (Jenkins, Git, Docker, Helm) to accelerate deployments, reduce human error, and improve delivery pipelines. Strong background in monitoring, logging, and observability using Prometheus, Grafana, ELK Stack, AWS CloudWatch, and GCP Stackdriver, ensuring proactive incident management and performance tuning.

Proven experience in service mesh technologies (Istio) for traffic management, resilience, and observability across microservices. Skilled in Linux administration (RHEL, CentOS, Ubuntu, Oracle Linux) and automation with shell scripting.

Experienced in ITIL processes (incident, change, and release management) using BMC Remedy, ServiceNow, and JIRA, with a consistent record of meeting SLAs and driving incident resolution within agreed timelines. Adept at working in high-pressure environments with strong leadership skills, capable of motivating teams and collaborating across functions to increase productivity and achieve business goals.

Overview

14 years of professional experience
1 Certification

Work History

Staff DevOps Engineer

8x8
07.2023 - Current
  • Designed and implemented CI/CD pipelines to streamline deployment processes.
  • Collaborated with development teams to optimize infrastructure for scalability and performance.
  • Automated system monitoring and alerting to enhance operational efficiency and reduce downtime.
  • Led cross-functional teams in identifying and resolving system vulnerabilities, improving security posture.
  • Evaluated new technologies to improve existing workflows and enhance service delivery outcomes.
  • Improved code deployment efficiency by automating processes with CI/CD pipelines.
  • Designed and implemented containerization strategies using Docker and Kubernetes, improving resource utilization and management.
  • Reduced system downtime for critical applications by implementing robust monitoring and alerting tools.
  • Contributed to the creation of a DevOps culture within the organization, leading to increased agility and cross-functional collaboration.
  • Provided 24/7 on-call support for critical systems, ensuring high availability and rapid issue resolution.
  • Designed and executed cloud infrastructure migrations, resulting in enhanced system flexibility and cost savings.
  • Reduced operational costs by optimizing cloud resource utilization and implementing cost-effective solutions.
  • Coordinated deployments of new software, feature updates and fixes.

Staff DevOps Engineer

Illumina Inc.
08.2017 - 07.2023
  • Implemented Infrastructure as Code (IaC) using Ansible and Terraform.
  • Managed source code and version control using Git.
  • Set up and managed AWS infrastructure, including VPC, EC2, IAM, EBS, Security Groups, Auto Scaling, RDS, and CloudFormation templates.
  • Managed multiple AWS accounts with multiple VPCs across environments, focusing on automation, integration, and cost optimization.
  • Developed a strong understanding of ELB, networking principles, routing technologies, Transit Gateway, and DNS (Route53).
  • Installed and configured Kubernetes clusters on AWS.
  • Implemented continuous integration and automation using Ansible, Terraform, and Jenkins.
  • Monitored AWS resources and deployed applications using CloudWatch, creating alarms and configuring notifications.
  • Used the ELK stack (Elasticsearch, Logstash, Kibana) for centralized logging and analysis.
  • Wrote scripts and troubleshot issues to reduce manual effort, improve productivity, and minimize downtime.
  • Managed CI/CD pipelines and automated deployment processes using Jenkins and Ansible.
  • Created and maintained documentation for automation and operational processes to comply with audit requirements and industry best practices.
  • Performed day-to-day operations, administration, and maintenance for enterprise applications.
  • Supported enterprise distributed applications across a wide range of operating environments.

System Admin

Accenture Singapore (Payroll of Geco)
08.2016 - 08.2017
  • Expertise in Unix and Linux system installation, configuration, administration, including development and testing of backup, preventive maintenance, monitoring, and alerting.
  • Managing servers and applications hosted on cloud environments.
  • Virtual Machine administration: creating servers with requested CPU/RAM, configuration, patching, updates, and upgrades.
  • Managing Linux server upgrades using standard checklists and templates.
  • Administration of AD, FileNet, QRadar, and Nagios servers.
  • User Management & Administration: managing user accounts, groups, and access levels.
  • Experienced in installation, configuration, and administration of WebSphere Application Server (WAS) and IBM HTTP Server on Linux and Windows, including automation of tasks.
  • Troubleshooting webserver/AppServer configuration and performance issues.
  • Setting up nodes, data sources, virtual hosts, and planning installation/configuration of WAS.
  • Skilled in plugin and configuration file updates to improve performance.
  • Worked with IBM on problem determination and resolution: submitting PMRs, running must-gather scripts, enabling traces, taking thread and heap dumps.
  • Architected horizontal and vertical clusters for a fault-tolerant, scalable, and highly available WebSphere environment.
  • Installed fix packs, eFixes, and cumulative fixes, including automation of updates.
  • Developed and maintained scripts to reduce manual effort and improve productivity.
  • Performing day-to-day operations, administration, and maintenance in support of enterprise applications.
  • Managing application start-up/checklist tasks, log-files, queues, and transactions, resolving exceptions or escalating as needed.
  • Supporting enterprise distributed applications in high-volume, secure, 24/7 environments.

Project: SSNET

  • Responsible for managing infrastructure hosted on the cloud environment, ensuring 24x7 availability of servers and applications.
  • Tasks included server/application management, monitoring, and performance improvement.

Technology Stack Used:
Cloud Computing (IaaS), RHEL6, Shell Scripting, WebSphere Application Server, IBM HTTP Server, DB2, FileNet, Nagios, QRadar, IBM Integration Bus

System Admin

IBM Singapore (Payroll of Infinite)
10.2015 - 07.2016
  • Administration of WebSphere Application Server (WAS) and IBM HTTP Server across multiple environments.
  • Experienced in installing WAS Network Deployment packages on different operating systems.
  • Troubleshooting webserver/AppServer configuration and performance issues.
  • Monitoring server performance and resource utilization, performing application health checks, and conducting log analysis.
  • Setting up nodes, data sources, virtual hosts, and planning installation/configuration of WebSphere Application Server.
  • Configured Admin Console security on WebSphere: creating users with different roles, managing groups, and integrating with LDAP.
  • Deployed EAR applications on WAS Network Deployment in Development, System Test, and Performance Test environments, and resolved configuration/application issues.
  • Configured IBM MQ: queue managers, objects, queues, channels, and clustering.
  • Monitoring and maintaining queue managers, queues, channels, and listeners.
  • Installed fix packs for WebSphere Application Server and managed backups for WebSphere and MQ series.
  • Involved in planning migrations and upgrades, including installing upgrade fix packs and migrating to latest versions.
  • Attending and resolving tickets via the client’s proprietary ticketing system.
  • Experienced in operational 24/7 support, troubleshooting, monitoring, and maintenance of enterprise middleware services.

Project: Middleware Support in DBS

  • Part of the Middleware Support team, providing operational and application support to critical business services.
  • Responsibilities included troubleshooting, coordination with application teams, and ensuring 24x7 availability of mission-critical services.
  • Required strong problem-solving skills, ability to learn quickly, and effective communication with cross-functional teams.

Technology Stack Used:
RHEL6, AIX, Shell/Batch Scripting, WebSphere Application Server, IBM HTTP Server, IBM MQ


Senior Software Consultant

BNP Paribas, Singapore (Payroll of Nityo Infotech)
12.2014 - 09.2015
  • Strong knowledge of operating systems: Linux, AIX, and Windows.
  • Expertise in Unix/Linux system installation, configuration, administration, including development and testing of backups, preventive maintenance, monitoring, and alerting.
  • Writing and enhancing Unix/Batch scripts for effective system monitoring and other business requirements, improving performance and efficiency.
  • Monitoring server performance and resources, performing application health checks, and conducting log analysis.
  • Coordinating with cross-functional teams to resolve system and application issues.
  • Installation and deployment of Axway CFT on various application servers: AIX, Linux, and Windows.
  • Developed a file comparison tool using batch scripts for log monitoring.
  • End-to-end responsibility for managing file transfers, including design flow, troubleshooting, and consolidation of existing Axway CFT products.
  • Documenting business requirements, process flows, and data relationships for application development.
  • Working closely with operations, management, and third-party vendors to define, test, and validate requirements.
  • Translating client business requirements into functional specifications.
  • Knowledge and implementation of ITIL service management processes.
  • Providing 24/7 on-call support for application teams and data center personnel, including support outside office hours, weekends, and during major releases/deployments.

Project: Middleware Support

  • Part of the Middleware Support team, providing operational and application support to ensure servers and applications are highly available.
  • Responsible for monitoring performance, troubleshooting issues, and improving application/server performance.
  • Followed ITIL processes to ensure all SRS and incident tickets were processed within the business SLA.
  • Ensured 24x7 availability of critical business services.

Technology Stack Used:
RHEL6, CentOS, Shell/Batch Scripting, Windows Server, Axway CFT

Engineer

Sopra Group
05.2013 - 11.2014
  • Exposure to Red Hat Linux distributions; proficient in loading OS images to host machines.
  • Creating and managing virtual machines (KVM) in Linux environments.
  • Shell/Bash scripting (beginner) for automation and system tasks.
  • Installation and configuration of service packs and necessary software/modules.
  • Experienced in Axway products: GI, CFT, Map Designer, B2Bi.
  • Knowledge of EDI messaging standards: EDIFACT, X12, XML, IDOC, and experience creating maps (e.g., IDOC → EDIFACT).
  • End-to-end flow configuration in B2Bi (Business to Business Integration).
  • Hands-on with EDI standards including EDIFACT (ORDERS, ORDRSP, DESADV, DELFOR, INVOIC) and ANSI X12.
  • Management, analysis, development, testing, and support of EDI and data exchange solutions with internal and external trading partners.
  • Troubleshooting and correcting program errors and defects.
  • Data mapping and transformation using SQL, X12, XML, EDIFACT.
  • Implementing and maintaining customer and supplier EDI/XML trading partner integrations, from mapping/translators to ERP setup.
  • Application integration via AS2, HTTP, HTTPS, FTP, including trading partner setup.
  • Strong skills in systems analysis, problem-solving, troubleshooting, and requirements gathering.
  • Knowledge of service management and incident management processes.
  • Active participation in team meetings, production issue analysis, and knowledge transfer sessions for new team members.

Projects

Cardinal Healthcare

  • Supported mission-critical services enabling pharmacies, hospitals, and ambulatory care sites to focus on patient care while reducing costs and improving efficiency.
  • Responsibilities included setting up routes between community and partners using Axway GI, CFT, and Composer, ensuring 24x7 service availability.

Moët Hennessy – Development

  • Part of Enterprise Application Integration (EAI) and Business-to-Business (B2B) projects for LVMH.
  • Responsibilities included development, testing, deployment, and post-production support for EDI implementations.

Technology Stack Used:
Gateway Interchange (GI), CFT, Mapping Services, B2Bi, Shell Scripting, Linux (RHEL 6.0)

Production Support Engineer

IBM India Pvt. Ltd. (Payroll of Collabera Solutions Pvt. Ltd.)
11.2012 - 05.2013
  • Responsible for maintenance and support of billing and contracting applications for Vodafone.
  • Monitoring ticket queues, including tickets generated from automated alarms.
  • Performing day-to-day operations, administration, and maintenance activities to ensure application stability.
  • Managing application startup/checklist tasks, log files, queues, and transaction monitoring, taking corrective action or escalating as needed.
  • Developed shell scripts to automate daily server and log monitoring.
  • Experience in analyzing and reviewing business and functional requirements with stakeholders and coordinating with development teams.
  • Enhanced existing Unix scripts to improve performance and operational efficiency.
  • Handling change requests, performing release and deployment activities, and ensuring compliance with client requirements.
  • Resolving production issues within SLA, performing root cause analysis, troubleshooting, and coordinating with relevant teams.
  • Participating in high-severity issue calls, coordinating with multiple teams to identify and resolve critical issues.
  • Writing Linux scripts to standardize support environments and automate manual tasks, including modifying existing shell scripts and SQL queries.
  • Performing application health checks and ensuring proper logging/documentation of issues in knowledge management portals.
  • Providing 24x7 production support, ensuring mission-critical services remain available to clients.


Project: E-Commerce Platform Services

  • Part of the team responsible for billing services for an e-commerce platform, ensuring 24x7 availability of mission-critical services that directly impact business transactions.
  • Responsibilities included monitoring, troubleshooting, scripting, and coordinating incident resolution across teams.


Technology Stack Used:
Linux (RHEL 6.0), BMC Remedy, Package Management (RPM, YUM)

IT Trainer

NIIT Limited, Bhutan
04.2011 - 06.2012
  • Ensured maintenance of systems and networks to enable smooth conduct of IT classes and training sessions.
  • Writing and executing SQL queries for data extraction and analysis.
  • Managed Encore Server, including backups, replication, and database administration for 500+ students enrolled in various IT courses.
  • Executed scripts on servers and performed database dumps for maintenance and reporting.
  • Functioned as Centre-in-Charge, accountable for the day-to-day operations of the IT Training Centre at the Vocational Training Institute, Chumey, Bumthang, as part of the “Chiphen Rigpel Project”.
  • Led a team of 5 members, scheduling batches and allocating tasks based on individual KRAs.
  • Conducted IT training sessions and examinations across various modules for students.
  • Provided training on Microsoft Office (Word, Excel, PowerPoint) and C programming, enhancing students’ technical and programming skills.

Project: Chiphen Rigpel Project (Bhutan Government, 1+ Year)

  • An Indo-Bhutan friendship project aimed at conducting IT training sessions across Bhutan.
  • Role involved supporting production applications on Linux servers, including UNIX shell scripting, SQL queries, package installation, network management, and hardware support.
  • Ensured smooth operations of IT infrastructure, enabling effective training delivery to students in IT, Microsoft Office, and programming.

Technology Stack Used:
Linux, C, UNIX shell scripting, Microsoft Office, Hardware & Networking (Windows XP)

Education

B.Tech - Computer Science Engineering

Government College of Engineering and Ceramic Technology
Kolkata (India)
01.2010

Intermediate Science

D.S. College, Katihar
Katihar (India)
01.2005

Madhyamik

Zila School, Katihar
Katihar (India)
01.2003

Skills

  • Cloud Platforms & Services
    AWS: EC2, VPC, Transit Gateway, Site-to-Site VPN, S3, Route53, SQS, IAM, RDS, CloudWatch, CloudFormation
    GCP: Compute Engine, Cloud Storage, VPC, Cloud SQL, IAM (Monitoring & Logging)
    Strong expertise in provisioning, networking, scaling, monitoring, and securing multi-cloud environments
  • Infrastructure as Code & Automation
    Terraform, Terragrunt, Ansible, AWS CloudFormation – reproducible, scalable, and policy-driven infrastructure deployments
  • CI/CD & DevOps Tooling
    Jenkins (pipeline automation), Git (GitHub/GitLab/Bitbucket), Docker (containerization), Istio (service mesh for traffic routing, security, observability), Atlantis
  • Monitoring, Logging & Observability
    Prometheus (metrics collection & alerting), Grafana (visualization), ELK Stack (Elasticsearch, Logstash, Kibana), AWS CloudWatch, GCP Stackdriver – enabling observability, incident response, and performance optimization
  • Operating Systems & Scripting
    Linux (RHEL, CentOS, Ubuntu, Oracle Linux) – system administration, patching, performance tuning
    Shell scripting for automation and orchestration
  • ITSM & Collaboration Tools
    ITIL-aligned processes with BMC Remedy, ServiceNow, and JIRA for incident, change, and problem management
  • Application Server Management
    IBM WebSphere Application Server – deployment, configuration, and performance optimization of enterprise applications
  • Other Tools
    Helm, Confluence, Slack, MS Office (documentation & reporting)

Certification

  • AWS Certified Solutions Architect – Associate
  • Certified Kubernetes Administrator (CKA)
  • HashiCorp Certified Terraform Associate

Languages

English
Hindi
Bengali

Awards

  • Recipient of an Appreciation Award from the Ministry of Information and Communications, Bhutan
  • Winner of the Best Newcomer Award, 2011
  • Commended by the General Manager, Group Leader, and Vice President of NIIT Ltd., Bhutan

CORE COMPETENCIES

  • Cloud & Infrastructure Management: AWS (EC2, VPC, Transit Gateway, S3, RDS, IAM, CloudFormation), GCP (Compute Engine, GKE, Cloud Storage, VPC, Cloud SQL)
  • CI/CD & Automation: Jenkins, Git, Docker, Helm, Terraform, Ansible, CloudFormation, Atlantis
  • Monitoring & Observability: Prometheus, Grafana, ELK Stack, AWS CloudWatch, GCP Stackdriver
  • Operations & Maintenance: Deployment, patching, scaling, performance tuning, incident and change management
  • Cross-Functional Collaboration: Working with development, QA, and operations teams for smooth releases and troubleshooting
  • Training & Mentorship: Coaching teams on DevOps best practices, CI/CD pipelines, and infrastructure automation
  • Application & Middleware Management: IBM WebSphere Application Server deployment, configuration, and performance optimization
  • Soft Skills: Problem-solving, SLA adherence, leadership, team motivation, documentation & reporting