Experienced DevOps Engineer with a strong background in automating and optimizing mission-critical deployments in AWS and GCP. Proficient in tools like Jenkins, Terraform, and Ansible to streamline development processes and ensure efficient code deployment. Adept at managing and monitoring cloud infrastructure services and maintaining high availability in Kubernetes-based container clusters. Successfully implemented robust monitoring and logging solutions using ELK and Grafana, providing comprehensive visibility into system performance. Expertise in GitOps for managing infrastructure as code and integrating automated testing into CI/CD pipelines, enhancing efficiency and reducing manual effort. Passionate about leveraging technology to improve operational workflows and drive continuous improvement.
Awards
Work Experience
· Led a team of 5, providing guidance and oversight to ensure the effective completion of tasks and successful implementation of projects.
· Collaborated with cross-functional teams, including developers and API teams, to streamline tasks and enhance workflow efficiency, ensuring smooth integration of services and timely completion of project milestones.
· Automated infrastructure provisioning using Terraform and GoCD, streamlining deployment processes across cloud environments.
· Reduced manual intervention, improving deployment efficiency and enhancing system reliability through continuous integration and delivery (CI/CD) pipelines.
· Implemented a highly available Kong Gateway solution using AWS EKS, Load Balancers, and S3 for DR setup.
· Leveraged Terraform for infrastructure as code (IaC) and Helm charts for seamless deployment of microservices.
· Architected and maintained an active-active Disaster Recovery (DR) setup, ensuring high availability and failover across AWS regions.
· Created automated CI/CD pipelines for deploying Kong updates and custom plugins across multiple environments (DEV, UAT, PROD).
· Led the migration of MuleSoft applications from on-premises infrastructure to AWS, ensuring smooth transition.
· Utilized Terraform for infrastructure provisioning and Helm charts for efficient deployment on AWS EKS.
· Implemented and managed ELK stack, Prometheus, and Jaeger tracing in AWS EKS for comprehensive monitoring and logging of Kong and MuleSoft applications.
· Enabled real-time visibility into application performance, ensuring early detection of issues and optimizing system reliability.
· Configured and deployed Nginx Ingress as a proxy for API servers, optimizing traffic routing and load balancing in a microservices architecture.
· Enhanced API performance and security by managing access control and SSL termination through Nginx.
· Resolved vulnerabilities across applications and infrastructure, implementing robust security measures to enhance overall system security and compliance.
· Provided on-call support for production-related issues.
- Creating and managing resource pools and adding VMs to them.
- Creating and managing virtual machines and installing VMware Tools on VMs.
- Security hardening of Windows operating systems for security compliance.
- Experience with firmware upgrade activities.
- Installing, configuring, and troubleshooting vSphere (ESXi, vCenter, and PSC) and providing RCA.
- Creating templates from VMs, deploying VMs from templates, ISO images, and MDT, and allocating resources.
- Reserving memory and CPU for highly critical servers and enabling hot-plug options for CPU and RAM while building servers.
- Configuring virtual switches and NIC teaming.
- Performing snapshots, cloning, cold migrations, and hot migrations.
- Configuring and troubleshooting Storage vMotion.
- Moving VMs from one host to another using vMotion.
- Performing P2V and V2V migrations.
- Mapping storage and creating datastores.
- Managing VMware clusters and enabling HA, DRS, and FT features in a cluster.
- Managing tasks, events, and alarms.
- Patch management for ESX servers and Windows servers through Update Manager.
- Installing and configuring RODC in multiple locations.
- Working on low, medium, high, and critical tickets within SLAs by analysing the issue, providing permanent fixes, and ensuring that the business is not impacted.
- Creating and implementing changes for incidents.
- Managing servers remotely through DRAC in case of server failure.
- Maintaining communication with customers and next-level managers and handling escalations.
- Handling daily internal calls and bridge meetings.
- Installing and configuring network printers.
- Handling backup, antivirus, and storage issues as per the incident ticket.
- Installing software on client servers as per incidents and changes.
- Installing antivirus on servers remotely using SEPM.
- Applying the latest service packs (patches) using a WSUS server.
- Managing partitions with RAID levels (RAID 0, RAID 1, and RAID 5).
- Managing system and group policies.
- Implementing security for files and directories.
- Responding to customer tickets.
- Experience with Linux servers in virtualized environments.
- Maintaining UNIX/Linux operating systems to provide optimum performance and system availability.
- Installing and configuring the Red Hat Linux operating system.
- Experience installing, configuring, and maintaining services such as Java, Apache, and MySQL.
- Shell scripting.
- Deploying applications in preproduction and production environments.
- Handling major releases and version changes.
- Troubleshooting application performance issues.