Site Reliability Engineer | DevOps Professional
DevOps Engineer with 4+ years of experience at Infosys, currently working as a Technology Analyst. Experienced in designing, deploying, and operating Kubernetes platforms, including building clusters from scratch on virtual machines and managing enterprise and managed setups using Rancher and AKS. Implemented secure service-to-service networking (MTLS) and traffic policies using Istio service mesh, and designed centralized monitoring and logging stacks to observe systems, investigate incidents, and perform root-cause analysis. Architected the DevOps foundation for Infosys’ latest AI offering, Infosys Topaz Fabric, covering automated build and deployment pipelines, Knative-based autoscaling, service mesh integration, and production-grade observability. One of the most interesting projects built is an AI agent that communicates with Kubernetes clusters via the Kubernetes API to assist with debugging and operational workflows. Also deployed and operated AI models on Kubernetes using vLLM, with a strong focus on reliability, scalability, and efficient resource utilization.I have managed 7 Kubernetes cluster for smooth flow of features from dev to qa to pre-production and production.