| Company | PhonePe Limited |
| Job Title | Site Reliability Engineer (SRE 2/3) |
| Location | Bengaluru, India |
| Job Type | Full Time |
| Work Mode | Likely Work From Office / Hybrid |
| Experience Required | 5 – 12 years in SRE / DevOps roles |
| Role Summary | Responsible for managing, scaling, and ensuring high availability of cloud infrastructure, automation, and networking for mission-critical systems. |
| Cloud Platforms | Microsoft Azure or AWS (deep expertise in one required) |
| Key Responsibilities | Cloud infrastructure management, networking setup, automation using Terraform, configuration management, database HA setup, monitoring & observability, incident management, cost optimization |
| Infrastructure Skills | Linux (Ubuntu), Virtual Machines, Cloud-native services (S3, CloudWatch, ADX, etc.) |
| Networking Skills | Firewalls, Route Tables, VPC/VNet, VPN (IPsec), ExpressRoute/Direct Connect, DNS, BGP |
| Automation & DevOps | Terraform, Ansible/Saltstack, scripting (Python, Go, Java), Docker |
| Database Skills | MySQL, Aerospike, replication, backup strategies |
| Monitoring Tools | Prometheus, Grafana, Loki, Victoria Metrics, Riemann |
| Middleware | Nginx, HAProxy, RabbitMQ |
| Key Concepts | SLO/SLI, Incident Management, RCA, Toil Reduction, High Availability Systems |
| Soft Skills | Problem solving, ownership mindset, collaboration, ability to work in high-pressure environments |
| Benefits | Insurance, PF, Gratuity, NPS, Parental benefits, Relocation, Education assistance, Car lease |