Position: SRE Transformation Lead
Location: Pune, Maharashtra
Experience
Relevant Experience: 12-15 years.
Objectives of this role
We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Transformation Lead to drive our SRE initiatives and ensure the stability, scalability, and performance of our systems. The ideal candidate will have a strong background in SRE practices, a deep understanding of self-healing mechanisms, and expertise in building and managing dashboards for monitoring and alerting.
Responsibilities
Lead transformation initiatives that implement SRE practices across the organization.
Design and implement self-healing solutions to automate incident response and resolution.
Develop and maintain playbooks and runbooks for automated recovery processes.
Collaborate with development teams to embed self-healing capabilities into applications and services.
Develop comprehensive monitoring strategies to ensure system health and performance.
Create and manage dashboards that provide real-time insights into system metrics and KPIs.
Implement alerting mechanisms to proactively identify and address potential issues.
Establish best practices for incident management and train teams on effective response strategies.
Work closely with cross-functional teams, including development, operations, and product management.
Experience with chatbot development and integration.
Hands-on experience with ITSM tools such as ServiceNow, including extracting data and streaming it to visualization tools.
Technical Skills
Strong knowledge of SRE principles, practices, and tools.
Expertise in developing and managing self-healing systems.
Proficiency in monitoring and dashboard tools (e.g., Incorta, Prometheus, Grafana, Datadog, Virtana, OpsRamp).
Solid understanding of cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).
Experience with scripting languages (Python, Bash) and automation tools (Ansible, Terraform).