
Overview
About the job:
Key responsibilities:
1. Reliability and Performance:
- Design, implement, and maintain scalable and reliable systems and services.
- Monitor system performance, availability, and reliability, proactively identifying and resolving issues.
2. Observability Implementation:
- Apply Databricks observability tools to develop and maintain dashboards, alerts, and reporting mechanisms that provide insights into system performance and usage.
- Establish and improve observability frameworks to track key performance indicators (KPIs) and service-level objectives (SLOs).
3. Incident Management:
- Respond to and fix production incidents, performing root cause analysis and implementing corrective actions to prevent future occurrences.
- Collaborate with cross-functional teams to ensure effective incident response processes and documentation.
4. Automation and Efficiency:
- Develop automation scripts and tools to streamline operational tasks, improve deployment processes, and enhance system reliability.
- Contribute to the continuous improvement of deployment pipelines and infrastructure as code (IaC) practices.
5. Collaboration and Documentation:
- Work closely with development teams to understand application architectures and contribute to system design discussions.
- Document processes, best practices, and system architecture to facilitate knowledge sharing and onboarding.
6. Performance Optimization:
- Analyze system performance and application usage patterns to recommend and implement optimizations that improve efficiency and reduce costs.
Who can apply:
Only those candidates can apply who:
- have a minimum of 3 years of experience
- are Computer Science Engineering students
Salary:
Competitive salary
Experience:
3 year(s)
Deadline:
2025-09-09 23:59:59
Skills required:
Python, SQL, Problem Solving, Scala, Power BI, Effective Communication and Grafana
Other Requirements:
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- 3 to 6 years of experience in Platform Engineering, DevOps, or a related field.
- Experience with Databricks, including its observability and monitoring features.
- Experience with the Grafana observability platform.
- Familiarity with cloud platforms (Azure).
- Programming language skills: Python, Scala.
- SQL knowledge for data extraction and transformation.
- Experience in Power BI development, covering both semantic models and visualizations.
- Experience building Grafana visualizations.
- Problem-solving skills and the ability to work in a fast-paced, collaborative environment.
- Good communication skills, with the ability to convey complex technical concepts to non-technical stakeholders.
- A proactive attitude with a focus on continuous improvement and learning.
- Open to exploring and experimenting with new SRE processes and tools to support the technical requirements of the D&A platform.
- Willing to proactively seek new opportunities to learn and to put new knowledge into practice.
About Company:
Procter & Gamble (P&G India) is part of the global P&G group, which was founded in the U.S. in 1837. It set up its Indian office in 1967 and, in 2025, reported revenues of about $2 billion. P&G India sells popular FMCG brands in personal care, hygiene, and household products, including Pampers, Ariel, Tide, Gillette, Pantene, and Vicks. The company has around 4,000 employees in India and runs several manufacturing and R&D facilities across the country. It continues to focus on premium hygiene products, sustainable packaging, rural outreach, and digital brand engagement.