3 Days ago

Site Reliability Engineer (SRE)

Information Technology

Full-Time

Keka HR

Overview

Site Reliability Engineer (SRE) – Observability & Azure Infrastructure

Location: Hyderabad

Type: Full-Time

Experience Level: 5 to 8 Years

Department: Engineering / DevOps

About The Role

We are looking for a highly skilled Site Reliability Engineer (SRE) to lead the implementation and management of our observability stack across Azure-hosted infrastructure and .NET Core applications. This role will focus on configuring and managing OpenTelemetry, Prometheus, Loki, and Tempo, along with setting up robust alerting systems across all services, including Azure infrastructure and MSSQL databases.

You will work closely with developers, DevOps, and infrastructure teams to ensure the performance, reliability, and visibility of our .NET Core applications and cloud services.

Key Responsibilities

Observability Platform Implementation:
Design and maintain distributed tracing, metrics, and logging using OpenTelemetry, Prometheus, Loki, and Tempo.
Ensure complete instrumentation of .NET Core applications for end-to-end visibility.
Implement telemetry pipelines for application logs, performance metrics, and traces.

Monitoring & Alerting

Develop and manage SLIs, SLOs, and error budgets.
Create actionable, noise-free alerts using Prometheus Alertmanager and Azure Monitor.
Monitor key infrastructure components, applications, and databases with a focus on reliability and performance.

Azure & Infrastructure Integration:
Integrate Azure services (App Services, VMs, Storage, etc.) with the observability stack.
Configure monitoring for MSSQL databases, including performance tuning metrics and health indicators.
Use Azure Monitor, Log Analytics, and custom exporters where necessary.

Automation & DevOps

Automate observability configurations using Terraform, PowerShell, or other IaC tools.
Integrate telemetry validation and health checks into CI/CD pipelines.
Maintain observability as code for repeatable deployments and easy scaling.

Resilience & Reliability Engineering:
Conduct capacity planning to anticipate scaling needs based on usage patterns and growth.
Define and implement disaster recovery strategies for critical Azure-hosted services and databases.
Perform load and stress testing to identify performance bottlenecks and validate infrastructure limits.
Support release engineering by integrating observability checks and rollback strategies in CI/CD pipelines.
Apply chaos engineering practices in lower environments to uncover potential reliability risks proactively.

Collaboration & Documentation:
Partner with engineering teams to promote observability best practices in .NET Core development.
Create dashboards (Grafana preferred) and runbooks for system insights and incident response.
Document monitoring standards, troubleshooting guides, and onboarding materials.

Required Skills And Experience

4+ years of experience in SRE, DevOps, or infrastructure-focused roles.
Deep experience with .NET Core application observability using OpenTelemetry.
Proficiency with Prometheus, Loki, Tempo, and related observability tools.
Strong background in Azure infrastructure monitoring, including App Services and VMs.
Hands-on experience monitoring MSSQL databases (deadlocks, query performance, etc.).
Familiarity with Infrastructure as Code (Terraform, Bicep) and scripting (PowerShell, Bash).
Experience building and tuning alerts, dashboards, and metrics for production systems.

Preferred Qualifications

Azure certifications (e.g., AZ-104, AZ-400).
Experience with Grafana, Azure Monitor, and Log Analytics integration.
Familiarity with distributed systems and microservice architectures.
Prior experience in high-availability, regulated, or customer-facing environments.

