Back to Jobs

3 Days ago

Site Reliability Engineer – Technical Lead

Apply Now

Gurugram, Haryana, India

Information Technology

Full-Time

Veryon

Overview

Description

Why We Need You – The Mission & Our Vision

Veryon is a leading software and technology company that enables aviation teams around the world to improve efficiency and safety. Our products maximize uptime for aircraft maintenance teams through customer-driven innovation and world-class service.

With over 7,500 customers across 137 countries, we serve general and business aviation, military/defense, commercial aviation, and OEMs. Our values—Fueled by Customers, Win Together, Make It Happen, Innovate to Elevate—are the foundation of everything we do.

As a hands-on Technical Lead in Site Reliability Engineering, you will be directly responsible for designing, building, and implementing modern reliability practices to ensure uptime, resilience, and production excellence across Veryon’s systems. You’ll work closely with Engineering, DevOps, and Support teams to streamline software delivery to both internal and client environments, troubleshoot production issues, and build observability using Datadog, Dynatrace, and AWS-native tools. You will also be a mentor on best practices and a key contributor to reliability-focused architecture and deployment design.

What You’ll Accomplish – Your Performance Objectives

Objective #1 – First 30 Days

Complete onboarding and gain deep understanding of Veryon’s systems, release processes, and deployment environment on AWS.
Review existing application architecture, CI/CD flows, and monitoring implementations.
Begin implementing improvements to observability using Datadog and Dynatrace.
Collaborate with engineers and DevOps to identify bottlenecks in production releases and issue resolution.

Objective #2 – First 90 Days

Build or enhance monitoring dashboards and alerts for critical infrastructure and applications.
Define and begin implementing Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
Own and improve release workflows and ensure reliable software delivery to customer environments.
Take ownership of investigating production issues, ensuring timely resolution and coordination across teams.
Begin documenting Root Cause Analyses (RCAs) for production incidents and drive preventive improvements.
Partner with DevOps to optimize and automate CI/CD pipelines using GitLab or equivalent.

Objective #3 – First 12 Months

Deliver measurable improvements in system uptime, MTTR, and deployment success rate.
Build self-healing automation and rollback mechanisms for high-risk services.
Standardize and own the RCA process for production incidents to ensure continuous learning.
Implement robust controls and metrics to monitor software delivery health.
Support production readiness of new services through performance baselining and fault testing.
Establish and track health KPIs that inform operational decisions and product improvements.

Requirements

Key Job Responsibilities

Implement and manage observability, alerting, and dashboards using Datadog, Dynatrace, and AWS tools.
Take ownership of production deployments, ensuring successful delivery to client environments with minimal disruption.
Troubleshoot and resolve production issues across the stack (infrastructure, application, integration).
Lead Root Cause Analysis (RCA) documentation, follow-ups, and remediation planning.
Define and maintain service SLOs, SLIs, and error budgets with product and engineering teams.
Build automation for deployment, monitoring, incident response, and recovery.
Design CI/CD workflows that support safe and reliable delivery across distributed environments.
Partner with developers to ensure observability and reliability are part of the application design.
Mentor engineers in SRE principles, monitoring strategy, and scalable operations.

Experience And Skills We Seek

6+ years of experience in SRE, DevOps, or platform engineering roles.
Strong hands-on experience with AWS services (e.g., EC2, ECS/EKS, RDS, IAM, CloudWatch, Route 53, ELB, etc.) is required.
Deep familiarity with CI/CD pipelines and deployment strategies using GitLab CI, Jenkins, or equivalent.
Expertise in observability tools such as Datadog and Dynatrace for APM, logging, and alerting.
Solid experience troubleshooting distributed systems in production environments.
Proficiency in scripting and infrastructure as code (e.g., Python, Bash, Terraform, Ansible).
Working knowledge of containers and orchestration (Docker, Kubernetes).
Understanding of SRE principles (SLIs, SLOs, MTTR, incident response, etc.).
Excellent communication and documentation skills, especially for RCA and runbook creation.
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.

How We Work – The Core Values That We Live By

Fueled By Customers – Everything we do is to help our customers increase uptime. Transparent communication and customer empathy drive our decisions.

Win Together – Collaboration across teams is our core strength. We believe every person is vital to our success.

Make It Happen – We take initiative, follow through, and adapt as needed. We take ownership and tackle tough challenges.

Innovate to Elevate – We embrace change, experiment boldly, and continuously improve. We lead by setting a high bar for ourselves and our industry.

Share job

Similar Jobs

View All

1 Day ago

Xpetize - Automation Test Engineer - C#/Selenium

Information Technology

Gurugram, Haryana, India

Job Title : Automation Test EngineerExperience : 3-6 yearsLocation : Chennai and coimbatoreJob Type : Full-timeJob SummaryWe are seeking a highly skilled Automation Test Engineer with experience in C# and Azure DevOps to join our QA team. The ideal ...

More info

1 Day ago

Frontend CMS Web Developer - AEM Platform

Information Technology

Gurugram, Haryana, India

NO FRESHER WILL BE PREFERRED. HIRING. We are looking for Frontend CMS Web Developer as an Employee for IT Company. Location Hyderabad. Experience 5+ years.About The RoleWe are seeking a highly skilled Frontend CMS Web Developer to join our dynamic I...

More info

1 Day ago

Full Stack Developer - React.js/Next.js/Golang

Information Technology

Gurugram, Haryana, India

Position : Full Stack Developer (React/Next js + GoLang) / only backend (GoLang)Experience : 2 - 5 YearsLocation : Rajendra Place- DelhiPosition Type : Full-TimeJob OverviewWe are looking for a skilled Full Stack Developer to design, develop, an...

More info

1 Day ago

Semi Senior Python Developer - Remote Work

Information Technology

Gurugram, Haryana, India

At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% ...

More info

1 Day ago

Senior .NET Developer - Remote Work

Information Technology

Gurugram, Haryana, India

More info

1 Day ago

Software Engineer

Information Technology

Gurugram, Haryana, India

About The JobThe Red Hat Satellite Engineering team is seeking a Software Engineer who is highly motivated and a versatile Python Developer to join our dynamic team in Pune, India. This role offers a unique opportunity to work across both developmen...

More info

1 Day ago

Data Platform Engineer

Information Technology

Gurugram, Haryana, India

Project Role : Data Platform EngineerProject Role Description : Assists with the data platform blueprint and design, encompassing the relevant data platform components. Collaborates with the Integration Architects and Data Architects to ensure cohes...

More info

1 Day ago

Business Analyst

Information Technology

Gurugram, Haryana, India

Project Role : Business AnalystProject Role Description : Analyze an organization and design its processes and systems, assessing the business model and its integration with technology. Assess current state, identify customer requirements, and defin...

More info

Talk to us

Feel free to call, email, or hit us up on our social media accounts.

Email info@antaltechjobs.in