Back to Jobs

3 Days ago

Senior Site Reliability Engineer (SRE)

Apply Now

Chennai, Tamil Nadu, India

Information Technology

Full-Time

Aviato Consulting

Overview

Are you a seasoned SRE looking to make a significant impact on large-scale cloud environments? Want to deepen your GCP expertise while being mentored by an ex-Google SRE leader?

Aviato Consulting is seeking an experienced Senior Site Reliability Engineer to join our growing team. This isn't just another SRE role; it's an opportunity to own critical infrastructure, drive technical strategy, and shape the reliability culture for major Australian and EU clients, all within a supportive, G-inspired environment built on transparency and collaboration.

What's In It For You?

Learn from the Best: Report directly to and receive mentorship from our Head of SRE, an experienced ex-Google SRE Manager. Gain invaluable insights into scaling, reliability, and leadership honed at one of the world's tech giants.
High-Impact Projects: Take ownership of complex GCP environments for diverse, significant clients across Australia and the EU. Your work directly influences the stability and performance of critical systems.
Drive Innovation, Not Just Tickets: We empower our Senior SREs to think strategically. You'll architect solutions, implement cutting-edge practices (SLOs, error budgets, advanced automation), and proactively improve systems, not just react to issues.
A Culture That Works: Founded by ex-Googlers, we foster a transparent, collaborative, and low-bureaucracy environment where doing the right thing matters. We value SRE principles and give you the autonomy to implement them effectively.
Cutting-Edge Tech: Deepen your expertise with GCP, Kubernetes, Terraform, modern observability tooling (Grafana, Dynatrace, Sentry), and sophisticated CI/CD pipelines.

What You'll Do (Your Impact):

Own & Architect Reliability: Design, implement, and manage highly available, scalable, and resilient architectures on Google Cloud Platform (GCP) for key customer environments.
Lead GCP Expertise: Serve as a subject matter expert for GCP within the team and potentially wider organisation, driving best practices for security, cost optimization, and performance.
Master Kubernetes at Scale: Architect, deploy, secure, and manage production-grade Kubernetes clusters (GKE preferred), ensuring optimal performance and reliability for critical applications (including API platforms like Apigee, though prior Apigee experience isn't mandatory).
Drive Automation & IaC: Lead the design and implementation of robust automation strategies using Terraform, Ansible, and scripting (Python, Go, Bash) for provisioning, configuration management, and CI/CD pipelines (Jenkins, GitHub Actions, etc.).
Elevate Observability: Architect and refine comprehensive monitoring, logging, and alerting strategies using tools like Grafana, Dynatrace, and Sentry to ensure proactive issue detection and rapid response.
Lead Incident Response & Prevention: Spearhead incident management efforts, conduct blameless post-mortems, and drive the implementation of preventative measures to continuously improve system resilience.
Champion SRE Principles: Actively promote and embed SRE best practices (SLOs, SLIs, error budgets) within delivery teams and operational processes.
Mentor & Collaborate: Share your expertise, mentor junior team members (potentially), and collaborate effectively across teams to foster a strong reliability culture.

What You'll Bring (Your Expertise):

Proven SRE Experience: 5+ years of hands-on experience in a Site Reliability Engineering, DevOps, or Cloud Engineering role, with a significant focus on production systems.
Deep GCP Knowledge: Demonstrable, in-depth expertise in designing, deploying, and managing services within Google Cloud Platform (Compute Engine, GKE, Networking, IAM, Cloud SQL/Spanner, Pub/Sub, Monitoring/Logging etc.). GCP certifications are a plus.
Strong Kubernetes Skills: Proven experience managing Kubernetes clusters in production environments (GKE highly desirable). Understanding of networking, security, and operational best practices within Kubernetes.
Infrastructure as Code Mastery: Significant experience using Terraform in complex environments. Proficiency with configuration management tools (Ansible, Puppet, Chef) is beneficial.
Automation & Scripting Prowess: Strong proficiency in scripting languages like Python or Go, with experience in automating operational tasks and building tooling.
Observability Expertise: Experience implementing and leveraging monitoring, logging, and tracing tools (e.g., Prometheus, Grafana, ELK Stack, Dynatrace, Datadog, Sentry).
Problem-Solving Acumen: Strong analytical and troubleshooting skills, with experience leading incident response for critical systems.
Collaboration & Communication: Excellent communication skills and a collaborative mindset, with the ability to explain complex technical concepts clearly. Experience mentoring others is advantageous.
(Desirable): Experience with API Management platforms (Apigee, Kong, etc.), advanced networking concepts, or security hardening in cloud environments.

Technologies We Use (You'll Master):

Cloud: Google Cloud Platform (GCP)
Containerisation & Orchestration: Kubernetes (GKE), Docker
Infrastructure & Automation: Terraform, Ansible
Monitoring & Observability: Grafana, Dynatrace, Sentry, Google Cloud Operations Suite
CI/CD: Jenkins, GitHub Actions, Bamboo (or similar)
Scripting: Python, Go, Bash
Collaboration: JIRA, Confluence, Slack

Ready to Elevate Your SRE Career?

If you're a passionate Senior SRE ready to tackle complex challenges on GCP, work with leading clients, and benefit from exceptional mentorship in a fantastic culture, Aviato is the place for you. Apply now and help us build the future of reliable cloud infrastructure!

Share job

Similar Jobs

View All

1 Day ago

Data Analyst (Odia Speakers)

AI & Machine Learning Advancement

1 - 20 Yrs
Jharkhand, Andhra Pradesh, Odisha

For thousands of years, maps have provided humans with the knowledge they need to make decisions. As a Maps Evaluator, you will have the opportunity to provide ground truth for your town, city or country. At Peroptyx, we are looking for Data Ana...

More info

1 Day ago

Data Analyst (Kannada Speakers)

AI & Machine Learning Advancement

1 - 20 Yrs
Karnataka, India

More info

1 Day ago

Technical Lead - Backend Development - Node.Js

Finance & Banking

50,00,000 - 55,00,000 INR - Annual
6 - 8 Yrs
Bangalore

What youʼll be doing We are much more than our job descriptions, but here is where you will begin: ● Collaborate with stakeholders, including product owners, project managers, and scrum masters, to define and clarify project requirements. ● Transl...

More info

1 Day ago

Engineering Manager

Finance & Banking

55,00,000 - 60,00,000 INR - Annual
8 - 12 Yrs
Bangalore

What youʼll be doing Weʼre much more than our job descriptions, but hereʼs where youʼll begin: ● Lead and deliver large-scale platform and product initiatives that impact millions of users. ● Collaborate with product, design, and business teams to...

More info

1 Day ago

Technical Project Manager (WordPress)

Information Technology

6 - 10 Yrs
Ahmedabad

Location: Ahmedabad / Remote Experience: 6+ Years About Us: E2M Solutions is home to a growing remote team of WordPress experts building innovative digital solutions. We pride ourselves on delivering consistent value and excellence to our client...

More info

1 Day ago

Senior WordPress Frontend Developer

Information Technology

5 - 10 Yrs
Ahmedabad

At E2M Solutions, we're building a powerhouse remote team to deliver high-performing, user-centric WordPress solutions. If you live and breathe frontend development and are looking to work on cutting-edge WordPress projects with a passionate global t...

More info

2 Days ago

Python Developer - C++/EDA

Information Technology

Chennai, Tamil Nadu, India

Job DescriptionWe are seeking a highly skilled C++ Python Developer with a strong background in software development, scripting, and EDA tool integration. This role focuses on creating, enhancing, and maintaining tools used in silicon design and ver...

More info

2 Days ago

Business Analyst

Information Technology

Chennai, Tamil Nadu, India

Project Role : Business AnalystProject Role Description : Analyze an organization and design its processes and systems, assessing the business model and its integration with technology. Assess current state, identify customer requirements, and defin...

More info

Talk to us

Feel free to call, email, or hit us up on our social media accounts.

Email info@antaltechjobs.in