Senior Site Reliability Engineer (SRE) | Antal Tech Jobs
3 Days ago

Senior Site Reliability Engineer (SRE)

Chennai, Tamil Nadu, India
Information Technology
Full-Time
Aviato Consulting

Overview

Are you a seasoned SRE looking to make a significant impact on large-scale cloud environments? Want to deepen your GCP expertise while being mentored by an ex-Google SRE leader?

Aviato Consulting is seeking an experienced Senior Site Reliability Engineer to join our growing team. This isn't just another SRE role; it's an opportunity to own critical infrastructure, drive technical strategy, and shape the reliability culture for major Australian and EU clients, all within a supportive, Google-inspired environment built on transparency and collaboration.

What's In It For You?

  • Learn from the Best: Report directly to and receive mentorship from our Head of SRE, an experienced ex-Google SRE Manager. Gain invaluable insights into scaling, reliability, and leadership honed at one of the world's tech giants.
  • High-Impact Projects: Take ownership of complex GCP environments for diverse, significant clients across Australia and the EU. Your work directly influences the stability and performance of critical systems.
  • Drive Innovation, Not Just Tickets: We empower our Senior SREs to think strategically. You'll architect solutions, implement cutting-edge practices (SLOs, error budgets, advanced automation), and proactively improve systems, not just react to issues.
  • A Culture That Works: Founded by ex-Googlers, we foster a transparent, collaborative, and low-bureaucracy environment where doing the right thing matters. We value SRE principles and give you the autonomy to implement them effectively.
  • Cutting-Edge Tech: Deepen your expertise with GCP, Kubernetes, Terraform, modern observability tooling (Grafana, Dynatrace, Sentry), and sophisticated CI/CD pipelines.

What You'll Do (Your Impact):

  • Own & Architect Reliability: Design, implement, and manage highly available, scalable, and resilient architectures on Google Cloud Platform (GCP) for key customer environments.
  • Lead GCP Expertise: Serve as a subject matter expert for GCP within the team and potentially the wider organisation, driving best practices for security, cost optimisation, and performance.
  • Master Kubernetes at Scale: Architect, deploy, secure, and manage production-grade Kubernetes clusters (GKE preferred), ensuring optimal performance and reliability for critical applications (including API platforms like Apigee, though prior Apigee experience isn't mandatory).
  • Drive Automation & IaC: Lead the design and implementation of robust automation strategies using Terraform, Ansible, and scripting (Python, Go, Bash) for provisioning, configuration management, and CI/CD pipelines (Jenkins, GitHub Actions, etc.).
  • Elevate Observability: Architect and refine comprehensive monitoring, logging, and alerting strategies using tools like Grafana, Dynatrace, and Sentry to ensure proactive issue detection and rapid response.
  • Lead Incident Response & Prevention: Spearhead incident management efforts, conduct blameless post-mortems, and drive the implementation of preventative measures to continuously improve system resilience.
  • Champion SRE Principles: Actively promote and embed SRE best practices (SLOs, SLIs, error budgets) within delivery teams and operational processes.
  • Mentor & Collaborate: Share your expertise, potentially mentor junior team members, and collaborate effectively across teams to foster a strong reliability culture.

What You'll Bring (Your Expertise):

  • Proven SRE Experience: 5+ years of hands-on experience in a Site Reliability Engineering, DevOps, or Cloud Engineering role, with a significant focus on production systems.
  • Deep GCP Knowledge: Demonstrable, in-depth expertise in designing, deploying, and managing services within Google Cloud Platform (Compute Engine, GKE, Networking, IAM, Cloud SQL/Spanner, Pub/Sub, Monitoring/Logging, etc.). GCP certifications are a plus.
  • Strong Kubernetes Skills: Proven experience managing Kubernetes clusters in production environments (GKE highly desirable). Understanding of networking, security, and operational best practices within Kubernetes.
  • Infrastructure as Code Mastery: Significant experience using Terraform in complex environments. Proficiency with configuration management tools (Ansible, Puppet, Chef) is beneficial.
  • Automation & Scripting Prowess: Strong proficiency in scripting languages like Python or Go, with experience in automating operational tasks and building tooling.
  • Observability Expertise: Experience implementing and leveraging monitoring, logging, and tracing tools (e.g., Prometheus, Grafana, ELK Stack, Dynatrace, Datadog, Sentry).
  • Problem-Solving Acumen: Strong analytical and troubleshooting skills, with experience leading incident response for critical systems.
  • Collaboration & Communication: Excellent communication skills and a collaborative mindset, with the ability to explain complex technical concepts clearly. Experience mentoring others is advantageous.
  • Desirable: Experience with API Management platforms (Apigee, Kong, etc.), advanced networking concepts, or security hardening in cloud environments.

Technologies We Use (You'll Master):

  • Cloud: Google Cloud Platform (GCP)
  • Containerisation & Orchestration: Kubernetes (GKE), Docker
  • Infrastructure & Automation: Terraform, Ansible
  • Monitoring & Observability: Grafana, Dynatrace, Sentry, Google Cloud Operations Suite
  • CI/CD: Jenkins, GitHub Actions, Bamboo (or similar)
  • Scripting: Python, Go, Bash
  • Collaboration: JIRA, Confluence, Slack

Ready to Elevate Your SRE Career?

If you're a passionate Senior SRE ready to tackle complex challenges on GCP, work with leading clients, and benefit from exceptional mentorship in a fantastic culture, Aviato is the place for you. Apply now and help us build the future of reliable cloud infrastructure!
