Senior Cloud Infrastructure Engineer - Observability | Antal Tech Jobs
15 Weeks ago

Senior Cloud Infrastructure Engineer - Observability

Bangalore, Karnataka, India
Information Technology
Full-Time
Splunk

Overview

Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other’s success.
The Splunk Observability Cloud provides full-fidelity monitoring and troubleshooting across infrastructure, applications, and user interfaces, in real time and at any scale, to help our customers keep their services reliable, innovate faster, and deliver great customer experiences. Infrastructure Software Engineers at Splunk are cloud-native systems engineers who use infrastructure-as-code, microservices, automation, and efficient design to build, operate, and scale our products.
Role
You will help us run one of the largest and most sophisticated cloud-scale, big data, and microservices platforms in the world. You will be responsible for enabling developers to operate highly available, scalable, and cost-efficient applications with low operational burden by managing and improving the reliability and resiliency of SRE-managed services and infrastructure. You thrive on automation, infrastructure-as-code, reliability engineering, and getting rid of tedious, manual tasks.
You will:
  • Design new services, tools, and monitoring to be implemented by the entire team.
  • Analyze the tradeoffs of the proposed design and make recommendations based on these tradeoffs.
  • Mentor new engineers to achieve more than they thought possible. You enjoy making other teams successful and are fulfilled through the success of others.
Work on reliability projects, including:
  • HA, Business Continuity Planning, disaster recovery, backup/restore, RTO, RPO
  • Chaos engineering
  • Application uptime and performance
  • Capacity management & planning
  • SLIs, SLOs, error budgets, and monitoring dashboards
  • Deployment and operations of large-scale distributed data stores and streaming services
  • Establishing design patterns for monitoring and benchmarking
  • Establishing and documenting production run books and guidelines for developers
  • Tooling, toil reduction, runbooks & automation to handle production environments
  • Incident management and improving MTTD/MTTR for services
  • Cloud cost optimization
Qualifications
Must-Have:
  • 9+ years of SRE experience in handling large-scale cloud-native microservices platforms.
  • 4+ years of strong hands-on experience deploying, managing, and monitoring large-scale Kubernetes clusters in the public cloud, specifically AWS or GCP.
  • Experience with infrastructure automation and scripting using Python and/or bash scripting.
  • Strong hands-on experience in monitoring tools such as Splunk, Prometheus, Grafana, ELK stack, etc. in order to build observability for large-scale microservices deployments.
  • Excellent problem-solving, triaging, and debugging skills in large-scale distributed systems.
Preferred:
  • AWS Solutions Architect certification.
  • Confluent Certified Administrator for Apache Kafka and/or Apache Cassandra Administrator Associate certifications.
  • Experience with Infrastructure-as-Code using Terraform, CloudFormation, Google Deployment Manager, Pulumi, Packer, ARM, etc.
  • Experience with deployment and operations of large-scale clusters for Cassandra, Kafka, Elasticsearch, MongoDB, ZooKeeper, Redis, etc.
  • Experience with CI/CD frameworks and Pipeline-as-Code such as Jenkins, Spinnaker, GitLab, Argo, Artifactory, etc.
  • Proven skills to effectively work across teams and functions to influence the design, operations, and deployment of highly available software.
Bachelor's/Master's in Computer Science, Engineering, or a related technical field, or equivalent practical experience.

We value diversity, equity, and inclusion at Splunk and are an equal employment opportunity employer. Qualified applicants receive consideration for employment without regard to race, religion, color, national origin, ancestry, sex, gender, gender identity, gender expression, sexual orientation, marital status, age, physical or mental disability or medical condition, genetic information, veteran status, or any other consideration made unlawful by federal, state, or local laws. We consider qualified applicants with criminal histories, consistent with legal requirements.
