Information Technology
Full-Time
ZettaMine Labs Pvt. Ltd.
Overview
Job Title: DevOps Site Reliability Engineer
Location: Bangalore (Work from Office)
Experience: 5–6 Years
Employment Type: Full-Time
Role Overview:
We are looking for a DevOps Site Reliability Engineer (SRE) with a strong blend of Python development, DevOps practices, and cloud infrastructure expertise. In this role, you'll ensure the reliability, scalability, and performance of critical services, automate infrastructure management, and support high-scale CI/CD pipelines for fast-paced development teams.
Key Responsibilities:
- Build and manage highly available, scalable, and fault-tolerant systems on AWS, Azure, or Google Cloud.
- Develop automation tools and solutions in Python to support reliability and efficiency.
- Collaborate with development and DevOps teams to build and maintain robust CI/CD pipelines (e.g., Jenkins, ArgoCD).
- Monitor system performance and reliability using tools like Grafana, Datadog, or Splunk.
- Design and manage Infrastructure as Code using tools like Terraform.
- Support and scale messaging systems like Kafka or RabbitMQ.
- Manage containerized applications and Kubernetes clusters.
- Work with databases such as PostgreSQL, Redis, and Elasticsearch to ensure performance and availability.
Required Skills:
- 5–6 years of experience, including:
  - Solid Python coding experience (beyond scripting).
  - Hands-on experience in DevOps or SRE roles.
  - Experience with public cloud platforms (preferably AWS).
  - Knowledge of CI/CD, observability, and infrastructure automation tools.
- Excellent problem-solving and communication skills.
Preferred Skills:
- Experience with message queues like Kafka or RabbitMQ.
- Familiarity with Terraform, Kubernetes, and monitoring/logging tools.
- Understanding of SQL and NoSQL databases.