Bangalore, Karnataka, India
Information Technology
Full-Time
OpenFX
Overview
The core responsibilities for the job include the following:
In your first 6 to 12 months, you will:
- Own reliability and operability of 1 to 2 production services, including services on the money path.
- Design and maintain CI/CD pipelines that enable frequent, safe deployments.
- Define and improve observability across metrics, logs, and alerts.
- Participate in on-call rotations and lead incident response when needed.
- Identify and reduce operational risk by improving resilience and failure handling.
- Automate repetitive operational workflows such as deployments, rollbacks, and recovery steps.
- Partner with Backend, Security, and Infra teams to translate system requirements into infrastructure solutions.
- Improve reliability metrics such as availability, MTTR, alert quality, and incident frequency.
You will be measured on:
- System reliability: Services meet defined SLOs for availability and latency.
- Operational clarity: Issues are detected via monitoring rather than customer reports.
- Incident handling: Incidents are mitigated quickly with a limited blast radius.
- Automation: Manual operational work decreases over time.
- Judgment: Infrastructure tradeoffs are deliberate and defensible.
- Trust: Engineering teams rely on the platforms you operate.
Requirements:
- 3 to 7 years of experience operating production systems.
- Strong fundamentals in Linux, networking, and cloud infrastructure.
- Experience running containerized workloads in production.
- Hands-on experience with CI/CD pipelines.
- Experience designing, monitoring, and alerting systems.
- Proven experience handling production incidents.
- Ability to clearly explain operational decisions.
Preferred (accelerates ramp, not required):
- Experience with Kubernetes or similar orchestration platforms.
- Infrastructure as Code experience (Terraform or equivalent).
- Operational experience with PostgreSQL, Redis, Kafka, or similar systems.
- Experience in fintech or high transaction volume systems.
- Security and compliance exposure.
- Experience mentoring junior engineers.
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in