Senior Site Reliability Engineer / DevOps Engineer / Platform or Cloud Engineer - Hybrid (Navi Mumbai)
Overview
Location: Navi Mumbai (Please note that we are only considering candidates currently based in Mumbai. Applications from other locations will not be reviewed)
Position Type: Full-time, Permanent
Work model: Hybrid
Shifts: Rotational EU & UK coverage (7:30 AM to 4:30 PM IST / 1:00 PM to 10:00 PM IST, plus one night on-call shift per month)
Industry: Commodities & Energy
Reports To: Manager, Site Reliability
About Us:
Founded in 1995, Zema Global Data Corporation empowers organisations across global energy, commodity, and financial markets to simplify complexity, reduce risk, and make faster, more confident decisions that drive measurable results. Over the past two years, Zema Global has accelerated its growth through strategic investment and acquisition to strengthen its global leadership. Together we’re helping our customers gain a Decisioning Advantage, one bold idea at a time.
At Zema Global, we Think Big, Make It Happen, and Win as One. We thrive on collaboration, creativity, and respect, united by a shared drive to innovate and deliver meaningful impact for our customers and communities. If you’re inspired by solving complex challenges and contributing to a culture that values purpose and performance, we invite you to join us.
POSITION OVERVIEW:
This is a backend-focused SRE / DevOps Engineer role responsible for system reliability, stability, and automation across AWS-based environments.
The person in this role ensures services are available, monitored, scalable and secure. They troubleshoot production issues, improve infrastructure, automate processes and work closely with internal product and engineering teams.
The role has no direct client-facing exposure and requires a strong operational mindset. This is hands-on infrastructure and reliability engineering, not pure software development.
KEY RESPONSIBILITIES
- Maintain and support the products and data systems: proactively monitor events, investigate issues, analyse solutions, and drive problems through to resolution.
- Work with the product team to define application hardening measures and identify opportunities for chaos engineering.
- Use operational tools and monitoring platforms to build an in-depth understanding of system availability, performance, and capacity, and to monitor them continuously.
- Work with business partners to establish Service Level Indicators and Objectives (SLIs and SLOs).
- Implement an alerting strategy that makes alerts actionable and unique.
- Assist with the management and support of Unix/Linux servers that run Commodity Data custom services.
- Adhere to best practices, develop efficiencies, and improve the department’s scalability.
- Work closely with other groups to resolve problems, deploy and release new products, and deliver world-class service, solutions, and support.
REQUIRED QUALIFICATIONS
Must Have
- Bachelor’s degree in Computer Science or equivalent practical experience.
- 3+ years of experience in DevOps, Site Reliability Engineering, or infrastructure-focused engineering roles.
Note: Candidates with 5+ years of experience may be considered for a Senior title, depending on depth of experience.
- Working knowledge of Linux systems administration, including troubleshooting and supporting production environments.
- Hands-on experience with AWS services such as EC2, IAM, VPC, S3, and CloudWatch.
- Experience supporting and operating AWS-based infrastructure, including working within established environments.
- Strong scripting skills using Python.
- Strong Bash scripting experience.
- Experience troubleshooting production incidents and resolving infrastructure or application stability issues.
- Experience working with CI/CD tools and pipelines (Jenkins preferred).
- Understanding of DevOps principles and automation practices.
- Experience using monitoring and alerting tools to maintain system health and availability.
- Strong communication skills in English and ability to collaborate across engineering and product teams.
- High level of ownership, problem-solving mindset and ability to adapt quickly to new challenges.
Nice to Have
- AWS certification(s)
- Experience defining SLIs / SLOs
- Experience designing alerting strategies
- Experience with chaos engineering concepts
- Windows systems knowledge
- Exposure to relational databases (SQL, MySQL, Oracle)
- Good knowledge of programming in one or more popular languages such as Java, C#, JavaScript, Ruby, PowerShell, etc.
- Experience managing AWS permissions and IAM policies
- Experience improving scalability and performance of systems
Why Zema Global?
- Be part of a rapidly growing company shaping how data drives decisions in energy and commodities.
- Work with cutting-edge technology alongside industry experts.
- Significant opportunity to impact strategy, revenue growth, and decision-making.
- Join a culture that values innovation, collaboration, and autonomy to drive meaningful change.
How to Apply
Please submit your CV in PDF format, highlighting your relevant experience (English CVs only). Only shortlisted candidates will be contacted. No agency submissions, please.
*** No visa sponsorship is available for this position ***
Equality and Diversity: Zema Global is committed to diversity and inclusion. We encourage applications from all qualified individuals and do not discriminate based on race, gender, sexual orientation, disability, or any other protected status.