Site Reliability Engineer (SRE) - Incident Commander | Antal Tech Jobs
2 Days ago

Site Reliability Engineer (SRE) - Incident Commander

Hyderabad, Telangana, India
Space Exploration & Research, Information Technology
Full-Time
Siemens Digital Industries Software

Overview

Siemens Digital Industries Software is a leading provider of solutions for the design, simulation, and manufacture of products across many different industries. Formula 1 cars, skyscrapers, ships, space exploration vehicles, and many of the objects we see in our daily lives are being conceived and manufactured using our Product Lifecycle Management (PLM) software.

The DISW SRE organization is dedicated to enhancing service and application availability, optimizing processes by automating manual and repetitive tasks, and addressing complex technical challenges in a dynamic, collaborative, inclusive, and iterative environment. This position plays a crucial role in developing automated solutions and processes that support and sustain best-in-class cloud-based applications. The candidate will support the Siemens Xcelerator platform and will be responsible for coordinating major incident response, maintaining partner communication during service-impacting events, and facilitating resolution in compliance with service level agreements (SLAs). Strong communication and coordination skills are necessary to support these core objectives. This role's success will be defined by product teams within DISW business units meeting their SLAs.

Key Responsibilities

  • Incident Management: Act as the primary point of contact and leader during major incidents, coordinating the response, communication, and resolution efforts across all involved teams.
  • Incident Response: Quickly assess the severity of incidents, determine the impact, and drive the appropriate response to restore services as quickly as possible.
  • Communication: Ensure clear, concise, and timely communication with stakeholders, including technical teams, management, and customers, throughout the incident lifecycle.
  • Post-Incident Analysis: Lead post-incident reviews to identify root causes, drive improvements, and implement preventive measures to reduce the likelihood of recurrence.
  • Collaboration: Work closely with SRE, DevOps, Development, and other relevant teams to ensure that incident management processes are well-defined and continuously improved.
  • Training & Preparedness: Conduct regular incident response drills, train teams on incident management processes, and ensure readiness for handling high-severity incidents.
  • Documentation: Maintain and update incident management documentation, ensuring that all procedures are up-to-date and accessible to all relevant teams.
  • Monitoring & Alerts: Collaborate with SRE and monitoring teams to define and refine alerting criteria, ensuring that incidents are detected and escalated promptly.
  • Continuous Improvement: Find opportunities to improve system reliability, scalability, and performance based on lessons learned from incidents.
  • On-Call Rotation: Participate in a 24x7 on-call rotation.

Qualifications

  • Technical Skills: Familiar with cloud infrastructure (AWS, GCP, Azure) and containerization (Docker, Kubernetes).
  • Certifications: Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator) are a plus.
  • Automation: Experience with automation tools and scripting languages (e.g., Python, Bash) to streamline incident response and remediation.
  • Stakeholder Management: Experience aligning with cross-functional teams including business and product stakeholders during and after incidents.
  • Metrics Ownership: Ability to define and track incident-related critical metrics (e.g., MTTR, MTTD) to drive accountability and improvement.
  • Experience: Experience working in an enterprise IT setting with distributed environments.
  • Communication: Outstanding English communication skills, both verbal and written, as well as listening and synthesis skills.
  • Incident Response: Ability to quickly assess the severity of incidents, determine their impact, and drive the appropriate response to restore services as quickly as possible.
  • Problem-Solving: Excellent troubleshooting and problem-solving skills, with the ability to quickly analyze complex systems.
  • Calm Under Pressure: Ability to remain calm, focused, and effective in high-pressure situations, and to make quick, confident decisions.
  • Leadership: Demonstrated experience in leading incident response efforts and managing cross-functional teams during critical situations.
  • Technical Skills: Familiar with Jira Service Management (or an equivalent, e.g. ServiceNow), Datadog (or an equivalent, e.g. Grafana), PagerDuty (or equivalent), and Atlassian Statuspage (or equivalent).
  • Driven Learner: Highly motivated and driven to learn new technologies, skills, and methodologies, continuously seeking to expand your knowledge and adapt to evolving industry trends.
  • Availability: Must be willing and available to work the core hours required.

We are a collection of over 377,000 minds building the future, one day at a time, in over 200 countries. We're dedicated to equality, and we welcome applications that reflect the diversity of the communities we work in. All employment decisions at Siemens are based on qualifications, merit, and business need. Bring your curiosity and creativity and help us shape tomorrow! We offer a comprehensive reward package which includes a competitive basic salary, bonus scheme, generous holiday allowance, pension, and private healthcare.

Disclaimer: Please note that, due to the current integration framework, this opportunity is currently available exclusively to employees of Altair and DISW. While there is a possibility that the position may be made available to all Siemens employees through a future external posting, this is not guaranteed. We appreciate your understanding and cooperation during this transitional period. This communication does not constitute a promise or guarantee of future employment opportunities beyond the current scope.

Transform the everyday

Accelerate transformation

#SWSaaS

