Technical Lead - DevOps Engineer | Antal Tech Jobs
2 Days ago

Technical Lead - DevOps Engineer

Social Good & Community Development
Full-Time
PTC

Overview

Our world is transforming, and PTC is leading the way. Our software brings the physical and digital worlds together, enabling companies to improve operations, create better products, and empower people in all aspects of their business.

Our people make all the difference in our success. Today, we are a global team of nearly 7,000 and our main objective is to create opportunities for our team members to explore, learn, and grow – all while seeing their ideas come to life and celebrating the differences that make us who we are and the work we do possible.

Job Details

As a senior SRE / Observability Engineer, you will be part of the Atlas Platform Engineering team and will:

  • Create and maintain observability standards and best practices
  • Review the current observability platform, identify areas for improvement, and guide the team in enhancing monitoring, logging, tracing, and alerting capabilities.
  • Expand the observability stack across multiple clouds, regions, and clusters, managing all observability data.
  • Design and implement monitoring solutions for complex distributed systems that provide deep insight into systems and services, aiming for complete visibility of digital operations.
  • Support the ongoing evaluation of new capabilities in the observability stack, conducting proofs of concept, pilots, and tests to validate their suitability.
  • Assist teams in creating clear, informative, and actionable dashboards to improve system visibility.
  • Automate monitoring and alerting processes, including enrichment strategies and ML-driven anomaly detection where applicable.
  • Provide technical leadership to the observability team with clear priorities ensuring agreed outcomes are achieved in a timely manner.
  • Work closely with R&D and product development teams (understand their requirements and challenges) to ensure seamless visibility into system and service performance.
  • Work closely with the Traffic Management team to identify and standardise on existing and new observability tools as part of a holistic solution
  • Conduct training sessions and create documentation for internal teams
  • Support the definition of SLIs (service level indicators) and SLOs (service level objectives) for the Atlas services.
  • Keep track of the error budget of each service
  • Participate in the emergency response process
  • Conduct RCAs (root cause analyses)
  • Help to automate repetitive tasks and reduce toil.


Qualifications:

People And Communication Qualifications

  • Be a strong team player
  • Have good collaboration and communication skills
  • Ability to translate technical concepts for non-technical audiences
  • Problem-solving and analytical thinking


Technical Qualifications - General

  • Familiarity with cloud platforms (ideally Azure)
  • Familiarity with Kubernetes and Istio as the architecture on which the observability and Atlas services run, and how they integrate and scale.
  • Experience with infrastructure as code and automation
  • Knowledge of common programming languages and debugging techniques
  • Have a strong technical background and be hands-on.
  • Linux and scripting languages (Bash, Python, Golang).
  • Significant understanding of DevOps principles.


Technical Qualifications - Observability

  • Strong understanding of observability principles (metrics, logs, traces)
  • Experience with APM tools and distributed tracing
  • Proficiency in log aggregation and analysis
  • Knowledge of and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, Grafana, Datadog, New Relic, Sumo Logic, ELK Stack, or others
  • Knowledge of OpenTelemetry, including the OpenTelemetry (OTel) Collector and code instrumentation
  • Experience designing and building unified observability platforms that let teams use data (metrics, logs, and traces) to determine quickly whether their application or service is operating as desired.


Technical Qualifications - SRE

  • Understanding of the Google SRE principles
  • Experience in defining SLIs and SLOs
  • Experience in performing RCAs (root cause analyses)
  • Experience in system performance
  • Experience in incident response
  • Knowledge of status page tools, such as Atlassian Statuspage or similar
  • Knowledge of incident management and paging tools, such as PagerDuty or similar
  • Knowledge of ITIL (Information Technology Infrastructure Library) processes


Life at PTC is about more than working with today’s most cutting-edge technologies to transform the physical world. It’s about showing up as you are and working alongside some of today’s most talented industry leaders to transform the world around you.

If you share our passion for problem-solving through innovation, you’ll likely become just as passionate about the PTC experience as we are. Are you ready to explore your next career move with us?

We respect the privacy rights of individuals and are committed to handling Personal Information responsibly and in accordance with all applicable privacy and data protection laws. Review our Privacy Policy here.