Overview
This is a remote position.
About Jetbro
Jetbro is a technology consulting and systems integration firm that builds reliable, scalable digital systems for organizations where technology is central to everyday operations.
We specialize in infrastructure-led digital transformation, enterprise modernization, observability engineering, and mission-critical system design. Our philosophy is simple: systems should work quietly, predictably, and under pressure. We value clarity over complexity and reliability over noise.
This role is part of a structured monitoring audit and advisory engagement for a large enterprise client running Mendix applications on Kubernetes.
About the Role
We are looking for a DevOps Engineer with 3+ years of hands-on experience in Kubernetes-based production environments, especially in monitoring and observability systems.
This engagement is focused on evaluating and strengthening an existing observability stack built around:
- Prometheus
- Grafana
- Kibana
- Loki
- Kubernetes
- PostgreSQL
This is not a greenfield setup. The infrastructure already exists. Your role is to:
- Audit what is configured
- Identify coverage gaps
- Recommend improvements
- Define KPI and SLO thresholds
- Improve alerting systems
- Suggest architectural optimizations
You will work closely with Jetbro’s Architect and Project Lead. This role requires structured thinking, documentation clarity, and production maturity.
Key Responsibilities
- Audit Prometheus scrape targets, exporters, and metric endpoints
- Review Grafana dashboards, alert rules, and data sources
- Assess log coverage across Kibana and Loki
- Map monitoring coverage across application, infrastructure, database, ingress, and platform layers
- Identify missing exporters, stale dashboards, broken panels, and alert gaps
- Analyze historical metrics to establish performance baselines
- Define SLOs, KPIs, warning thresholds, and breach thresholds
- Suggest Prometheus alert rules and Alertmanager routing strategies
- Implement KPI and SLO alerts within Grafana alert management
- Evaluate Kubernetes cluster topology and infrastructure usage patterns
- Recommend architecture optimizations based on observed load and behavior
- Document findings in structured audit and advisory reports
- Participate in weekly syncs and structured handover sessions
Requirements
Mandatory Requirements
- 3+ years of experience in DevOps or Platform Engineering
- Strong hands-on experience with Kubernetes production environments
- Experience working with Prometheus for metrics collection and alerting
- Experience configuring and reviewing Grafana dashboards and alerts
- Exposure to log management systems such as Kibana or Loki
- Strong understanding of observability across application, infra, DB, and ingress layers
- Experience defining or working with KPIs and SLOs
- Experience analyzing historical performance data
- Ability to troubleshoot production-level monitoring gaps
- Strong documentation and communication skills
Good To Have
- Experience with Mendix environments
- Experience monitoring JVM-based applications
- PostgreSQL performance monitoring exposure
- Experience with Alertmanager configuration
- Experience in audit or advisory-style engagements
Exposure to enterprise Kubernetes architectures
What we Care About
- Structured audit thinking over random checks
- Production maturity over theoretical knowledge
- Data-driven thresholding over guesswork
- Clear documentation over informal debugging
- Advisory mindset over task-based execution
This is a freelance role. We expect clear ownership, consistent availability for agreed hours, and strong delivery discipline.
Benefits
Benefits
- A chance to work on a greenfield project and influence architectural decisions.
- Competitive compensation and benefits.
- Flexible work environment (remote or hybrid options available).
- A collaborative and innovative team culture.