Overview
· Apache Druid
o Cluster setup, configuration, and production operations
o Real-time and batch ingestion (Kafka, streaming tasks, indexing services)
o Segment management, compaction, retention, and query optimization
o Troubleshooting performance and availability issues
· Trino
o Cluster deployment and tuning for large-scale distributed queries
o Connector configuration (Hive, Iceberg, Delta Lake, JDBC, etc.)
o Query optimization, memory management, and workload isolation
o Security configuration (authentication, authorization, access control)
· Python
o Strong proficiency in Python for automation and backend services
o Writing clean, maintainable, production-grade code
o Building tooling for deployment, monitoring, and operational workflows
o Experience with REST APIs, scripting, and data processing libraries
· Containerization & Orchestration
o Docker image creation and optimization
o Kubernetes deployment, scaling, and troubleshooting
o Helm charts and Kubernetes operators (preferred)
· Infrastructure & CI/CD
o Infrastructure as Code using Terraform, CloudFormation, or similar
o CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.)
o Blue-green and rolling deployment strategies
· Cloud Platforms
o Hands-on experience with AWS, GCP, or Azure
o Networking, storage, and compute optimization for data workloads
o Cost monitoring and optimization
· Monitoring and alerting using Prometheus, Grafana, ELK, OpenTelemetry, or similar
· Log aggregation, metrics, and distributed tracing
· Incident management, root cause analysis, and postmortems
· Capacity planning and performance benchmarking
Pay: ₹946,359.21 - ₹2,500,000.00 per year
Application Question(s):
- Immediate joiners or candidates with a 30-day notice period preferred
Experience:
- Apache Druid: 5 years (Preferred)
Work Location: In person