Overview
About the job:
Were looking for a Data Platform Engineer to take end-to-end ownership of large-scale data pipelines and distributed systems powering a next-generation Analytics platform.You will design, build, and operate production-grade data pipelines, working across Spark, cloud infrastructure, APIs, and graph systems to ensure high-throughput, low-latency, and reliable data processing at scale.
What Youll OwnData Pipeline & Platform EngineeringDesign and build end-to-end data pipelines (batch + streaming)Own large-scale Apache Spark workloads and distributed data processingImplement data ingestion transformation serving layersManage schema evolution, data contracts, and pipeline reliability
Distributed Systems & ScaleWork on systems handling high-volume graph datasets (entities + relationships)Optimize for latency, throughput, and fault toleranceDesign scalable architectures using Kafka / Spark / Flink / Beam
Cloud & InfrastructureDeploy and operate systems on GCP / AWS (GKE, Dataproc, Cloud Run, etc.)Build and maintain CI/CD pipelines for data and microservicesUse Docker, Kubernetes, Terraform for infrastructure automation
Data Reliability & ObservabilityImplement data quality checks, monitoring, and alertingEnsure data integrity across pipelines and servicesBuild systems to detect drift, inconsistencies, and failures in production
APIs & System IntegrationWork with GraphQL / REST / gRPC APIs for data access layersEnsure seamless integration between data systems and application layers
What Were Looking ForYou have atleast 4 years in Data Engineering / Platform Engineering / Distributed SystemsStrong hands-on experience with: Apache Spark / Distributed data processing, Cloud platforms (GCP or AWS), Streaming systems (Kafka / Flink / Beam)Solid programming skills in Python / Java / Scala / Node.jsExperience building and owning production data pipelines end-to-endUnderstanding of: Microservices architecture, Data modeling & large-scale system designAbility to debug and optimize systems in real production environments
Why This Role is DifferentYou own systems, not just componentsYou work on real scale (millions billions of data points)You solve distributed systems + graph + real-time problemsYou operate close to production impact, not isolated dev workYou influence architecture from day one in an early-stage environment
What Youll GetHigh ownership, low bureaucracy environmentWork on cutting-edge graph + AI-driven data systemsExposure to complex, real-world data problems (fraud, risk, intelligence)Fast growth with direct impact on core platform architecture
Who can apply:
- have minimum 4 years of experience
- are Computer Science Engineering students
Only those candidates can apply who:
Salary:
Competitive salary
Experience:
4 year(s)
Deadline:
2026-11-05 23:59:59