Overview
Title: Senior Data Engineer l AWS Glue & Kafka lAbout the role:
We’re seeking a hands-on Senior Data Engineer (:10 years’ experience) to deliver production data pipelines on AWS. You’ll design and build streaming (Kafka) and batch pipelines using Glue/EMR (PySpark), implement data contracts and quality gates, and set up CI/CD and observability. You’ve shipped real systems, coached teams, and you document as you go.
Requirements
What you’ll do:
• Architect and deliver lake/lakehouse data flows on AWS (S3 + Glue + Glue ETL/EMR).
• Build Kafka consumers/producers, manage schema evolution, resilience, and DLQs.
• Implement PySpark transformations, CDC merges, partitioning and optimization.
• Add quality/observability (tests, monitoring, alerting, lineage basics).
• Harden security (IAM least privilege, KMS, private networking).
• Create runbooks, diagrams, and handover materials.
What you’ll bring:
• Deep AWS (Glue, RDS. S3, EMR, IAM/KMS, CloudWatch).
• Strong Kafka (MSK/Confluent, schema registry, consumer group tuning).
• Python/PySpark in production with tests and CI/CD.
• Data modeling (bronze/silver/gold, CDC, SCD2) and data contracts.
• IaC (Terraform/CDK) and cost/performance tuning experience.
• Clear communication and stakeholder engagement.
Benefits
- Work on cutting-edge technologies and impactful projects.
- Opportunities for career growth and development.
- Collaborative and inclusive work environment.
- Competitive salary and benefits package.