Overview
Job Title: Data Engineer
Location: Remote – India
Department: Technology – Data Science, Analytics, and Business Intelligence
Contingent Type: Technical
Duration: 6 months
Job Description:
We are looking for a highly skilled and motivated Data Engineer to join our innovative technology team. The ideal candidate will have deep hands-on expertise in building and maintaining data engineering workflows using modern data tools and platforms such as Apache Airflow, PySpark, and Python, within the Cloudera Data Platform (CDP) environment.
This role also requires familiarity with DevOps practices and a keen interest or experience in AI/ML and Generative AI applications.
Key Responsibilities:
- Design and develop scalable data pipelines using Python, PySpark, and Apache Airflow.
- Build and optimize ETL workflows on Cloudera Data Platform (CDP).
- Implement robust data quality checks, monitoring, and alerting mechanisms.
- Ensure data security, governance, and regulatory compliance across all data processes.
- Collaborate with cross-functional teams to gather and deliver on data requirements.
- Troubleshoot and resolve issues in production data pipelines.
- Contribute to the architectural design and scalability of the data platform.
- Support engineering and analytics teams on AI/ML and Generative AI initiatives.
- Automate the deployment and monitoring of data workflows using DevOps tools.
- Stay informed on emerging trends in data engineering, AI/ML, and Gen AI technologies.
Required Skills:
- Strong proficiency in Python and PySpark
- Hands-on experience with Apache Airflow
- In-depth knowledge of Cloudera Data Platform (CDP)
- Understanding of DevOps tools and principles
- Exposure to AI/ML and Generative AI use cases
- Familiarity with data governance, quality, and compliance
Job Type: Contractual / Temporary
Contract length: 6 months
Pay: ₹350,000.00 - ₹380,000.00 per month
Experience:
- Data Engineer: 10 years (Required)
- Python: 10 years (Required)
- Banking: 10 years (Required)
- PySpark: 10 years (Required)
- Apache Airflow: 10 years (Required)
- Cloudera Data Platform: 10 years (Required)
- DevOps: 10 years (Required)
- AI/ML: 10 years (Required)
- GenAI: 10 years (Required)
- Data Governance: 10 years (Required)
- Quality: 10 years (Required)
- Compliance: 10 years (Required)
Work Location: Remote
Application Deadline: 28/07/2025