Noida, Uttar Pradesh, India
Space Exploration & Research, Information Technology
Full-Time
MontyCloud
Overview
Role Overview:
As a Senior Data Engineer, you will play a key role in designing, building, and operating data pipelines and systems that enable both traditional analytics and AI/ML-driven products. You will work hands-on with modern tools and frameworks that form the backbone of the AI data stack.
Key Responsibilities:
- Design, implement, and maintain high-quality data pipelines and ETL/ELT workflows, supporting both batch and streaming use cases.
- Build and maintain scalable data models and schemas to support analytics, AI/ML feature engineering, and knowledge management.
- Develop and maintain secure connectors and data onboarding pipelines for customer-provided data streams.
- Contribute to frameworks for contextualizing data and reconstructing event timelines to support intelligent automation and analytics.
- Collaborate with data scientists, ML engineers, and platform teams to operationalize data for AI/ML use cases.
- Leverage modern data engineering tools (e.g., dbt, Airflow, Kafka, Spark) and experiment with new components from the AI data stack (e.g., feature stores, vector DBs).
- Apply best practices in data quality, governance, and security for multi-tenant, cloud-native environments.
- Contribute to code reviews, technical documentation, and mentorship within the team.
Desired Skills
Must Have
- Hands-on experience with cloud-based data engineering (AWS preferred), including data lakes, warehouses, and streaming technologies.
- Strong programming skills (Python, Scala, or Java) and practical experience with modern data frameworks (Spark, Kafka, Airflow, dbt).
- Exposure to operationalizing data for AI/ML, including data preparation, feature engineering, or ML pipeline integration.
- Familiarity with at least some components of the modern AI data stack (feature stores, vector DBs, ML ops tools, etc.).
- Exposure to open table formats such as Apache Iceberg, Delta Lake, or Hudi in cloud data lake/lakehouse environments.
- Familiarity with data privacy standards (e.g., GDPR) and experience supporting data anonymization in production environments.
- Experience with data governance, security, and compliance in cloud environments.
Good to Have
- Experience with metadata-driven or semantic data systems.
- Prior work in SaaS, multi-tenant, or multi-cloud environments.
- Supporting internal and external consumers with data-driven and AI-enabled solutions.
Experience
- 6+ years of experience in data engineering or related roles.
Education
- Bachelor’s or master’s degree in computer science, Engineering, or related field (preferred).
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in