Overview
Position: Data Engineer - AWS & PySpark
Location: Nagpur/Pune
Type of Employment: Full-Time
Purpose of the Position: You will be a critical member of the InfoCepts Cloud Data Architect Team. We are seeking an experienced Data Engineer with strong expertise in Databricks, PySpark, AWS, and Python to design and deliver scalable data pipelines, high-performance ETL frameworks, and reliable data solutions. The ideal candidate has a solid understanding of distributed data processing, cloud architecture, and modern data engineering best practices.
Key Result Areas and Activities:
Data Engineering & ETL Development
- Design, build, and optimize ETL/ELT pipelines using PySpark/Scala and Databricks in large-scale distributed data environments (a minimal PySpark sketch follows this list).
- Develop reusable data ingestion frameworks, transformation modules, and feature engineering pipelines.
- Ensure high-quality data processing with robust data validation, error handling, and observability.
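As a rough illustration of this kind of pipeline, here is a minimal PySpark sketch covering ingestion, transformation, validation, and a Delta write. The bucket paths, column names, and validation rule are hypothetical placeholders, not references to any actual InfoCepts codebase.

```python
from pyspark.sql import SparkSession, DataFrame
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("orders-etl").getOrCreate()

def ingest(path: str) -> DataFrame:
    # Read raw CSV input; schema handling is kept simple for the sketch.
    return spark.read.option("header", True).csv(path)

def transform(df: DataFrame) -> DataFrame:
    # Basic cleansing: drop rows missing keys and normalize types.
    return (
        df.dropna(subset=["order_id", "amount"])
          .withColumn("amount", F.col("amount").cast("double"))
          .withColumn("order_date", F.to_date("order_date", "yyyy-MM-dd"))
    )

def validate(df: DataFrame) -> DataFrame:
    # Fail fast when a rule is violated (a simple observability hook).
    bad = df.filter(F.col("amount") < 0).count()
    if bad:
        raise ValueError(f"{bad} rows failed the non-negative amount check")
    return df

# Hypothetical S3 locations for illustration only.
raw = ingest("s3://example-bucket/raw/orders/")
(validate(transform(raw))
    .write.format("delta")
    .mode("append")
    .save("s3://example-bucket/curated/orders/"))
```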
Databricks Platform Engineering
- Work extensively with the Databricks Lakehouse platform: clusters, notebooks, Delta Lake, MLflow, jobs, and workflows.
- Implement best practices for Delta Lake, including schema evolution, time travel, vacuuming, Z-Ordering, partitioning, and optimization (see the maintenance sketch after this list).
- Collaborate on job orchestration using Databricks Workflows, the Jobs API, or Airflow.
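As a concrete example of those practices, a short sketch of routine Delta Lake maintenance on Databricks follows. The table path, Z-Order column, version number, and retention window are assumptions made for illustration.

```python
from delta.tables import DeltaTable
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("delta-maintenance").getOrCreate()

table_path = "s3://example-bucket/curated/orders"  # hypothetical location

# Compact small files and co-locate rows on a frequently filtered column.
spark.sql(f"OPTIMIZE delta.`{table_path}` ZORDER BY (customer_id)")

# Remove files no longer referenced by table versions older than 7 days.
DeltaTable.forPath(spark, table_path).vacuum(retentionHours=168)

# Time travel: read the table as it looked at an earlier version.
previous = spark.read.format("delta").option("versionAsOf", 10).load(table_path)

# Schema evolution: appending a DataFrame with a new column merges it
# into the target schema when mergeSchema is enabled.
extra = previous.withColumn("channel", F.lit("backfill"))
(extra.write.format("delta")
    .mode("append")
    .option("mergeSchema", "true")
    .save(table_path))
```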
AWS Cloud Engineering
- Build and maintain data pipelines leveraging AWS services such as S3, Glue, Lambda, IAM, Step Functions, Athena, Redshift or Snowflake, and CloudWatch.
- Implement secure data architectures following IAM, networking, encryption, and cost-optimized design principles.
- Integrate Databricks with AWS data sources and event-driven systems (an ingestion sketch follows this list).
- Working knowledge of open table formats (OTFs) such as Delta Lake and Apache Iceberg.
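One way such an integration commonly looks in practice is Databricks Auto Loader picking up new JSON files as they land in S3 (event-driven when file notification mode is configured on the bucket). The bucket names and checkpoint location below are placeholders.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("s3-event-ingest").getOrCreate()

# Hypothetical bucket and path names for illustration.
source = "s3://example-events-bucket/landing/"
target = "s3://example-lake-bucket/bronze/events/"
checkpoint = "s3://example-lake-bucket/_checkpoints/events/"

# Auto Loader ("cloudFiles") incrementally discovers new files in S3.
stream = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", checkpoint)
    .load(source)
)

# Write to a bronze Delta table; availableNow processes the backlog and
# stops, so the same job can run on a schedule or be triggered by events.
(stream.writeStream.format("delta")
    .option("checkpointLocation", checkpoint)
    .trigger(availableNow=True)
    .start(target))
```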
Programming & Data Processing
- Write high-quality, production-grade Python code (modular, optimized, reusable).
- Develop PySpark jobs for batch and near real-time data transformations.
- Optimize Spark performance (partitions, broadcast variables, caching, cluster tuning); a tuning sketch follows this list.
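A brief sketch of common tuning levers, using hypothetical fact and dimension tables: broadcasting the small side of a join to avoid a shuffle, caching a DataFrame reused by several aggregations, and repartitioning to control output file counts.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("tuning-demo").getOrCreate()

# Hypothetical tables: a large fact and a small dimension.
facts = spark.read.format("delta").load("s3://example-bucket/curated/orders")
dims = spark.read.format("delta").load("s3://example-bucket/curated/customers")

# Broadcast the small dimension so the join avoids shuffling the fact table.
joined = facts.join(F.broadcast(dims), "customer_id")

# Cache once, since two aggregations below reuse the same joined data.
joined.cache()

daily = joined.groupBy("order_date").agg(F.sum("amount").alias("revenue"))
by_region = joined.groupBy("region").agg(F.count("*").alias("orders"))

# Repartition before writing to control the number of output files.
daily.repartition(8).write.format("delta").mode("overwrite").save(
    "s3://example-bucket/gold/daily_revenue"
)
by_region.repartition(4).write.format("delta").mode("overwrite").save(
    "s3://example-bucket/gold/orders_by_region"
)
```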
Data Architecture, Governance & Quality
- Contribute to the design of data models, storage layers, and data lifecycle management.
- Implement best practices for data governance, metadata management, and lineage tracking.
- Ensure data reliability, performance, and accuracy across multiple environments (a data-quality sketch follows this list).
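As one hedged example of what ensuring accuracy can look like in code, a minimal rule-based quality check in PySpark follows; the rules, columns, and table location are hypothetical, and a production version would feed alerting and lineage tooling rather than simply raising.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("dq-checks").getOrCreate()

df = spark.read.format("delta").load("s3://example-bucket/curated/orders")

# Rule set: each entry maps a rule name to a per-row boolean condition.
rules = {
    "order_id_not_null": F.col("order_id").isNotNull(),
    "amount_non_negative": F.col("amount") >= 0,
}

# Count violations for every rule in a single pass over the data.
failures = df.agg(*[
    F.sum(F.when(~cond, 1).otherwise(0)).alias(name)
    for name, cond in rules.items()
]).first().asDict()

for name, count in failures.items():
    if count:
        raise ValueError(f"Data quality rule '{name}' failed for {count} rows")
```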
Cross-Functional Collaboration
- Partner with analysts, data scientists, product teams, and business stakeholders to understand requirements.
- Document workflows, maintain Git-based version control, and participate in architecture reviews.
- Support production pipelines, troubleshoot issues, and continuously enhance system performance.