Free cookie consent management tool by TermsFeed Data Engineer - AWS & Pyspark | Antal Tech Jobs
Back to Jobs
1 Day ago

Data Engineer - AWS & Pyspark

decor
Chennai, Tamil Nadu, India
Information Technology
Full-Time
InfoCepts

Overview

Position: Data Engineer - AWS & Pyspark

Location: Nagpur/Pune

Type of Employment: Full-Time

Purpose of the Position:  You will be a critical member of the InfoCepts Cloud Data Architect Team. We are seeking an experienced Data Engineer with strong expertise in Databricks, PySpark, AWS, and Python to design and deliver scalable data pipelines, high-performance ETL frameworks, and reliable data solutions. The ideal candidate has a solid understanding of distributed data processing, cloud architecture, and modern data engineering best practices. 

Key Result Areas and Activities:

Data Engineering & ETL Development 

  • Design, build, and optimize ETL/ELT pipelines using PySpark/Scala and Databricks on large-scale distributed data environments. 
  • Develop reusable data ingestion frameworks, transformation modules, and feature engineering pipelines. 
  • Ensure high-quality data processing with robust data validation, error handling, and observability. 

Databricks Platform Engineering 

  • Work extensively with the Databricks Lakehouse platform?clusters, notebooks, Delta Lake, MLflow, jobs, and workflows. 
  • Implement best practices for Delta Lake, including schema evolution, time-travel, vacuuming, ZOrdering, partitioning, and optimization. 
  • Collaborate on job orchestration using Databricks Workflows, Jobs API, or Airflow

 AWS Cloud Engineering 

  • Build and maintain data pipelines leveraging AWS services such as:  
  • S3, Glue, Lambda, IAM, Step Functions, Athena, Redshift or Snowflake, CloudWatch 
  • Implement secure data architectures following IAM, networking, encryption, and costoptimized design principles. 
  • Integrate Databricks with AWS data sources and event-driven systems. 
  • Working knowledge of OTF like Delta and Iceberg 

Programming & Data Processing 

  • Write high-quality, production-grade Python code (modular, optimized, reusable). 
  • Develop PySpark jobs for batch and near real-time data transformations. 
  • Optimize Spark performance (partitions, broadcast variables, caching, cluster tuning). 

Data Architecture, Governance & Quality 

  • Contribute to the design of data models, storage layers, and data lifecycle management. 
  • Implement best practices for data governance, metadata management, and lineage tracking. 
  • Ensure data reliability, performance, and accuracy across multiple environments. 

Cross-Functional Collaboration  

  • Partner with analysts, data scientists, product teams, and business stakeholders to understand requirements. 
  • Document workflows, maintain Git-based version control, and participate in architecture reviews. 
  • Support production pipelines, troubleshoot issues, and continuously enhance system performance. 
Share job
Similar Jobs
View All
14 Minutes ago
AI Engineer/Architect
AI & Machine Learning Advancement
  • 5 - 8 Yrs
  • Anywhere in India/Multiple Locations
Role Overview We are seeking an experienced AI Architect to design and govern end‑to‑end AI and ML architectures across a variety of enterprise use cases (e.g., prediction, personalization, recommendation, anomaly detection, automation). The ideal c...
decor
1 Day ago
Software Engineer, Cloud - Sustaining Engineering
Information Technology
  • Chennai, Tamil Nadu, India
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, en...
decor
1 Day ago
ALLEN Overseas - Business Analyst
Information Technology
  • Chennai, Tamil Nadu, India
Job Description : Business Analyst - Business & Operations (Middle East)Position DetailsJob Title : Business Analyst - Business & Operations (Middle East)Reports To : Head - Business & OperationsLocation : Dubai (with occasional travel to India and o...
decor
1 Day ago
Data Engineer + Python + Pyspark + SQL + AWS
Information Technology
  • Chennai, Tamil Nadu, India
Position DescriptionFounded in 1976, CGI is among the largest independent IT and business consulting services firms in the world. With 94,000 consultants and professionals across the globe, CGI delivers an end-to-end portfolio of capabilities, from s...
decor
1 Day ago
Technical Specialist - Python Developer
Information Technology
  • Chennai, Tamil Nadu, India
Job DescriptionTitle: Senior Python DeveloperLocation: Hyderabad/Mumbai/BengaluruExp Level: 8+Education: Any DegreeKey Responsibilities:Design, develop, and maintain scalable, secure, and high-performance applications using Python, PHP, and JavaScrip...
decor
1 Day ago
Senior Software Engineer
Information Technology
  • Chennai, Tamil Nadu, India
Are you excited by the idea of joining a high-quality, talented team within a game-changing FinTech company?Do you want to tackle major problems in the consumer credit industry with a socially-responsible solution? If so, we'd love to hear from you!A...
decor
1 Day ago
Lead Software Engineer
Information Technology
  • Chennai, Tamil Nadu, India
🚀 Hiring: Lead – Software Engineer (Python/Django)📍 Location: Pune onsite💼 Experience: 5-6+ Years🔎 About the RoleWe are seeking an experienced Lead – Software Engineer to drive the design, development, and scalability of high-performance web applicat...
decor
1 Day ago
Senior Python Developer
Information Technology
  • Chennai, Tamil Nadu, India
OverviewCACTUS is a remote-first organization and we embrace an accelerate from anywhere culture. You may be required to travel to our Mumbai office based on business requirements or for company/team events.You will be a part of Cactus Labs which is ...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media