Overview
Position Summary:
As a Senior Data Engineer, you will play a pivotal role in designing, developing, and optimizing data pipelines and workflows that support large-scale data processing using Apache Spark (PySpark) and advanced SQL techniques. You will work closely with data analysts, data scientists, and platform engineers to build reliable and scalable data infrastructure. Your focus will be on transforming raw data into actionable insights while ensuring data quality, security, and performance.
Key Responsibilities:
Data Pipeline Development & Optimization:
- Build scalable and efficient ETL/ELT pipelines using PySpark, handling batch and real-time data workloads (a minimal sketch of such a pipeline follows this list).
- Write advanced SQL queries to cleanse, aggregate, and transform large datasets for analytical and operational use cases.
- Optimize performance of data processing jobs using Spark configurations, partitioning, and caching strategies.
- Maintain and improve existing data pipelines, refactoring code and improving performance as needed.
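To make the scope of this work concrete, here is a minimal sketch of the kind of batch pipeline involved, showing cleansing, caching, and partitioned output. The bucket paths, table layout, and column names are illustrative assumptions, not part of any actual stack.

```python
# Illustrative only: a minimal PySpark batch job with deduplication,
# caching, and partitioned Parquet output. Paths and columns are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("orders_daily_etl").getOrCreate()

# Read raw events (hypothetical S3 path), drop malformed rows, and deduplicate.
raw = spark.read.parquet("s3://example-bucket/raw/orders/")
clean = (
    raw.dropna(subset=["order_id", "order_ts"])
       .dropDuplicates(["order_id"])
)

# Cache the cleansed frame because it feeds both the count below and the aggregate.
clean.cache()
print(f"clean rows: {clean.count()}")

daily_totals = (
    clean.groupBy(F.to_date("order_ts").alias("order_date"))
         .agg(F.sum("amount").alias("total_amount"),
              F.countDistinct("customer_id").alias("unique_customers"))
)

# Repartition by date before writing so each date maps to one output partition.
(daily_totals
    .repartition("order_date")
    .write.mode("overwrite")
    .partitionBy("order_date")
    .parquet("s3://example-bucket/curated/orders_daily/"))
```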
Data Architecture & Integration:
- Work with structured, semi-structured (JSON, Parquet, Avro), and unstructured data.
- Design and implement data models and schemas in cloud data warehouses (e.g., Snowflake, BigQuery, Redshift) and data lakes.
- Integrate data from various sources including relational databases (MySQL, PostgreSQL), APIs, streaming platforms (Kafka, Kinesis), and external data providers.
Collaboration & Strategy:
- Collaborate with data scientists and BI analysts to understand data needs and design solutions that meet performance and scalability requirements.
- Partner with DevOps and platform teams to deploy data pipelines and workflows using orchestration tools such as Apache Airflow or Prefect (an illustrative Airflow sketch follows this list).
- Participate in design reviews, code reviews, and engineering discussions to contribute to the team's best practices and technical direction.
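As an example of the orchestration work described above, here is a minimal Airflow DAG that schedules a nightly Spark job. The DAG id, script path, and schedule are hypothetical placeholders, not a prescribed setup.

```python
# Illustrative only: a minimal Airflow 2.x DAG that runs a nightly spark-submit.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="orders_daily_etl",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    run_spark_job = BashOperator(
        task_id="run_spark_job",
        # spark-submit against the pipeline script; cluster configs omitted for brevity.
        bash_command="spark-submit /opt/pipelines/orders_daily_etl.py",
    )
```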
Monitoring, Testing & Governance:
- Monitor data pipeline performance, failures, and system health using logging and alerting tools.
- Implement data quality checks, testing strategies (unit, integration), and lineage tracking (a simple quality-check sketch follows this list).
- Support data governance, compliance, and documentation efforts by enforcing standards and creating metadata catalogs.
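For illustration, a simple post-load quality check might look like the sketch below; the table path, column names, and failure thresholds are assumptions made for the example.

```python
# Illustrative only: row-level quality checks run after a load completes.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("orders_quality_check").getOrCreate()

df = spark.read.parquet("s3://example-bucket/curated/orders_daily/")

# Fail loudly if the load produced no rows or key columns contain nulls.
row_count = df.count()
null_dates = df.filter(F.col("order_date").isNull()).count()

if row_count == 0:
    raise ValueError("Quality check failed: curated table is empty")
if null_dates > 0:
    raise ValueError(f"Quality check failed: {null_dates} rows with null order_date")
```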
Required Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, Information Systems, or a related technical field.
- 6+ years of professional experience in a Data Engineering role.
- Strong programming skills in Python with expertise in PySpark for distributed data processing.
- Advanced SQL skills: ability to write complex joins, CTEs, window functions, and performance-optimized queries (see the illustrative query after this list).
- Hands-on experience with Apache Spark in a cloud environment (Databricks, AWS EMR, GCP Dataproc, Azure Synapse).
- Familiarity with data lake architectures and cloud storage (S3, GCS, Azure Blob).
- Experience with version control systems (e.g., Git), CI/CD pipelines, and working in Agile/Scrum environments.
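To illustrate the level of SQL expected, here is the kind of query involved, using a CTE and a window function and run through spark.sql(). The orders table and its columns are hypothetical, and the query assumes that table is already registered in the catalog or as a temp view.

```python
# Illustrative only: CTE plus window function, executed via spark.sql().
# Assumes an "orders" table or temp view is already registered.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("sql_example").getOrCreate()

query = """
WITH daily AS (                      -- CTE: aggregate orders per customer per day
    SELECT customer_id,
           CAST(order_ts AS DATE) AS order_date,
           SUM(amount)            AS daily_amount
    FROM orders
    GROUP BY customer_id, CAST(order_ts AS DATE)
)
SELECT customer_id,
       order_date,
       daily_amount,
       -- window function: 7-day running total per customer
       SUM(daily_amount) OVER (
           PARTITION BY customer_id
           ORDER BY order_date
           ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
       ) AS rolling_7d_amount
FROM daily
"""

result = spark.sql(query)
result.show()
```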
Preferred Qualifications:
- Experience with streaming technologies (Kafka, Spark Structured Streaming, Flink).
- Background in data modeling, data warehouse design, or data lakehouse architecture.
- Knowledge of cloud infrastructure and services (AWS, GCP, Azure).
- Familiarity with containerization tools (Docker, Kubernetes).
- Understanding of data governance, security, and privacy frameworks (GDPR, HIPAA, etc.).
Soft Skills:
- Excellent problem-solving and critical-thinking abilities.
- Strong communication skills and the ability to explain technical concepts to non-technical stakeholders.
- Ability to work independently and as part of a cross-functional team.
- Eagerness to learn and adapt to new technologies and methodologies.
Job Types: Full-time, Permanent
Pay: ₹560,330.55 - ₹1,941,152.15 per year
Schedule:
- Monday to Friday
Work Location: In person