PySpark Data Engineer | Antal Tech Jobs
4 Days ago

PySpark Data Engineer

Space Exploration & Research, Information Technology
Other
Citi

Overview

We are seeking a highly motivated and intuitive Python Developer to join our dynamic team, focusing on critical data migration and profiling initiatives. The ideal candidate will be a self-starter with strong engineering principles, capable of designing and implementing robust solutions for handling large datasets and complex data flows. This role offers an exciting opportunity to work on challenging projects that drive significant impact within our data ecosystem.

Responsibilities:

  • Develop, test, and deploy high-quality Python code for data migration, data profiling, and data processing.
  • Design and implement scalable solutions for working with large and complex datasets, ensuring data integrity and performance.
  • Utilize PySpark for distributed data processing and analytics on large-scale data platforms.
  • Develop and optimize SQL queries for various database systems, including Oracle, to extract, transform, and load data efficiently.
  • Integrate Python applications with JDBC-compliant databases (e.g., Oracle) for seamless data interaction.
  • Implement data streaming solutions to process real-time or near real-time data efficiently.
  • Perform in-depth data analysis using Python libraries, especially Pandas, to understand data characteristics, identify anomalies, and support profiling efforts.
  • Collaborate with data architects, data engineers, and business stakeholders to understand requirements and translate them into technical specifications.
  • Contribute to the design and architecture of data solutions, ensuring best practices in data management and engineering.
  • Troubleshoot and resolve technical issues related to data pipelines, performance, and data quality.

Qualifications:

  • 4-7 years of relevant experience in the financial services industry.
  • Strong proficiency in Python: Excellent command of Python programming, including object-oriented principles, data structures, and algorithms.
  • PySpark experience: Demonstrated experience with PySpark for big data processing and analysis.
  • Database expertise: Proven experience working with relational databases, specifically Oracle, and connecting applications using JDBC.
  • SQL mastery: Advanced SQL querying skills for complex data extraction, manipulation, and optimization.
  • Big data handling: Experience in working with and processing large datasets efficiently.
  • Data streaming: Familiarity with data streaming concepts and technologies (e.g., Kafka, Spark Streaming) for processing continuous data flows.
  • Data analysis libraries: Proficient in using data analysis libraries such as Pandas for data manipulation and exploration.
  • Software engineering principles: Solid understanding of software engineering best practices, including version control (Git), testing, and code review.
  • Problem-solving: Intuitive problem-solver with a self-starter mindset and the ability to work independently and as part of a team.

Education:

  • Bachelor’s degree/University degree or equivalent experience

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

Preferred Skills & Qualifications (Good to Have):

  • Experience in developing and maintaining reusable Python packages or libraries for data engineering tasks.
  • Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and their data services.
  • Knowledge of data warehousing concepts and ETL/ELT processes.
  • Experience with CI/CD pipelines for automated deployment.

Job Family Group: Technology

Job Family: Applications Development

Time Type: Full time

Most Relevant Skills: Please see the requirements listed above.

Other Relevant Skills: For complementary skills, please see above and/or contact the recruiter.

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.