Free cookie consent management tool by TermsFeed Senior Big Data Engineer - Assistant Vice President | Antal Tech Jobs
Back to Jobs
1 Week ago

Senior Big Data Engineer - Assistant Vice President

decor
Bangalore, Karnataka, India
Information Technology
Full-Time
Citi

Overview

Big Data Engineer (PySpark & Apache Airflow)

Role Overview:

We are actively seeking a highly skilled and dedicated Big Data Engineer specializing in PySpark and Apache Airflow to enhance our data platform capabilities. This critical role involves designing, developing, and orchestrating complex data pipelines that underpin our advanced analytics and machine learning initiatives. You will be responsible for leveraging PySpark for efficient data processing and utilizing Apache Airflow for robust workflow management, ensuring data quality, reliability, and scalability across our large-scale datasets.

Key Responsibilities:

  • Design, develop, and maintain robust, scalable, and efficient big data pipelines primarily using PySpark for data ingestion, transformation, and processing.
  • Implement and manage data workflows using Apache Airflow, including designing DAGs (Directed Acyclic Graphs), configuring operators, and optimizing task dependencies for reliable and scheduled data pipeline execution.
  • Optimize PySpark jobs and data workflows for performance, cost-efficiency, and resource utilization across distributed computing environments.
  • Collaborate closely with data scientists, AI/ML engineers, and other stakeholders to translate analytical and machine learning requirements into highly performant and automated data solutions.
  • Develop and implement data quality checks, validation rules, and monitoring mechanisms within PySpark jobs and Airflow DAGs to ensure data integrity and consistency.
  • Troubleshoot, debug, and resolve issues in PySpark code and Airflow pipeline failures, ensuring high availability and reliability of data assets.
  • Contribute to the architecture and evolution of our data platform, advocating for best practices in data engineering, automation, and operational excellence.
  • Ensure data security, privacy, and compliance throughout the data lifecycle within the pipelines.

Required Skills and Qualifications:

  • 7+ Years of Expert-level proficiency in PySpark for building and optimizing large-scale data processing applications.
  • Strong hands-on experience with Apache Airflow, including DAG development, custom operators/sensors, connections, and deployment strategies.
  • Proven experience in designing, building, and operating production-grade distributed data pipelines.
  • Solid understanding of big data architectures, distributed computing principles, and data warehousing concepts.
  • Proficiency in data modeling, schema design, and various data storage formats (e.g., Parquet, ORC, Delta Lake).
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP), specifically their big data services (e.g., EMR, Databricks, HDInsight, Dataflow) and object storage (S3, ADLS, GCS).
  • Demonstrated experience with version control systems, particularly Git.
  • Excellent problem-solving, analytical, and debugging skills.
  • Ability to work effectively both independently and as part of a collaborative, agile team.

Desired (Plus) Skills:

  • Experience with containerization technologies (e.g., Docker, Kubernetes) for deploying PySpark applications or Airflow.
  • Familiarity with CI/CD practices for data pipelines.
  • Understanding of machine learning concepts and experience with data preparation for AI/ML models.
  • Knowledge of other orchestration tools or workflow managers.

Education:

  • Bachelor’s degree/University degree or equivalent experience

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.

Share job
Similar Jobs
View All
16 Hours ago
Principal Engineer - UI/UX
Information Technology
  • 6 - 11 Yrs
  • Mumbai (All Areas)
looking for a technically savvy and experienced Principal Engineer to take up front-end development efforts. You will design and develop elegant interfaces that exceed client expectations in terms of value and benefit. You will collaborate on scalabi...
decor
16 Hours ago
Principal Engineer - Beckend
Information Technology
  • 7 - 11 Yrs
  • Mumbai (All Areas)
Principal Engineer - Backend Mumbai, Maharashtra, India | Product Engineering | Full-time We are looking for a technically savvy and experienced senior developer to lead development efforts. You will help the team grow in size and skills, optim...
decor
16 Hours ago
Principal Engineer - DevOps/DBA
Information Technology
  • 7 - 11 Yrs
  • Mumbai (All Areas)
we are looking for a technically savvy and passionate Principal DevOps Engineer or Senior Database Administrator to cater to the development and operations efforts in product. You will choose and deploy tools and technologies to build and support a r...
decor
18 Hours ago
C++ Developer | 5 - 8 Years
Information Technology
  • 30 - 45 INR - Yearly
  • 5 - 12 Yrs
  • Pune
About the Client: The company has been a global leader in delivering cutting-edge in-flight entertainment and connectivity (IFEC) solutions for over 40 years. About the Role: Title: SDE 3 C++ - 5 to 8 Years SDE 4 C++ - 8 to 13 Years ...
decor
2 Days ago
Business Analyst(GRC Domain) Internship in Hyderabad
Information Technology
  • Gurugram, Haryana, India
Selected Intern's Day-to-day Responsibilities Include Gather, analyze, and document business requirements. Translate business needs into functional specifications and user stories. Support in preparing business requirement documents (BRD), functi...
decor
2 Days ago
Creditsafe - Data Test Engineer
Information Technology
  • Gurugram, Haryana, India
We are looking for a Test Engineer who will become part of our team building and testing the Creditsafe data.You will be working closely with the database teams and data engineering to build specific systems facilitating the extraction and transform...
decor
2 Days ago
Business Advisory Analyst
Information Technology
  • Gurugram, Haryana, India
Skill required: Finance & Accounting - Budgeting and ForecastingDesignation: Business Advisory AnalystQualifications:Any GraduationYears of Experience:3 to 5 yearsAbout AccentureAccenture is a global professional services company with leading capabi...
decor
2 Days ago
IN_Senior Associate_.Net Developer_Emerging Businesses_Advisory_Gurugram
Information Technology
  • Gurugram, Haryana, India
Line of ServiceAdvisory Industry/SectorNot Applicable SpecialismEmerging Technologies Management LevelSenior Associate Job Description & SummaryAt PwC, our people in software and product innovation focus on developing cutting-edge software solutions...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media