Back to Jobs
2 Days ago
Senior Data Engineer (Pyspark, Hadoop, Scala, Hive)- Assistant Vice President
Chennai, Tamil Nadu, India
Information Technology
Full-Time
Citi
Overview
Senior Data Engineer
We are seeking a highly skilled and motivated Senior Data Engineer to design, develop, and implement cutting-edge data engineering solutions using modern big data and cloud technologies. In this role, you will collaborate with product owners, data scientists, analysts, and technologists to deliver scalable, high-performance data products in an agile and collaborative environment. You will also play a key role in migrating legacy workloads to the cloud, optimizing data pipelines, and mentoring team members on best practices in data engineering.
Key Responsibilities
------------------------------------------------------
Job Family Group:
Technology
------------------------------------------------------
Job Family:
Data Science
------------------------------------------------------
Time Type:
Full time
------------------------------------------------------
Most Relevant Skills
Please see the requirements listed above.
------------------------------------------------------
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
We are seeking a highly skilled and motivated Senior Data Engineer to design, develop, and implement cutting-edge data engineering solutions using modern big data and cloud technologies. In this role, you will collaborate with product owners, data scientists, analysts, and technologists to deliver scalable, high-performance data products in an agile and collaborative environment. You will also play a key role in migrating legacy workloads to the cloud, optimizing data pipelines, and mentoring team members on best practices in data engineering.
Key Responsibilities
- Design and develop scalable big data solutions using platforms like Hadoop, Snowflake, or other modern data ecosystems.
- Collaborate with domain experts, product managers, analysts, and data scientists to build robust and efficient data pipelines.
- Lead the migration of legacy workloads to cloud platforms (AWS, Azure, or GCP) while ensuring seamless integration and optimization.
- Develop and implement cloud-native solutions for data processing and storage.
- Partner with data scientists to build data pipelines from heterogeneous sources and provide engineering support for data science applications.
- Enable advanced analytics and machine learning workflows by delivering high-quality data pipelines.
- Implement CI/CD pipelines to automate data engineering workflows across cloud and on-premises platforms.
- Drive automation to improve efficiency and reduce manual intervention in data processes.
- Research and evaluate open-source technologies and recommend their integration into the data platform to enhance functionality and scalability.
- Act as a technical expert and mentor team members on big data and cloud technologies.
- Define and enforce coding standards, reusable components, and consistent patterns for data engineering processes.
- Convert SAS-based pipelines into modern frameworks like PySpark, Scala, or Java for execution on Hadoop and non-Hadoop ecosystems.
- Optimize big data applications for performance and scalability across platforms.
- Analyze evolving business requirements and recommend enhancements or alternatives to current systems.
- Evaluate new IT developments and industry standards to ensure the data platform remains cutting-edge.
- Foster a collaborative and high-performing team environment.
- Ensure compliance with applicable laws, regulations, and organizational policies.
- Apply sound ethical judgment and escalate control issues transparently.
- 8+ years of experience with Hadoop (Cloudera) and big data technologies.
- Advanced knowledge of the Hadoop ecosystem, including HDFS, MapReduce, Hive, Pig, Impala, Spark, Kafka, Kudu, and Solr.
- Proficiency in Java, Python, or Scala.
- Hands-on experience with Spark programming (PySpark, Scala, or Java).
- Familiarity with Apache Beam is a plus.
- Experience with cloud platforms like AWS, Azure, or GCP.
- Proven ability to deploy and manage data solutions on cloud platforms.
- Expertise in designing and developing data pipelines for ingestion, transformation, and processing.
- Experience with Snowflake or Delta Lake is a strong advantage.
- Hands-on experience with containerization tools like Docker and Kubernetes.
- Proficiency in DevOps practices, including source control, CI/CD, and automated deployments.
- Experience with Python libraries for machine learning and data science workflows.
- Strong knowledge of data structures, algorithms, distributed storage, and compute systems.
- 1+ year of SAS experience preferred.
- 1+ year of Hadoop administration experience preferred.
- Strong problem-solving and analytical skills.
- Excellent interpersonal and teamwork abilities.
- Proven leadership experience, including mentoring and managing a team of data engineers and analysts.
- A proactive, "can-do" attitude for solving complex business problems.
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
------------------------------------------------------
Job Family Group:
Technology
------------------------------------------------------
Job Family:
Data Science
------------------------------------------------------
Time Type:
Full time
------------------------------------------------------
Most Relevant Skills
Please see the requirements listed above.
------------------------------------------------------
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in