
Overview
Position Description:
At CGI, we're a team of builders. We call our employees members because all who join CGI are building their own company - one that has grown to 72,000 professionals located in 40 countries. Founded in 1976, CGI is a leading IT and business process services firm committed to helping clients succeed. We have the global resources, expertise, stability and dedicated professionals needed to achieve results for our clients - and for our members. Come grow with us. Learn more at www.cgi.com.
This is a great opportunity to join a winning team. CGI offers a competitive compensation package with opportunities for growth and professional development. Benefits for full-time, permanent members start on the first day of employment and include a paid time-off program and profit participation and stock purchase plans. We wish to thank all applicants for their interest and effort in applying for this position; however, only candidates selected for interviews will be contacted. No unsolicited agency referrals, please.
Job Title: Software Engineer - Python ETL / Airflow
Position: Software Engineer
Experience: 3+ Years
Category: Development
Location: Chennai / Bangalore / Hyderabad / Pune
Position ID: J0125-1336
Employment Type: Full Time
Your future duties and responsibilities:
1. ETL Development:
o Design, develop, and optimize scalable ETL workflows and pipelines using Python and Apache Airflow.
o Handle data extraction, transformation, and loading from various sources into GCP services like Cloud Storage and BigQuery.
2. Airflow Implementation & Management:
o Set up, configure, and maintain Apache Airflow in production, preferably on GCP (Composer or on a VM).
o Develop dynamic and reusable DAGs and workflows for complex ETL processes.
o Monitor and debug Airflow workflows to ensure reliability and performance.
3. GCP Integration:
o Utilize GCP services for ETL processes, including BigQuery, Cloud Storage, Dataflow, and Pub/Sub.
o Implement and manage secure connections to GCP services using IAM roles and service accounts.
4. Optimization & Scaling:
o Handle large-scale data processing efficiently, optimizing for performance and cost.
o Automate data quality checks, error handling, and retry mechanisms in workflows.
5. Collaboration & Documentation:
o Collaborate with architects, analysts, and other stakeholders to understand ETL requirements.
o Document workflows, configurations, and best practices for Airflow pipelines.
Required qualifications to be successful in this role:
Required Skills & Qualifications:
Technical Skills:
o Strong proficiency in Python programming with experience in handling large datasets.
o In-depth knowledge of Apache Airflow in production environments.
o Familiarity with Airflow operators, hooks, XComs, task dependencies, and dynamic DAG creation.
o Hands-on experience with GCP services like BigQuery, Cloud Storage, Dataflow, and Cloud SQL.
o Proficiency in SQL for data extraction, transformation, and aggregation.
Experience:
o 3+ years of experience in ETL development and data engineering.
o 1+ years of experience setting up and managing Airflow on GCP (e.g., Composer or manual setup).
o Experience in handling data security, encryption, and IAM policies on GCP.
o Excellent debugging, problem-solving, and performance optimization skills.
Education: Degree in Computer Science (BE / BTech / MTech / MS) from a Tier I premier institute