Overview
About the job:
Hiring for a next-generation technology company building sovereign, secure, and scalable digital platforms across AI, Web3.0, cloud, and health-tech.
Hands-on Senior Data Engineer to build and maintain scalable data pipelines, lakehouse architecture, and analytics infrastructure. Work with modern open-source technologies to enable data-driven insights and GenAI-powered analytics capabilities.
Key Responsibilities:
Data Ingestion & ETL:
- Build and maintain Airbyte data pipelines from multiple sources
- Configure connectors for databases (PostgreSQL, MySQL, MongoDB), APIs, and SaaS applications
- Implement real-time data replication using CDC or Kafka streaming
- Design incremental and full-load sync strategies
- Ensure data validation and quality checks
Workflow Orchestration:
- Develop and maintain Airflow DAGs for ETL workflows
- Implement error handling, retry logic, and monitoring
- Schedule complex multi-step data pipelines
- Integrate with databases, APIs, and external systems
Data Warehouse & Analytics:
- Design and optimize ClickHouse schemas for OLAP
- Write advanced SQL queries and optimize their performance
- Implement materialized views and aggregations
- Manage data partitioning and retention policies
- Build data marts for analytics use cases
Lakehouse Architecture:
- Implement a data lakehouse on object storage (Cloudian)
- Design data partitioning strategies (by date, region)
- Work with Parquet/ORC file formats
- Implement access control and data governance
BI & Dashboards:
- Support Apache Superset and Power BI dashboard development
Who can apply:
Only those candidates can apply who:
- have a minimum of 4 years of experience
- are Computer Science Engineering students
Salary:
₹ 25,00,000 - 35,00,000 /year
Experience:
4 years
Deadline:
2026-09-30 23:59:59