Overview
About the job:
We are seeking a skilled Data Engineer to join our growing AdTech team. In this role, you will design, build, and maintain high-performance ETL pipelines and large-scale data processing systems. You will work with massive datasets and distributed frameworks to power Adsremedy's data-driven advertising solutions across Programmatic, In-App, CTV, and DOOH platforms.
Key Responsibilities:
1. Design, develop, and maintain scalable ETL pipelines on self-managed infrastructure.
2. Process and optimize large-scale datasets (terabytes of data) with high reliability and performance.
3. Build robust data processing workflows using Apache Spark (preferred) and/or Apache Flink.
4. Integrate, clean, and transform data from multiple internal and external sources.
5. Partner with data scientists, analysts, and business stakeholders to enable actionable insights.
6. Monitor, troubleshoot, and optimize data pipelines for operational excellence.
7. Ensure data quality, consistency, and performance across all workflows.
8. Participate in code reviews and uphold best practices in data engineering.
9. Collaborate with QA teams to deliver production-ready systems.
10. Mentor junior engineers and promote knowledge sharing within the team.
11. Stay updated with emerging tools, frameworks, and industry trends.
Who can apply:
- have minimum 1 years of experience
- are from Mumbai only
- are Computer Science Engineering students
Only those candidates can apply who:
Salary:
₹ 3,15,000 - 4,20,000 /year
Experience:
1 year(s)
Deadline:
2026-04-24 23:59:59
Other perks:
5 days a week, Health Insurance
Skills required:
Java, Python, SQL and Scala
Other Requirements:
1. 1+ years of experience building ETL pipelines using Apache Spark and/or Apache Flink.
2. Hands-on experience with big data caching solutions such as ScyllaDB, Aerospike, or similar.
3. Strong understanding of data lake architectures and tools like Delta Lake.
4. Proven experience handling terabytes of data in distributed environments.
5. Proficiency in Scala, Python, or Java.
6. Experience with cloud data platforms such as AWS S3, Azure Data Lake, or Google BigQuery.
7. Strong knowledge of SQL, data modeling, and data warehousing concepts.
8. Familiarity with Git and CI/CD workflows.
9. Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment.
10. Experience with Apache Kafka for real-time data streaming
11. Familiarity with Apache Airflow or similar orchestration tools
About Company:
Adsremedy Media LLP
We are an AdTech (Advertising Technology) company that has built a cutting-edge Real-Time Bidding technology capable of handling high volumes of traffic at milliseconds latency. We are revolutionizing the digital advertising industry by connecting online advertisers and publishers in a highly efficient and automated manner. As we continue to scale our operations, we are seeking energetic and passionate people to join us in our journey.