
Overview
About Tarento:
Tarento is a fast-growing technology consulting company headquartered in Stockholm, with a strong presence in India and clients across the globe. We specialize in digital transformation, product engineering, and enterprise solutions, working across diverse industries including retail, manufacturing, and healthcare. Our teams combine Nordic values with Indian expertise to deliver innovative, scalable, and high-impact solutions.
We're proud to be recognized as a Great Place to Work, a testament to our inclusive culture, strong leadership, and commitment to employee well-being and growth. At Tarento, you’ll be part of a collaborative environment where ideas are valued, learning is continuous, and careers are built on passion and purpose.
Overview
An Apache Superset Data Engineer is responsible for designing, building, and maintaining robust data pipelines and analytics frameworks, with a strong focus on data visualization and dashboarding using Apache Superset. This role bridges data engineering and business intelligence, ensuring that data is accessible, accurate, and actionable for stakeholders through interactive dashboards and reports.
Key Responsibilities
- Design, develop, and optimize data pipelines to collect, process, and transform large structured and semi-structured datasets from multiple sources
- Build and maintain data warehouses and data marts, ensuring data is modeled for efficient querying and reporting
- Develop, customize, and maintain interactive dashboards and reports in Apache Superset for experimentation insights, KPI tracking, and business decision-making
- Collaborate with data analysts, business stakeholders, and BI teams to understand requirements and translate them into effective Superset dashboards and visualizations
- Ensure data quality through validation, feature engineering, and exploratory data analysis
- Monitor and analyze A/B testing results, providing actionable insights to optimize business strategies
- Implement and document standard processes for statistical testing, data quality, and analytics workflows
- Integrate Superset with various databases (MySQL, PostgreSQL, etc.) and manage database connections and drivers
- Support real-time and batch data processing using modern data engineering tools (e.g., Spark, Airflow, Python)
- Maintain and enhance the security, scalability, and performance of Superset deployments
- Communicate results and recommendations to both technical and non-technical stakeholders
Required Skills and Qualifications
- Strong experience with Apache Superset for building dashboards, reports, and custom visualizations
- Proficiency in SQL and experience with RDBMS (MySQL, PostgreSQL, Oracle, etc.)
- Solid programming skills in Python (or R), with experience in data processing and automation
- Hands-on experience with data modeling, ETL/ELT pipelines, and data warehousing concepts
- Familiarity with big data tools and frameworks (e.g., Spark, Hadoop, Airflow)
- Experience working with cloud environments (AWS, Azure, GCP) is a plus
- Knowledge of data governance, data security, and access control best practices
- Strong analytical and problem-solving skills, with attention to detail and data quality
- Excellent communication skills, able to explain technical concepts to non-technical users
- Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related field