Overview
At Roche, you can be entirely yourself and are valued for your unique qualities. Our culture encourages personal expression, open dialogue, and genuine connections. Here you are valued, accepted, and respected for who you are. This creates an environment in which you can grow both personally and professionally. Together, we aim to prevent, stop, and cure diseases and to ensure that everyone has access to healthcare, today and in the future. Join Roche, where every voice matters.
The Position
We are seeking an experienced Data Engineer to join our IVD Data Insights team, with strong expertise in building scalable data pipelines and ETL solutions using Python and modern orchestration tools. This role combines hands-on development with responsibilities such as guiding developers, overseeing design and unit testing, and ensuring well-documented, high-quality solutions.
You will work closely with business stakeholders to understand requirements and deliver reliable cloud-based data products that support analytics, insights, and data-driven decision-making. The role also requires experience handling sensitive healthcare data and implementing data privacy and protection mechanisms, including data anonymization, masking, and secure data processing practices, ensuring compliance with healthcare and data protection regulations such as HIPAA and GDPR.
Success in this role requires proven experience with cloud platforms, the software development lifecycle, data engineering best practices, data modeling, and building secure, privacy-aware data pipelines that enable scalable and compliant data solutions.
REQUIRED EXPERIENCE, SKILLS & QUALIFICATIONS
- Around 5–8 years of experience working with high-performance data products and large-scale data systems.
- Strong expertise in Python with hands-on experience in building and maintaining ETL pipelines, including data processing/manipulation libraries (e.g., pandas, PySpark, Dask).
- Proficiency in designing and developing scalable data pipelines and ETL processes, using tools such as AWS Glue, PySpark, Spark SQL, and orchestration frameworks like Airflow or AWS Step Functions.
- Expertise with AWS cloud services relevant to data engineering, including Glue, EMR, ECS, Lambda, and Lake Formation, along with other components for data processing and orchestration.
- Experience with databases across different paradigms (relational, columnar, NoSQL, and MPP), such as Redshift, DynamoDB, Aurora, Postgres, and Snowflake.
- Experience implementing data privacy and protection mechanisms such as anonymization, pseudonymization, data masking, and tokenization in large-scale data pipelines. Familiarity with healthcare data regulations (HIPAA, GDPR) and implementing privacy-by-design principles in cloud data platforms.
- Proficiency in software engineering best practices, including version control (Git), containerization (Docker), unit/integration testing, and CI/CD pipelines.
- Strong data analysis skills with the ability to aggregate, transform, and prepare data for reporting and analytics.
- Knowledge of security, compliance, and design best practices for data solutions.
- Familiarity with API development and working with JSON/XML data formats.
- Proven ability to lead and mentor a team of Data Engineers to design, develop, and deliver data products.
- Excellent interpersonal, analytical, and communication skills for effective collaboration with stakeholders and cross-functional teams.
- Experience with reporting and visualization tools such as Tableau or Apache Superset is a plus.
- Build scalable end-to-end data pipelines to integrate and model datasets from diverse sources, ensuring alignment with functional and non-functional requirements.
- Translate business requirements and end-to-end designs into technical implementations that leverage system capabilities.
- Define and champion reusable, extensible, scalable, and maintainable solutions, while considering cost-benefit trade-offs.
- Conduct technical walk-throughs to ensure clear communication of system architecture.
- Collaborate with data engineering teams to deliver advanced cloud-based data products to clients.
- Collaborate with security, compliance, and governance teams to ensure data platforms meet regulatory requirements and internal privacy policies.
- Interact with business and functional stakeholders to comprehend data requirements and downstream analytics needs.
- Validate technology solutions, produce concise design documentation, and contribute to work estimates.
- Cultivate a data-driven culture within the team and spearhead impactful data engineering projects.
- Stay informed about data engineering trends and integrate data best practices into software development, ensuring data integrity, scalability, and efficiency in alignment with the Roche motto: "Doing now what patients need next."
- Experience in the Healthcare Laboratory (IVD) domain is a plus.
- Demonstrated ability to collaborate effectively with cross-functional teams in a fast-paced and dynamic environment.
- Proven track record of conducting root cause analyses on both internal and external data and processes to address specific business inquiries and identify areas for enhancement.
A healthier future drives us to innovate. More than 100,000 employees worldwide work together to advance science and ensure that everyone has access to healthcare, today and for generations to come. Through our efforts, more than 26 million people are treated with our medicines and more than 30 billion tests are conducted with our diagnostics products. We encourage one another to explore new possibilities, foster creativity, and set high goals in order to deliver life-changing healthcare solutions.
Together, we can build a healthier future.
Roche is an Equal Opportunity Employer.