Overview
We're looking for a Python engineer with strong data processing experience to build and maintain our ML training infrastructure. You'll design scalable data pipelines, optimize training workflows, and work directly with large-scale robotics datasets. Experience with PyTorch is highly valuable.
Tech Stack
Core: Python, PyTorch, Ray, Airflow, Pandas
Key Responsibilities
- Design, build, and maintain scalable data pipelines for ML training workflows
- Process and transform large-scale robotics datasets for model training
- Optimize data ingestion, preprocessing, and feature engineering pipelines
- Collaborate with ML engineers to streamline model training and evaluation
- Implement distributed computing solutions using Ray for parallel processing
- Build and maintain Airflow DAGs for orchestrating complex data workflows
- Write clean, well-tested, production-ready code
About Company: Technoculture Research is re-imagining how the world measures health. We build micro-scale electrochemical laboratories that bring lab-grade accuracy directly into the hands of clinicians, community health workers, and even patients at home. Our platform integrates microfabricated electrodes, novel surface chemistries, and microfluidics to run protein, nucleic-acid, and metabolite assays within minutes. By replacing costly optical detection with electron sensing, we significantly reduce instrument and per-test costs, making precision diagnostics truly accessible. Our mission is to make diagnostics abundant, so every critical health decision is guided by immediate, affordable results.