Overview
About the job:
We're looking for a Python engineer with strong data processing experience to build and maintain our ML training infrastructure. You'll design scalable data pipelines, optimize training workflows, and work directly with large-scale robotics datasets. Experience with PyTorch is highly valuable.Tech Stack
Core: Python, PyTorch, Ray, Airflow, Pandas
Key responsibilities:
1. Design, build, and maintain scalable data pipelines for ML training workflows
2. Process and transform large-scale robotics datasets for model training
3. Optimize data ingestion, preprocessing, and feature engineering pipelines
4. Collaborate with ML engineers to streamline model training and evaluation
5. Implement distributed computing solutions using Ray for parallel processing
6. Build and maintain Airflow DAGs for orchestrating complex data workflows
7. Write clean, well-tested, production-ready code
Who can apply:
- are Computer Science Engineering students
Only those candidates can apply who:
Salary:
₹ 3,00,000 - 4,00,000 /yearExperience:
0 year(s)Deadline:
2026-01-09 23:59:59Skills required:
Python, PyTorch and PandasOther Requirements:
1. Strong proficiency in Python (3.x) with a focus on data processing
2. Hands-on experience with data pipeline tools (Airflow, Luigi, or similar)
3. Experience with PyTorch or other deep learning frameworks
4. Proficiency with Pandas, NumPy, and data manipulation at scale
5. Familiarity with distributed computing (Ray, Dask, or Spark)
6. Experience with cloud platforms (AWS/GCP/Azure) and containerization
7. Bachelor's in Computer Science, Engineering, or equivalent experience
8. Experience with robotics data (sensor streams, trajectories, simulations)
9. Background in ML model training pipelines and experiment tracking
10. Familiarity with data versioning tools (DVC, MLflow, W&B)
11. Contributions to open-source projects
About Company:
Technoculture Research is re-imagining how the world measures health. We build micro-scale electrochemical laboratories that bring lab-grade accuracy directly into the hands of clinicians, community health workers, and even patients at home. Our platform integrates microfabricated electrodes, novel surface chemistries, and microfluidics to run protein, nucleic-acid, and metabolite assays within minutes. By replacing costly optical detection with electron sensing, we significantly reduce instrument and per-test costs, making precision diagnostics truly accessible. Our mission is to make diagnostics abundant, so every critical health decision is guided by immediate, affordable results.