INR 3,500,000 - 4,000,000 per year
Space Exploration & Research, Information Technology
Full-Time
HuntingCube
Overview
Job Description
About the Role
We are looking for a hands-on Senior Data Scientist / Research Scientist who can own end-to-end training and fine-tuning of open-source Large Language Models (LLMs)—from data curation and experimentation to evaluation and production deployment.
This is a builder role, not a script-runner role. You will work on Indian languages and code-switching (Hinglish, etc.), improve instruction following and tool/function calling reliability, and optimize models for low latency and high throughput in production.
Key Responsibilities
Model Training & Fine-Tuning
- Train and fine-tune open-source LLMs (a LoRA setup sketch follows this list) using:
- Continued pre-training, SFT, preference optimization (DPO / IPO / ORPO)
- Full fine-tuning, LoRA / QLoRA based approaches
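A minimal sketch of the LoRA setup, using Hugging Face Transformers and PEFT; the model id and hyperparameters are illustrative placeholders, not a prescribed recipe (QLoRA would additionally load the base model quantized, and training would then proceed with a standard Trainer or TRL's SFTTrainer):
```python
# Minimal LoRA fine-tuning setup (illustrative; model id and ranks are placeholders).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-3.1-8B"  # placeholder open-weights causal LM
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Attach low-rank adapters to the attention projections; the frozen base
# weights stay untouched, so only a small fraction of parameters trains.
lora_cfg = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # typically well under 1% of total parameters
```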
- Improve model performance on:
- Indian languages and multilingual / code-mixed inputs
- Instruction following
- Reliable tool/function calling with structured JSON outputs
- Build and maintain high-quality training pipelines for:
- Instruction datasets, tool-call traces, multilingual corpora
- Synthetic data generation
- Implement (see the data-hygiene sketch below):
- De-duplication, contamination checks
- Quality, safety, and PII filtering
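A minimal data-hygiene sketch, assuming exact-match de-duplication after normalization and regex-based PII redaction; a production pipeline would add fuzzy/MinHash dedup, benchmark-contamination checks, and language-aware quality filters:
```python
# Exact dedup + simple PII redaction over {"text": ...} records (illustrative).
import hashlib
import re

PII_PATTERNS = [
    re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),  # email addresses
    re.compile(r"\+?\d[\d\s-]{8,}\d"),        # phone-number-like strings
]

def normalize(text: str) -> str:
    return re.sub(r"\s+", " ", text.strip().lower())

def redact_pii(text: str) -> str:
    for pattern in PII_PATTERNS:
        text = pattern.sub("[REDACTED]", text)
    return text

def dedup_and_filter(records):
    seen = set()
    for rec in records:
        key = hashlib.sha256(normalize(rec["text"]).encode()).hexdigest()
        if key in seen:
            continue  # exact duplicate after normalization
        seen.add(key)
        yield {**rec, "text": redact_pii(rec["text"])}
```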
- Design evaluation frameworks and dashboards (see the schema-validation sketch below):
- Offline and online evaluation, regression testing
- Tool-calling accuracy, schema validity, multilingual benchmarks
- Latency, throughput, and cost metrics
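One such check, sketched with the `jsonschema` library: schema validity of a model's tool call. The tool schema and output format here are illustrative assumptions, not a fixed spec:
```python
# Schema-validity check for tool-call outputs (tool schema is illustrative).
import json
from jsonschema import validate, ValidationError

TOOL_SCHEMA = {
    "type": "object",
    "properties": {
        "name": {"type": "string", "enum": ["get_weather"]},
        "arguments": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
    "required": ["name", "arguments"],
}

def tool_call_is_valid(raw_output: str) -> bool:
    """True iff the output parses as JSON and conforms to the tool schema."""
    try:
        validate(instance=json.loads(raw_output), schema=TOOL_SCHEMA)
        return True
    except (json.JSONDecodeError, ValidationError):
        return False

# Aggregated over an eval set, this yields a schema-validity rate:
# validity = sum(tool_call_is_valid(o) for o in outputs) / len(outputs)
```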
- Optimize models for production (see the quantization sketch below):
- Quantization (AWQ / GPTQ / bits-and-bytes)
- Distillation, speculative decoding, KV-cache optimization
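As one example of the quantization options above, 4-bit loading via bitsandbytes might look like this (the model id is a placeholder; AWQ and GPTQ use pre-quantized checkpoints instead):
```python
# 4-bit NF4 loading with bitsandbytes (illustrative model id).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_cfg = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",            # NormalFloat4 weight quantization
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,       # also quantize the quantization constants
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B",            # placeholder model id
    quantization_config=bnb_cfg,
    device_map="auto",
)
```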
- Deploy and serve models using (see the vLLM sketch below):
- vLLM, TGI, TensorRT-LLM, ONNX (as applicable)
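A minimal offline-serving sketch with vLLM; the model id and prompt are placeholders:
```python
# Batched offline generation with vLLM (illustrative model id and prompt).
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-3.1-8B")                # placeholder model id
params = SamplingParams(temperature=0.0, max_tokens=128)  # greedy for reproducibility

outputs = llm.generate(["Mausam kaisa hai? Answer in English."], params)
print(outputs[0].outputs[0].text)
```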
- Reduce hallucinations and improve refusal behavior
- Enforce deterministic and structured outputs (see the retry sketch below)
- Apply prompting + training strategies for robust compliance
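One common pattern here is a validate-and-retry wrapper around a deterministic decode; `generate_fn` below is an assumed stand-in for whatever serving client is in use, and in practice this would be paired with constrained/guided decoding on the server side:
```python
# Validate-and-retry wrapper for structured JSON outputs (generate_fn is assumed).
import json

def generate_json(generate_fn, prompt: str, max_retries: int = 2):
    for _ in range(max_retries + 1):
        raw = generate_fn(prompt)
        try:
            return json.loads(raw)  # success: parseable JSON
        except json.JSONDecodeError:
            # Feed the failure back so the model can repair its own output.
            prompt = (f"{prompt}\n\nYour last reply was not valid JSON:\n{raw}\n"
                      "Return only valid JSON.")
    raise ValueError("model failed to produce valid JSON")
```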
- Work closely with engineering teams on:
- Model packaging, CI-based evaluation, A/B testing
- Monitoring quality drift in production
- Read research papers, propose experiments, and convert ideas into measurable improvements
Requirements
- 4–6 years of experience in ML / Data Science with hands-on LLM training & fine-tuning
- Proven ability to drive end-to-end model improvement: Data → Training → Evaluation → Production constraints → Iteration
- Strong understanding of:
- Transformers, tokenization, multilingual modeling
- Fine-tuning methods: LoRA / QLoRA, full fine-tuning, continued pre-training
- Alignment techniques: SFT, DPO / IPO / ORPO (see the DPO sketch below)
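For orientation, a DPO sketch using TRL; exact argument names vary across TRL versions, and the model id and preference pair are illustrative:
```python
# Minimal DPO run with TRL (illustrative; check argument names for your TRL version).
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "meta-llama/Llama-3.1-8B"  # placeholder
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# DPO trains on (prompt, chosen, rejected) preference triples.
pairs = Dataset.from_list([{
    "prompt": "Translate to Hindi: How are you?",
    "chosen": "आप कैसे हैं?",
    "rejected": "How are you?",
}])

trainer = DPOTrainer(
    model=model,                                     # reference model derived automatically
    args=DPOConfig(output_dir="dpo-out", beta=0.1),  # beta scales the implicit KL penalty
    train_dataset=pairs,
    processing_class=tokenizer,
)
trainer.train()
```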
- Experience building or improving tool/function calling reliability
- Strong coding skills in Python, deep experience with PyTorch
- Experience with distributed training (see the Accelerate sketch below):
- DeepSpeed / FSDP / Accelerate
- Multi-GPU / multi-node setups
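A toy example of the Accelerate pattern that underlies DDP/FSDP/DeepSpeed launches (the backend is chosen via `accelerate config` / `accelerate launch`, so the script stays backend-agnostic; the linear model and random data stand in for a real LLM and dataset):
```python
# Backend-agnostic training loop with Accelerate (toy model/data for illustration).
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

model = torch.nn.Linear(16, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
loader = DataLoader(TensorDataset(torch.randn(64, 16), torch.randn(64, 1)), batch_size=8)

accelerator = Accelerator()  # picks up DDP/FSDP/DeepSpeed from the launch config
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

for x, y in loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)  # replaces loss.backward(); handles scaling/sharding
    optimizer.step()
```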
- Solid ML fundamentals: optimization, regularization, error analysis, scaling intuition
Nice to Have
- Experience with Indian language NLP:
- Indic scripts, transliteration, normalization, code-mixing
- Experience with large-scale pre-training or continued pre-training
- Practical serving experience:
- vLLM, TGI, TensorRT-LLM
- Quantization calibration and performance profiling
- Exposure to data governance, privacy, and dataset documentation
Tech Stack
- Modeling & Training: PyTorch, Hugging Face Transformers & Datasets, PEFT
- Distributed Training: DeepSpeed, FSDP, Accelerate
- Experiment Tracking: Weights & Biases, MLflow
- Serving: vLLM, TGI, TensorRT-LLM
- Infra: Docker, Kubernetes
- Optional: Ray, Airflow, Spark
- Bonus: Vector DB / RAG stack familiarity
What Success Looks Like
- Ship a fine-tuned open-source LLM with measurable improvements in:
- Instruction following and tool-calling correctness
- Indian language and code-switching performance
- Latency and throughput at comparable quality
- Build a repeatable training + evaluation pipeline:
- Dataset versioning, training recipes, evaluation harness, regression gates
- Define a roadmap for future improvements:
- Distillation, preference tuning, multilingual expansion
Skills: LLM Fine-Tuning, LLM Training, NLP, Python
Additional Information
Interview Process
- 30-min Intro & Role Fit
- Technical Deep Dive (LLM training, evals, production constraints)
- Take-Home / Live Exercise:
- Design an LLM fine-tuning + evaluation plan for tool calling & Indic languages
- Systems Round:
- Training vs serving trade-offs, cost/latency, failure modes
- Culture & Collaboration Round