Free cookie consent management tool by TermsFeed Data Scientist - LLM - BluemachineAI | Antal Tech Jobs
Back to Jobs
2 Days ago

Data Scientist - LLM - BluemachineAI

decor
3500000 - 4000000 INR - Yearly
Space Exploration & Research, Information Technology
Full-Time
HuntingCube

Overview

Job Description

About the Role

We are looking for a hands-on Senior Data Scientist / Research Scientist who can own end-to-end training and fine-tuning of open-source Large Language Models (LLMs)—from data curation and experimentation to evaluation and production deployment.

This is a builder role, not a script-runner role. You will work on Indian languages and code-switching (Hinglish, etc.), improve instruction following and tool/function calling reliability, and optimize models for low latency and high throughput in production.

Key Responsibilities

Model Training & Fine-Tuning

  • Train and fine-tune open-source LLMs using:
    • Continued pre-training, SFT, preference optimization (DPO / IPO / ORPO)
    • Full fine-tuning, LoRA / QLoRA based approaches
  • Improve model performance on:
    • Indian languages and multilingual / code-mixed inputs
    • Strong instruction following
    • Reliable tool/function calling with structured JSON outputs
Data & Pipelines

  • Build and maintain high-quality training pipelines for:
    • Instruction datasets, tool-call traces, multilingual corpora
    • Synthetic data generation
  • Implement:
    • De-duplication, contamination checks
    • Quality, safety, and PII filtering
Evaluation & Monitoring

  • Design evaluation frameworks and dashboards:
    • Offline and online evaluation, regression testing
    • Tool-calling accuracy, schema validity, multilingual benchmarks
    • Latency, throughput, and cost metrics
Performance & Serving Optimization

  • Optimize models for production:
    • Quantization (AWQ / GPTQ / bits-and-bytes)
    • Distillation, speculative decoding, KV-cache optimization
  • Deploy and serve models using:
    • vLLM, TGI, TensorRT-LLM, ONNX (as applicable)
Alignment & Reliability

  • Reduce hallucinations and improve refusal behavior
  • Enforce deterministic and structured outputs
  • Apply prompting + training strategies for robust compliance

Collaboration & Research

  • Work closely with engineering teams on:
    • Model packaging, CI-based evaluation, A/B testing
    • Monitoring quality drift in production
  • Read research papers, propose experiments, and convert ideas into measurable improvements
Required Qualifications (Must-Have)

  • 4–6 years of experience in ML / Data Science with hands-on LLM training & fine-tuning
  • Proven ability to drive end-to-end model improvement: Data → Training → Evaluation → Production constraints → Iteration
  • Strong understanding of:
    • Transformers, tokenization, multilingual modeling
    • Fine-tuning methods: LoRA / QLoRA, full fine-tuning, continued pre-training
    • Alignment techniques: SFT, DPO / IPO / ORPO
  • Experience building or improving tool/function calling reliability
  • Strong coding skills in Python, deep experience with PyTorch
  • Experience with distributed training:
    • DeepSpeed / FSDP / Accelerate
    • Multi-GPU / multi-node setups
  • Solid ML fundamentals: optimization, regularization, error analysis, scaling intuition
Good to Have (Nice-to-Have)

  • Experience with Indian language NLP:
    • Indic scripts, transliteration, normalization, code-mixing
  • Experience with large-scale pre-training or continued pre-training
  • Practical serving experience:
    • vLLM, TGI, TensorRT-LLM
    • Quantization calibration and performance profiling
  • Exposure to data governance, privacy, and dataset documentation
Tech Stack

  • Modeling & Training: PyTorch, Hugging Face Transformers & Datasets, PEFT
  • Distributed Training: DeepSpeed, FSDP, Accelerate
  • Experiment Tracking: Weights & Biases, MLflow
  • Serving: vLLM, TGI, TensorRT-LLM
  • Infra: Docker, Kubernetes
  • Optional: Ray, Airflow, Spark
  • Bonus: Vector DB / RAG stack familiarity

What Success Looks Like (First 90–180 Days)

  • Ship a fine-tuned open-source LLM with measurable improvements in:
    • Instruction following and tool-calling correctness
    • Indian language and code-switching performance
    • Lower latency and higher throughput at comparable quality
  • Build a repeatable training + evaluation pipeline:
    • Dataset versioning, training recipes, evaluation harness, regression gates
  • Define a roadmap for future improvements:
    • Distillation, preference tuning, multilingual expansion
Required Skills

['LLM Fine-Tuning', 'LLM training', 'NLP', 'Python']

Additional Information

Interview Process

  • 30-min Intro & Role Fit
  • Technical Deep Dive (LLM training, evals, production constraints)
  • Take-Home / Live Exercise:
    • Design an LLM fine-tuning + evaluation plan for tool calling & Indic languages
  • Systems Round:
    • Training vs serving trade-offs, cost/latency, failure modes
  • Culture & Collaboration Round
Share job
Similar Jobs
View All
3 Hours ago
DevSecOps Engineer - WFO
Information Technology
  • 4 - 7 Yrs
  • Mumbai
Job Title: DevSecOps Engineer Experience: 4 to 7 Years Location: Andheri (East), Mumbai Work Mode: Work From Office Shift Timing: 9:30 AM to 6:30 PM About the Role We are looking for a highly skilled DevSecOps Engineer to join our growing t...
decor
3 Hours ago
DevSecOps Engineer – US Shift - WFH
Information Technology
  • 4 - 7 Yrs
  • Anywhere in India/Multiple Locations
Job Title: DevSecOps Engineer Experience: 4 to 7 Years Location: Remote (Work From Home) Shift Timing: 7:00 PM to 3:00 AM (US Shift) About the Role We are seeking a skilled DevSecOps Engineer to support our global infrastructure and applicat...
decor
1 Day ago
Software Engineer (Java Backend)
Fintech
  • 1000000 - 1400000 INR - Yearly
  • 2 - 3 Yrs
  • Telangana, Hyderabad
Job requirement: Our Fintech client is looking to hire individual (s) who have passionate about applying technology to solve complex business challenges. This role will enable innovative development of modular technology solutions delivering ...
decor
1 Day ago
Lead Python Developer
Space Exploration & Research, Information Technology
Experience: 6.00 + yearsSalary: Confidential (based on experience)Expected Notice Period: 15 DaysShift: (GMT+05:30) Asia/Kolkata (IST)Opportunity Type: RemotePlacement Type: Full Time Contract for 12 Months(40 hrs a week/160 hrs a month)(*Note: This ...
decor
1 Day ago
Engineering Advocacy Lead Software Engineer
Space Exploration & Research, Information Technology
Job DescriptionAs a Vice President - Engineering Advocacy Lead at JPMorgan Chase within the Payments Engineering & Architecture team, you're responsible for how 10,000 Payments engineers adopt architecture standards and AI-assisted development practi...
decor
1 Day ago
Manual and Automation Test Engineer
Space Exploration & Research, Information Technology
Manual and Automation Test EngineerExperience - 5-7yrsNotice period: ImmediateLocation : Trivandrum/KochiBudget : 10L-12LKey ResponsibilitiesAnalyze and identify project requirements and translate them into effective testing strategiesDevelop and mai...
decor
1 Day ago
Java Fullstack Developer
Space Exploration & Research, Information Technology
Position OverviewWe are looking for experienced Java Full Stack and Java Backend Developers who can design, develop, and maintain high‑quality applications. The ideal candidate should have solid hands‑on experience in Java, Spring Boot, and Microse...
decor
1 Day ago
Software Engineer III - AL/ML Platform
Space Exploration & Research, Information Technology
Job DescriptionWe have an exciting and rewarding opportunity for you to take your software engineering career to the next level.As a Software Engineer III at JPMorganChase within the Corporate and Investment Bank you serve as a seasoned member of an ...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media