Gurugram, Haryana, India
Space Exploration & Research, Information Technology
Full-Time
Recode Solutions

Overview
Job Summary
We are looking for a hands-on Data Scientist with deep expertise in NLP and Generative AI to help build and refine the intelligence behind our agentic AI systems. You will be responsible for fine-tuning, prompt engineering, and evaluating LLMs that power our digital workers across real-world workflows.
Key Responsibilities
Fine-tune and evaluate LLMs (e.g., Mistral, LLaMA, Qwen) using frameworks like Unsloth, HuggingFace, and DeepSpeed
Develop high-quality prompts and RAG pipelines for few-shot and zero-shot performance
Analyze and curate domain-specific text datasets for training and evaluation
Conduct performance and safety evaluation of fine-tuned models
Collaborate with engineering teams to integrate models into agentic workflows
Stay up to date with the latest in open-source LLMs and GenAI tools, and rapidly prototype experiments
Apply efficient training and inference techniques (LoRA, QLoRA, quantization, etc.)
Requirements
- 3+ years of experience in Natural Language Processing (NLP) and machine learning applied to text
Strong coding skills in python
Hands-on experience fine-tuning LLMs (e.g., LLaMA, Mistral, Falcon, Qwen) using frameworks like Unsloth, HuggingFace Transformers, PEFT, LoRA, QLoRA, bitsandbytes
Proficient in PyTorch (preferred) or TensorFlow, with experience in writing custom training/evaluation loops
Experience in dataset preparation, tokenization (e.g., Tokenizer, tokenizers), and formatting for instruction tuning (ChatML, Alpaca, ShareGPT formats)
Familiarity with retrieval-augmented generation (RAG) using FAISS, Chroma, Weaviate, or Qdrant
Strong knowledge of prompt engineering, few-shot/zero-shot learning, chain-of-thought prompting, and function-calling patterns
Exposure to agentic AI frameworks like CrewAI, Phidata, LangChain, LlamaIndex, or AutoGen
Experience with GPU-accelerated training/inference and libraries like DeepSpeed, Accelerate, Flash Attention, Transformers v2, etc.
Solid understanding of LLM evaluation metrics (BLEU, ROUGE, perplexity, pass@k) and safety-related metrics (toxicity, bias)
Ability to work with open-source checkpoints and formats (e.g., safetensors, GGUF, HF Hub, GPTQ, ExLlama)
Comfortable with containerized environments (Docker) and scripting for training pipelines, data curation, or evaluation workflows
Nice to Haves
Experience in Linux (Ubuntu)
Terminal/Bash Scripting
Date Opened
07/08/2025
Job Type
Full time
Years of Experience
3 - 6 Years
Domain
Chemicals
City
Chennai
State/Province
Tamil Nadu
Country
India
Zip/Postal Code
600001
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in