Overview
About the job:
Key responsibilities:1. Collaborate directly with the founding team on core AI model development and strategy
2. Fine-tune and customize LLMs for specific use cases and performance requirements
3. Evaluate and benchmark different language models to determine optimal solutions
4. Optimize model performance using LoRA, QLoRA, and other parameter-efficient methods
5. Implement and experiment with RLHF workflows
6. Design and execute training pipelines for custom model development
7. Research and implement cutting-edge techniques in model optimization and efficiency
8. Create new AI solutions that solve real-world problems at scale
9. Lead technical initiatives and mentor junior team members as the team grows
Why this role is special:
1. Work directly with founders on core product decisions and technical strategy
2. Shape the AI architecture from inception to production scale
3. Lead research initiatives and influence the direction of AI capabilities
4. Access cutting-edge research and implement the latest techniques
5. Work with state-of-the-art hardware and computational resources
6. Collaborate with brilliant minds and learn from industry experts
7. Join as a founding team member with significant equity and growth potential
8. Take on leadership opportunities as the team expands
9. Gain industry recognition through publications and open-source contributions
10. Work primarily in the office at the CBD Bangalore location for maximum collaboration
Who can apply:
- have minimum 1 years of experience
Only those candidates can apply who:
Salary:
₹ 4,00,000 - 5,00,000 /yearExperience:
1 year(s)Deadline:
2025-10-09 23:59:59Other perks:
Informal dress code, Free snacks & beveragesSkills required:
Python, Deep Learning, Model Evaluation, Generative AI Development and Model fine-tuningOther Requirements:
1. Strong understanding of LLM architectures (transformers, attention) and hands-on experience fine-tuning GPT, LLaMA, Mistral in production.
2. Proficiency in training/fine-tuning pipelines: data prep, hyperparameter tuning, evaluation, with knowledge of LoRA/QLoRA and parameter-efficient techniques.
3. Practical exposure to RLHF and human preference learning.
4. Expert-level Python programming with PyTorch and Hugging Face Transformers.
5. Knowledge of distributed training, model parallelization, GPU optimization, and efficient model serving.
6. Strong CS fundamentals (algorithms, data structures, system design) and solid math background (linear algebra, statistics, optimization).
7. Experience (0–1 yrs) with deep learning & LLMs; proven record of fine-tuning and deploying models.
8. Familiarity with MLOps pipelines, evaluation frameworks, benchmarking.
9. Bachelor’s degree from IIT/NIT/BITS (CS or related).
10. Self-starter, motivated, and passionate about advancing AI; able to solve complex technical problems and thrive in ambiguous environments.
About Company:
Icecreamlabs is an AI venture studio. We build AI-first startups tackling complex enterprise problems.