Itanagar, Arunachal Pradesh, India
Information Technology
Full-Time
Alliance Tek Solutions

Overview
Job Title: AI Engineer in India (Remote)
About Us:
- We are at the forefront of AI and machine learning, and we’re looking for motivated individuals to contribute to the next generation of intelligent models.
- The ideal candidate will have experience working with Turing.com and a strong background in data annotation, prompt engineering, and model fine-tuning.
- You will play a critical role in refining AI systems, providing essential human feedback, and enhancing overall model performance.
Required Skills:
- RLHF, LLM, NLP, BERT
Key Qualifications:
Experience in RLHF:
- Deep understanding of Reinforcement Learning (especially RLHF) for LLMs and how it applies to improving AI models.
- Hands-on experience in fine-tuning LLMs through iterative human feedback.
Data Expertise:
- Prior experience in annotating datasets for AI/ML models with a focus on quality control.
- Experience with annotation tools and platforms like Labelbox, Prodigy, or Turing’s proprietary tools.
Technical Proficiency:
- Familiarity with LLMs such as GPT-3/4 and BERT, and with other advanced NLP models.
- Strong command of Python, SQL, or related programming languages for handling data processing tasks.
- Understanding of prompt engineering, and experience with platforms like Hugging Face or LangChain is a plus.
Turing.com Experience:
- Prior work experience at Turing.com (or similar remote work platforms), with a focus on AI, data annotation, or similar roles.
- Understanding of the remote work dynamic and experience collaborating with distributed teams.
Preferred Qualifications:
- 2 to 5 years of experience in model fine-tuning, prompt engineering, and human-in-the-loop systems.
- Familiarity with cloud platforms (AWS, Azure) and MLOps best practices.
- Previous work on reinforcement learning pipelines in large-scale AI projects.
What You’ll Do:
- You will play a key role in annotating and curating data for the training and fine-tuning of large language models (LLMs), ensuring annotations are accurate, consistent, and project-aligned.
- You’ll implement Reinforcement Learning from Human Feedback (RLHF) techniques, providing structured human feedback to guide model outputs and continuously fine-tune models to improve performance.
Why You Should Apply:
- Flexible hours with no restrictions: work whenever it fits your schedule!
- Remote – work from anywhere in India!
Job Type: Full-time
Pay: ₹1,000.00 - ₹1,300.00 per hour
Work Location: Remote