Overview
Job Title: Senior AI Engineer
Job Location: Chennai
Job Summary:
We are looking for a highly skilled Senior AI Engineer with deep expertise in Generative AI to lead the customization, optimization, and deployment of advanced language models. The ideal candidate will have hands-on experience fine-tuning LLaMA or similar large language models, building scalable AI pipelines, and driving innovation across AI product development. This role will be pivotal in advancing our AI capabilities and delivering impactful, next-gen solutions tailored to business needs.
Key Responsibilities:
Generative AI Development & Model Fine-tuning:
- Design and implement solutions using state-of-the-art Generative AI models, with a focus on LLaMA, Azure OpenAI models (e.g., GPT-4, Whisper), and other LLM architectures.
- Fine-tune pre-trained foundation models (e.g., LLaMA, GPT, Mistral, Azure OpenAI) on domain- specific datasets for downstream applications.
- Leverage techniques such as supervised fine-tuning, PEFT, LoRA, and RLHF to adapt models to specific business needs.
- Optimize large model performance for production using quantization, pruning, and distillation techniques.
Model Deployment & Inference Optimization:
- Package and deploy fine-tuned models into scalable production environments, including cloud- native or edge-based systems.
- Implement inference strategies using frameworks like Hugging Face Transformers, DeepSpeed, or ONNX Runtime.
- Ensure low-latency, high-throughput model inference pipelines are resilient, secure, and cost- effective.
AI Systems Engineering & Tooling:
- Develop robust data ingestion, preprocessing, and augmentation pipelines to support GenAI training workflows.
- Build internal tools and utilities to automate experimentation, evaluation, and model lifecycle management.
- Collaborate with MLOps teams to integrate model monitoring, logging, and alerting systems for continuous performance tracking.
Research, Evaluation & Innovation:
- Stay abreast of the latest research in Generative AI, Transformer-based models, and open-source advancements.
- Evaluate new architectures and contribute to the selection of best-fit models and training strategies.
- Run ablation studies, performance benchmarks, and bias/fairness audits to improve model quality and interpretability.
Leadership and Collaboration:
- Mentor junior AI engineers and researchers in LLM fine-tuning, evaluation, and deployment best practices.
- Work closely with cross-functional teams—data scientists, product owners, software engineers— to deliver AI-powered features and solutions.
- Contribute to technical strategy and roadmap planning for GenAI initiatives across the organization.
Key Requirements:
Educational Background:
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
Experience:
- 5+ years of experience in AI/ML engineering, with at least 2 years focused on large language models and Generative AI.
- Proven experience fine-tuning models such as LLaMA, GPT, Falcon, Azure OpenAI models, or similar open-source LLMs.
Technical Skills:
- Strong programming skills in Python and experience with AI frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
- Hands-on expertise in fine-tuning techniques like LoRA, QLoRA, PEFT, and prompt tuning.
- Solid understanding of model evaluation metrics, prompt engineering, and alignment techniques.
Infrastructure & Deployment:
- Experience deploying models in production using tools like Docker, Kubernetes, and cloud platforms (AWS, GCP, or Azure).
- Familiarity with Azure ML services for model deployment and management is a plus.
- Familiarity with MLOps tools for model tracking, CI/CD, and pipeline automation.
Data Handling:
- Ability to manage and preprocess large, unstructured datasets for model training and evaluation.
- Familiarity with data labeling, synthetic data generation, and augmentation strategies.
- Proactive mindset with a passion for staying up to date with AI advancements and emerging technologies.
- Exposure to multi-modal models (text, image, audio) or RAG (Retrieval-Augmented Generation) frameworks.
Job Types: Full-time, Permanent
Pay: ₹454,389.37 - ₹1,452,338.02 per year
Benefits:
- Paid sick time
- Provident Fund
Schedule:
- Day shift
- Monday to Friday
Work Location: In person