Overview
We are seeking a forward-thinking AI Engineer to architect and deploy intelligent
systems powered by Large Language Models (LLMs), Retrieval-Augmented Generation
(RAG), and advanced chunking strategies. This role is ideal for engineers passionate
about building scalable, production-grade AI applications that leverage cutting-edge
retrieval and generation techniques.
Role Description
Design and implement RAG pipelines using hybrid search (vector + keyword), semantic ranking, and metadata enrichment.
Apply chunking strategies (fixed-size, recursive, semantic, agentic) to optimize document segmentation for retrieval and embedding.
Fine-tune and evaluate LLMs (e.g., GPT, LLaMA, Ollama) for tasks such as QA, summarization, NER, and sentiment analysis.
Build multi-agent systems.
Integrate AI capabilities into enterprise applications using Azure OpenAI, Cognitive Search, Azure Functions, and Power Automate.
Collaborate with stakeholders to translate business needs into scalable AI solutions.
Ensure model reliability, performance, and ethical compliance across deployments.
Skills
8-12 years of experience as a full-stack developer, with at least 2-4 years in Generative AI or RAG systems
Proficiency in Python
Azure AI Studio, Azure Cognitive Services, and vector DBs (e.g., Pinecone, FAISS)
Chunking techniques and embedding strategies
MLOps tools: MLflow, Docker, CI/CD pipelines
Experience with prompt engineering and model evaluation
Mandatory: familiarity with agentic systems and multi-agent orchestration
Technical/Soft Skills - Expertise Level
Python - Expert
AI Solutions - Expert
Azure Cognitive Solutions - Good
Partner/Client Communications - Expert
Desired Skills
Technical/Soft Skills - Expertise Level (Expert / Good Knowledge)
ADO (Azure DevOps) - Good Knowledge
This job is provided by Shine.com