Overview
Key Skills:
Large Language Models (LLM):
Experience with LangChain, LangGraph Proficiency in building agentic patterns like ReAct, ReWoo, LLMCompiler Multi-modal
Retrieval-Augmented Generation (RAG):
Expertise in multi-modal AI systems (text, images, audio, video) Designing and optimizing chunking strategies and clustering for large data processing
Streaming & Real-time Processing:
Experience in audio/video streaming and real-time data pipelines Low-latency inference and deployment architectures
NL2SQL:
Natural language-driven SQL generation for databases Experience with natural language interfaces to databases and query optimization
API Development:
Building scalable APIs with FastAPI for AI model serving
Containerization & Orchestration:
Proficient with Docker for containerized AI services Experience with orchestration tools for deploying and managing services
Data Processing & Pipelines:
Experience with chunking strategies for efficient document processing Building data pipelines to handle large-scale data for AI model training and inference
AI Frameworks & Tools:
Experience with AI/ML frameworks like TensorFlow, PyTorch Proficiency in LangChain, LangGraph, and other LLM-related technologies
Prompt Engineering:
Expertise in advanced prompting techniques like Chain of Thought (CoT) prompting, LLM Judge, and self-reflection prompting Experience with prompt compression and optimization using tools like LLMLingua, AdaFlow, TextGrad, and DSPy Strong understanding of context window management and optimizing prompts for performance and efficiency
Job Type: Full-time
Pay: ₹2,000,000.00 - ₹3,400,000.00 per year
Benefits:
- Health insurance
- Provident Fund
Schedule:
- Monday to Friday
Work Location: In person
Application Deadline: 05/07/2025