Hyderabad, Telangana, India
Information Technology
Full-Time
Uplers
Overview
Experience: 5.00 + years
Salary: INR 3000000-4000000 / year (based on experience)
Expected Notice Period: 30 Days
Shift: (GMT+05:30) Asia/Kolkata (IST)
Opportunity Type: Remote
Placement Type: Full Time Indefinite Contract(40 hrs a week/160 hrs a month)
(*Note: This is a requirement for one of Uplers' client - Pentimenti AI)
What do you need for this opportunity?
Must have skills required:
CI/CD, GCPVertex, LangChain, Sagemaker, TensorFlow, LLM, MLOps, Pytorch, rag, Vector Database, Cloud Server (Google / AWS), Python
Pentimenti AI is Looking for:
WHY THIS ROLE MATTERS
Agentic platforms are the third wave of AI adoption, letting organisations delegate complex multi‑step work to autonomous LLM‑powered “knowledge robots.” Winning teams pair Retrieval‑Augmented Generation (RAG) with low‑latency inference to deliver factual answers at scale. You will own that stack—research, optimisation, and production deployment—so we ship features that feel like magic to end‑users.
————————————————————————
Position Overview
Core Qualifications
Bonus Skills
SUCCESS METRICS (FIRST 6 MONTHS)
Ship v1 RAG pipeline with < 800 ms P95 latency and ≥ 90 % factual score.Cut inference cost per 1 k tokens by ≥ 40 %. Publish a white‑paper/blog on agent orchestration improving tool reliability by 25 %. Build & mentor a team of 3–5 engineers; institute automated eval harnesses and CI/CD for model releases.
————————————————————————
TECH STACK YOU’LL OWN
Python
Compensation & Benefits
HIRING PROCESS
Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement.
(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).
So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!
Salary: INR 3000000-4000000 / year (based on experience)
Expected Notice Period: 30 Days
Shift: (GMT+05:30) Asia/Kolkata (IST)
Opportunity Type: Remote
Placement Type: Full Time Indefinite Contract(40 hrs a week/160 hrs a month)
(*Note: This is a requirement for one of Uplers' client - Pentimenti AI)
What do you need for this opportunity?
Must have skills required:
CI/CD, GCPVertex, LangChain, Sagemaker, TensorFlow, LLM, MLOps, Pytorch, rag, Vector Database, Cloud Server (Google / AWS), Python
Pentimenti AI is Looking for:
WHY THIS ROLE MATTERS
Agentic platforms are the third wave of AI adoption, letting organisations delegate complex multi‑step work to autonomous LLM‑powered “knowledge robots.” Winning teams pair Retrieval‑Augmented Generation (RAG) with low‑latency inference to deliver factual answers at scale. You will own that stack—research, optimisation, and production deployment—so we ship features that feel like magic to end‑users.
————————————————————————
Position Overview
- Own the agentic & RAG roadmap — design, prototype, and launch LLM agents (planner–executor, multi‑agent, tool‑calling) that hit sub‑second P95 latency in production.
- Invent & productionise RAG pipelines — embedding strategy, vector‑DB design (Weaviate, Pinecone), hybrid search, evaluations, guard‑rails.
- Fine‑tune frontier models with PEFT/LoRA, RLHF, safety alignment; publish research that moves the needle.
- Optimise inference — quantisation (INT4/8), speculative decoding, TensorRT‑LLM/vLLM or Ray Serve to cut cost per token by ≥ 40 %.
- Lead & mentor a small, high‑agency team; codify MLOps, CI/CD, observability, and data‑governance best practices.
- Partner with product & design to turn research into delightful user features that 10× customer ROI.
Core Qualifications
- EXPERIENCE – 5+ years in software/ML, including 2+ years shipping LLM/NLP products at scale.
- DEEP‑LEARNING STACK – Expert in Python and PyTorch (TensorFlow / JAX welcome); CUDA or Triton kernels a plus.
- AGENTIC & RAG FRAMEWORKS – Hands‑on with LangChain, LlamaIndex, CrewAI; vector DBs — Weaviate, Pinecone, Qdrant.
- MODEL OPTIMISATION – Quantisation, distillation, AWS Neuron or GPU kernel tuning.
- CLOUD & MLOPS – Kubernetes, Ray, SageMaker or GCP Vertex; Terraform/Pulumi IaC; structured observability.
- COMMUNICATION & LEADERSHIP – Writes crisp design docs and guides cross‑functional teams.
Bonus Skills
- Multimodal agent systems (vision‑language, audio‑language).
- Privacy‑preserving ML (federated learning, differential privacy).
- OSS contributions to LangChain, Weaviate, Pinecone, Triton, vLLM.
SUCCESS METRICS (FIRST 6 MONTHS)
Ship v1 RAG pipeline with < 800 ms P95 latency and ≥ 90 % factual score.
TECH STACK YOU’LL OWN
Python
- PyTorch
- JAX
- Ray Serve
- Kubernetes
- LangChain/LlamaIndex
- Weaviate/Pinecone
- vLLM/TensorRT‑LLM
- AWS Bedrock/SageMaker
- PG‑vector
- Prometheus + Grafana
Compensation & Benefits
- Base: ₹ 30–40 LPA (India)
- Remote‑first flexibility, quarterly on‑sites.
HIRING PROCESS
- Intro chat — vision & culture fit.
- Deep‑dive — solve an open‑ended agent/RAG problem in our codebase.
- Research case — present past optimisation work or a roadmap proposal.
- Values & leadership interview with founders.
- Offer — we aim to close within 2 weeks.
- Step 1: Click On Apply! And Register or Login on our portal.
- Step 2: Complete the Screening Form & Upload updated Resume
- Step 3: Increase your chances to get shortlisted & meet the client for the Interview!
Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement.
(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).
So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in