Overview
Job Title: AI Developer
Location: Mumbai (On-site)
Experience: 3–5 years
Role Summary:
We are seeking an experienced AI Developer to lead the fine-tuning, deployment, and optimization of the custom Proniti AI model, built on prevailing AI model architectures (26B-A4B MoE / 31B dense). You will be responsible for transforming the base model into a highly secure, autonomous reasoning engine capable of executing complex standard operating procedure (SOP) gap analyses and regulatory reporting.
Key Responsibilities:
Model Fine-Tuning: Configure and execute Supervised Fine-Tuning (SFT) pipelines using Parameter-Efficient Fine-Tuning (PEFT) methodologies. Utilize Quantized Low-Rank Adaptation (QLoRA) with frameworks like Hugging Face TRL and Unsloth (using bitsandbytes nf4 quantization) to adapt the model without catastrophic forgetting.
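For candidates unfamiliar with the stack, the kind of configuration this involves can be sketched as follows. This is an illustrative snippet only: the hyperparameters, target modules, and compute dtype are placeholder assumptions, not Proniti's actual settings, and the base model itself is not named here.

```python
# Illustrative QLoRA configuration sketch (placeholder values, not
# Proniti's actual fine-tuning settings).
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# bitsandbytes NF4 quantization: load the frozen base weights in 4-bit
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

# LoRA adapters: only these small low-rank matrices are trained, which is
# what keeps PEFT from overwriting the base model's knowledge
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```

In a TRL-based pipeline, these config objects would be passed to the model loader and `SFTTrainer` respectively; Unsloth wraps the same ideas behind its own loader.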
Sovereign Infrastructure Deployment: Manage the deployment of the model on sovereign Indian cloud infrastructure. Work directly with dedicated NVIDIA H100 or L40S GPU clusters hosted in Mumbai-based Tier IV data centers to ensure data privacy and ultra-low latency.
Inference Optimization: Deploy and configure the vLLM inference engine. You will tune server flags such as --gpu-memory-utilization to reserve KV-cache headroom for long-context workloads, and enable Gemma 4's model-specific reasoning parser (--reasoning-parser gemma4).
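A typical launch of such a server might look like the following. This is a configuration sketch only: the model path and numeric values are placeholders, and the gemma4 parser value is taken from the role description above.

```shell
# Illustrative vLLM launch; model path and values are placeholders.
vllm serve /models/proniti-base \
    --tensor-parallel-size 4 \
    --gpu-memory-utilization 0.90 \
    --max-model-len 32768 \
    --reasoning-parser gemma4
```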
Agentic Tool Orchestration: Implement native tool-calling capabilities by mapping Proniti's backend APIs to the model's <|toolcall> and <|toolresponse> control tokens, powering the autonomous Reporting Agent.
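The shape of this mapping can be sketched with standard-library Python. Note the assumptions: the control token names come from the posting, but the JSON payload format between them, the API name, and its signature are all hypothetical illustrations.

```python
import json

# Hypothetical backend API exposed as a tool; the name, signature, and
# the JSON payload format after the control token are assumptions.
def get_sop_gap_report(department: str) -> dict:
    return {"department": department, "gaps": 3}

TOOLS = {"get_sop_gap_report": get_sop_gap_report}

def dispatch_tool_call(model_output: str) -> str:
    """Extract the JSON payload after the <|toolcall> control token,
    invoke the mapped backend API, and wrap the result in a
    <|toolresponse> block to feed back to the model."""
    start = model_output.index("<|toolcall>") + len("<|toolcall>")
    call = json.loads(model_output[start:].strip())
    result = TOOLS[call["name"]](**call["arguments"])
    return "<|toolresponse>" + json.dumps(result)

reply = dispatch_tool_call(
    '<|toolcall>{"name": "get_sop_gap_report", "arguments": {"department": "QA"}}'
)
```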
Constrained Decoding: Implement structured JSON output generation via vLLM's guided decoding engine to guarantee that the AI generates perfectly structured data payloads for the Proniti Compliance Dashboard.
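Concretely, a request against vLLM's OpenAI-compatible endpoint carries the JSON Schema alongside the prompt; the engine then constrains sampling so only schema-valid tokens can be emitted. The schema fields and model name below are illustrative assumptions, not the actual Compliance Dashboard contract.

```python
import json

# Illustrative JSON Schema for a dashboard payload (field names assumed).
REPORT_SCHEMA = {
    "type": "object",
    "properties": {
        "sop_id": {"type": "string"},
        "gap_severity": {"type": "string", "enum": ["low", "medium", "high"]},
        "remediation": {"type": "string"},
    },
    "required": ["sop_id", "gap_severity", "remediation"],
}

# Body for a direct POST to vLLM's OpenAI-compatible /v1/chat/completions
# endpoint; "guided_json" is vLLM's guided-decoding extension field.
request_body = {
    "model": "proniti-base",  # placeholder model name
    "messages": [
        {"role": "user", "content": "Summarise the gap analysis for SOP-114."}
    ],
    "guided_json": REPORT_SCHEMA,
}

payload = json.dumps(request_body)
```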
Security Governance: Integrate the open-source Agent Governance Toolkit to provide deterministic, sub-millisecond policy enforcement, preventing risks like tool misuse or prompt injections.
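The toolkit's own API is not documented here, but the kind of deterministic check such a layer performs before any tool call is dispatched can be illustrated with plain string logic; every name and pattern below is a made-up example.

```python
# Minimal illustration of deterministic pre-dispatch policy enforcement;
# this is NOT the Agent Governance Toolkit API, just the idea behind it.
ALLOWED_TOOLS = {"get_sop_gap_report", "file_regulatory_report"}
BLOCKED_PATTERNS = ("ignore previous instructions", "system prompt")

def enforce_policy(tool_name: str, user_input: str) -> bool:
    """Return True only if the tool is allowlisted and the input contains
    no known prompt-injection markers. Pure string checks, so the decision
    is deterministic and effectively instantaneous."""
    if tool_name not in ALLOWED_TOOLS:
        return False
    lowered = user_input.lower()
    return not any(pattern in lowered for pattern in BLOCKED_PATTERNS)
```

Because no model inference is involved, the check always yields the same verdict for the same input, which is what makes sub-millisecond enforcement feasible.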
Requirements:
3–5+ years of experience in Deep Learning, NLP, and AI Systems Engineering.
Strong proficiency in Python, PyTorch, and the Hugging Face ecosystem.
Proven hands-on experience with LLM/SLM fine-tuning techniques (LoRA, QLoRA) and quantization.
Deep understanding of inference servers (specifically vLLM) and GPU memory optimization (KV caching, PagedAttention).
Experience building autonomous AI agents and utilizing JSON schemas for strict output decoding.