Mysore, Karnataka, India
Information Technology
Full-Time
RCM Business Solutions
Job Overview
We are seeking a highly skilled and motivated LLM Evaluation Framework Developer to design, build, and maintain robust frameworks for evaluating large language models (LLMs). You will work closely with ML researchers, engineers, and product teams to define metrics, automate evaluations, integrate datasets, and ensure model behaviour aligns with safety, quality, and performance expectations.
Key Responsibilities
- Design and implement evaluation frameworks for benchmarking LLMs across dimensions such as accuracy, robustness, reasoning, safety, and hallucination.
- Integrate and customize tools such as Giskard, RAGAS, DeepEval, Opik/Comet, TruLens, or similar.
- Define and implement custom metrics for specific use cases such as RAG, agent performance, and guardrails compliance (a minimal sketch follows this list).
- Curate or generate high-quality evaluation datasets for various domains (e.g., medical, finance, legal, general QA, code generation).
- Collaborate with LLM application developers to instrument tracing and logging that capture model behaviour in real-world flows.
- Implement dashboarding and reporting to visualize performance trends, regressions, and comparisons across model versions.
- Evaluate model responses using structured prompts, chain-of-thought techniques, adversarial tests, and A/B comparisons.
- Support red-teaming and stress-testing efforts to identify vulnerabilities or ethical risks in model outputs.
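For illustration, a custom metric of the kind this role would own might look like the framework-agnostic sketch below. All names in it (`EvalCase`, `token_overlap_faithfulness`, `run_suite`) are hypothetical and are not part of Giskard, RAGAS, DeepEval, or any other tool named above.

```python
# Framework-agnostic sketch of a custom RAG faithfulness metric.
# EvalCase, token_overlap_faithfulness, and run_suite are hypothetical
# names for illustration only.
from dataclasses import dataclass

@dataclass
class EvalCase:
    question: str
    answer: str          # model output under evaluation
    contexts: list[str]  # passages retrieved for a RAG flow

def token_overlap_faithfulness(case: EvalCase) -> float:
    """Fraction of answer tokens grounded in the retrieved context.

    A crude lexical proxy: real frameworks typically use LLM-as-judge
    scoring, but the metric interface looks much the same.
    """
    answer_tokens = set(case.answer.lower().split())
    context_tokens = set(" ".join(case.contexts).lower().split())
    if not answer_tokens:
        return 0.0
    return len(answer_tokens & context_tokens) / len(answer_tokens)

def run_suite(cases: list[EvalCase], threshold: float = 0.7) -> dict:
    """Score every case and report the pass rate against a threshold."""
    if not cases:
        return {"pass_rate": 0.0, "scores": []}
    scores = [token_overlap_faithfulness(c) for c in cases]
    return {"pass_rate": sum(s >= threshold for s in scores) / len(cases),
            "scores": scores}

if __name__ == "__main__":
    demo = [EvalCase(
        question="What is the capital of France?",
        answer="Paris is the capital of France.",
        contexts=["Paris is the capital and largest city of France."],
    )]
    print(run_suite(demo))
```

Running the file prints the suite's pass rate for the demo case; in production the lexical check would be swapped for a judge model or one of the tools listed above.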
Required Skills & Qualifications
Core Technical Skills:
- Proficiency in Python with experience in NLP and ML/LLM libraries (e.g., Hugging Face, LangChain, OpenAI SDK, Cohere).
- Experience building evaluation pipelines or benchmarks for ML/LLM systems.
- Familiarity with RAG evaluation, agentic evaluation, safety/guardrail testing, and LLM performance metrics.
- Strong grasp of prompt engineering, retrieval techniques, and generative model behaviour.
- Hands-on experience with evaluation tooling such as Giskard, RAGAS, DeepEval, TruLens, LangSmith, Opik/Comet, Weights & Biases, or similar.
- Working knowledge of vector stores (e.g., FAISS, Weaviate, Pinecone) and embedding-based evaluation.
- Familiarity with CI/CD pipelines and with unit and integration testing for LLM apps (see the sketch after this list).
- Understanding of data versioning, model versioning, and test reproducibility.
- Prior experience developing or maintaining LLM-based applications (chatbots, copilots, RAG systems).
- Background in ML research, applied NLP, or machine learning infrastructure.
- Exposure to LLM guardrails design (e.g., jailbreaking prevention, content filtering).
- Experience with open-source contribution in the LLM evaluation or tooling space.
- Strong communication and documentation abilities.
- Comfort working in ambiguous, fast-paced, and research-heavy environments.
- Passion for ensuring LLM reliability, safety, and responsible deployment.
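As a hedged illustration of the CI/CD point above, the pytest sketch below gates builds on a small golden set. `generate_answer` is a hypothetical stand-in for the application under test, and `GOLDEN_CASES` is toy inline data; neither comes from a specific library.

```python
# Hypothetical CI regression gate for an LLM app, written with pytest.
import pytest

GOLDEN_CASES = [
    {"question": "What is 2 + 2?", "must_contain": ["4"]},
    {"question": "Name the capital of France.", "must_contain": ["paris"]},
]

def generate_answer(question: str) -> str:
    """Stand-in for the application under test (chatbot, copilot, RAG pipeline)."""
    raise NotImplementedError("wire this to your LLM application")

@pytest.mark.parametrize("case", GOLDEN_CASES)
def test_answer_contains_required_facts(case):
    # A failing assertion here fails the CI pipeline, blocking a new
    # model or prompt version from shipping with a regression.
    answer = generate_answer(case["question"]).lower()
    for fact in case["must_contain"]:
        assert fact in answer
```

In practice the golden set would live in a versioned dataset file rather than inline, so results stay reproducible across model versions.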