Back to Jobs

4 Days ago

AI LLM Test Engineer

Apply Now

Jaipur, Rajasthan, India

Information Technology

Full-Time

Sagebridge pvt ltd

Overview

Job Responsibilities:

· Design, develop, and execute automated evaluation suites and test cases specifically targeting AI/LLM components, focusing on aspects like response quality, factual accuracy, safety, and task completion.

· Implement and manage batch testing processes using curated datasets to assess model performance, identify regressions, and benchmark different model versions or prompts.

· Develop, maintain, and enhance test and evaluation frameworks using libraries such as Promptflow, DeepEval, Ragas, and similar LLM evaluation tools.

· Define and implement comprehensive test strategies to evaluate LLM outputs for accuracy, relevance, coherence, safety (toxicity, bias), hallucination reduction, and consistency, using both automated metrics and potentially qualitative review processes.

· Collaborate closely with developers, data scientists, and prompt engineers to understand model behavior, identify edge cases, potential biases, and failure modes in AI models and agents

· Test and validate components of Retrieval-Augmented Generation (RAG) pipelines, including retriever performance, chunking strategies, and generator quality.

· Evaluate the end-to-end functionality and performance of AI-driven workflows within telecom applications against defined benchmarks.

· Continuously research and improve testing methodologies and metrics for AI/LLM applications, incorporating industry best practices in automated evaluation and validation.

· Document evaluation results and findings, providing actionable feedback to development teams to enhance AI model robustness, reliability, and overall quality.

Job Type: Full-time

Pay: ₹9,486.26 - ₹49,562.64 per month

Benefits:

Work from home

Schedule:

Monday to Friday

Application Question(s):

Should have knowledge of AI/ML/LLM development

Experience:

Python: 2 years (Required)
Selenium Automation: 2 years (Required)
Promptflow or DeepEval or Ragas,: 1 year (Preferred)
Machine learning/LLM: 2 years (Required)
REST API: 2 years (Preferred)
Test Strategy: 2 years (Preferred)

Work Location: Remote

Share job

Similar Jobs

View All

15 Hours ago

Java Developer – Payments Domain

Information Technology

4 - 7 Yrs
Mumbai (All Areas)

We are hiring Java Developers with 4–6 years of hands-on experience in backend development, particularly within the Payments or FinTech domain. The ideal candidate should possess a strong foundation in Java technologies and be capable of working in a...

More info

16 Hours ago

SAP Functional Architect

Information Technology

40,00,000 - 45,00,000 INR - Annual
12 - 15 Yrs
Bangalore, Chennai

We are seeking an experienced SAP Pre-Sales Architect with a strong functional background and deep expertise in at least one SAP functional area. The ideal candidate will have extensive knowledge of cross-module integrations and a proven track record...

More info

17 Hours ago

Senior React Native Developer

Information Technology

7 - 12 Yrs
Jaipur

The NineHertz is on the lookout for a Senior React Native Developer who is passionate about mobile app development and thrives in a fast-paced environment. This is a fantastic opportunity to work with a dynamic team, drive innovation, and help delive...

More info

19 Hours ago

Senior Data & AI Analytics Engineer (Remote)

AI & Machine Learning Advancement

18,00,000 - 24,00,000 INR - Annual
5 - 8 Yrs
Pune

Job Ref: NT-DAAI-003 Experience: 5–8 years Client: A prestigious AI-first tech company Notice: Early joiners preferred (Immediate- 30 days) We are hiring on behalf of a prestigious AI-first technology client for a Senior Data & AI Analytics En...

More info

19 Hours ago

AI Engineering Manager (Remote)

Information Technology

40,00,000 - 50,00,000 INR - Annual
10 - 15 Yrs
Pune

Experience: 10 to 15 years Location: Remote Notice Period: Immediate to 30 days preferred Client: Leading mid-sized firm specializing in AI-driven solutions Overview: We are looking for an AI Engineering Manager to lead a dynamic team of ...

More info

20 Hours ago

Senior Generative AI Engineer

Information Technology

6 - 10 Yrs
Anywhere in India/Multiple Locations

Experience: 6 to 10 relevent years Location: Remote Notice Period: Immediate to 30 days preferred Client: India based prestigious enterprise in the AI domain Overview: We are seeking a seasoned Generative AI Engineer to spearhead the devel...

More info

2 Days ago

QA Engineer (Manual & Automation Testing)

Information Technology

Noida, Uttar Pradesh, India

About 23 Ventures 23 Ventures specializes in building technology to help startups and early-stage ideas achieve product-market fit, scale, and stay focused. We partner with startups and early-stage ideas to provide resources, practical advice, and e...

More info

2 Days ago

Senior Full Stack Developer - Node.js/Express.js

Information Technology

Noida, Uttar Pradesh, India

Job OverviewWe are looking for a Full-Stack Developer with 4+ years of experience in software development.ResponsibilitiesThe ideal candidate will be proficient in both frontend and backend technologies, capable of building scalable and high-perform...

More info

Talk to us

Feel free to call, email, or hit us up on our social media accounts.

Email info@antaltechjobs.in