AI LLM Test Engineer | Antal Tech Jobs
4 Days ago

AI LLM Test Engineer

Jaipur, Rajasthan, India
Information Technology
Full-Time
Sagebridge pvt ltd

Overview

Job Responsibilities:

· Design, develop, and execute automated evaluation suites and test cases specifically targeting AI/LLM components, focusing on aspects like response quality, factual accuracy, safety, and task completion.

· Implement and manage batch testing processes using curated datasets to assess model performance, identify regressions, and benchmark different model versions or prompts.

· Develop, maintain, and enhance test and evaluation frameworks using libraries such as Promptflow, DeepEval, Ragas, and similar LLM evaluation tools.

· Define and implement comprehensive test strategies to evaluate LLM outputs for accuracy, relevance, coherence, safety (toxicity, bias), hallucination, and consistency, using automated metrics and, where appropriate, qualitative review.

· Collaborate closely with developers, data scientists, and prompt engineers to understand model behavior and to identify edge cases, potential biases, and failure modes in AI models and agents.

· Test and validate components of Retrieval-Augmented Generation (RAG) pipelines, including retriever performance, chunking strategies, and generator quality.

· Evaluate the end-to-end functionality and performance of AI-driven workflows within telecom applications against defined benchmarks.

· Continuously research and improve testing methodologies and metrics for AI/LLM applications, incorporating industry best practices in automated evaluation and validation.

· Document evaluation results and findings, providing actionable feedback to development teams to enhance AI model robustness, reliability, and overall quality.

Job Type: Full-time

Pay: ₹9,486.26 - ₹49,562.64 per month

Benefits:

  • Work from home

Schedule:

  • Monday to Friday

Application Question(s):

  • Should have knowledge of AI/ML/LLM development

Experience:

  • Python: 2 years (Required)
  • Selenium Automation: 2 years (Required)
  • Promptflow, DeepEval, or Ragas: 1 year (Preferred)
  • Machine learning/LLM: 2 years (Required)
  • REST API: 2 years (Preferred)
  • Test Strategy: 2 years (Preferred)

Work Location: Remote

