Senior Test Engineer-GenAI Testing | Antal Tech Jobs
3 Days ago

Senior Test Engineer-GenAI Testing

Pune, Maharashtra, India
Information Technology
Full-Time
Kongsberg Digital

Overview

About The Role

We are seeking a passionate and forward-thinking Senior QA Engineer to lead quality assurance for Generative AI (GenAI) solutions embedded within our Digital Twin platform. This high-impact role goes beyond traditional QA, focusing on the nuanced evaluation, reliability, and guardrails of AI-powered systems in production.

You will be responsible not just for testing, but also for establishing evaluation frameworks, defining AI quality benchmarks, and upskilling other QA engineers in GenAI testing methods. The ideal candidate brings a mix of structured QA discipline, hands-on familiarity with GenAI systems (LLMs, RAG, agents), and a strong sense of ownership.

Key Responsibilities

  • Design and implement end-to-end QA strategies for Node.js applications integrated with LLMs, retrieval-augmented generation (RAG), and Agentic AI workflows.
  • Establish comprehensive benchmarks and quality metrics for GenAI components including accuracy, coherence, relevance, stability, and safety.
  • Develop structured evaluation datasheets for LLM behaviour validation: test prompts, expected responses, classification criteria, and scoring rubrics.
  • Perform data quality testing for RAG databases and ensure relevant, high-quality retrieval to minimize hallucinations and improve grounding.
  • Conduct A/B testing across model versions, prompt designs, and system configurations to measure and compare output quality.
  • Define methodologies and simulate non-deterministic behaviours using Agentic AI testing techniques.
  • Collaborate closely with developers, product owners, and AI engineers to test prompt engineering pipelines, function-calling interfaces, and fallback logic.
  • Build QA automation where applicable and integrate GenAI evaluations into CI/CD pipelines.
  • Lead internal capability development by mentoring QA peers on GenAI testing practices and helping evolve the organization’s AI quality maturity.


Required Skills And Qualifications

  • 6+ years of experience in software quality assurance, including at least 3 years working in or around GenAI or LLM-based systems.
  • Deep understanding of GenAI quality dimensions: response grounding, factual correctness, context awareness, and hallucination minimization.
  • Experience creating and maintaining LLM evaluation datasets and designing test cases for dynamic prompt behaviour.
  • Hands-on experience with tools and techniques for testing retrieval pipelines, embedding quality, and vector similarity results in RAG architectures.
  • Familiarity with non-deterministic testing strategies, agent loop evaluation, and multi-step LLM task validation.
  • Comfortable working with APIs, logs, test scripts, and tracing tools to validate both system and AI behaviour.
  • Strong analytical thinking and a methodical approach to identifying bugs, regressions, and inconsistencies in AI outputs.
  • Bachelor’s or master’s degree in engineering.


Preferred Skills

  • Experience with GenAI tools/platforms like OpenAI, LangChain, Semantic Kernel, Hugging Face, Pinecone, or Weaviate.
  • Exposure to evaluating LLMs in production settings, including safety nets, guardrails, and red-teaming approaches.
  • Familiarity with prompt tuning, few-shot learning, and function/tool calling in LLMs.
  • Basic scripting knowledge (Python, JavaScript, or TypeScript) for building test harnesses or validation utilities.