Free cookie consent management tool by TermsFeed Performance Tester – GenAI | Antal Tech Jobs
Back to Jobs
3 Days ago

Performance Tester – GenAI

decor
Bangalore, Karnataka, India
Information Technology
Full-Time
Orion Innovation

Overview

Orion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.

Role: Performance Test Engineer – Generative AI

Experience: 5+ years (with hands-on performance testing in GenAI / LLM-based applications)

Role Overview

We are seeking a skilled and detail-oriented Performance Tester with strong experience in Generative AI (GenAI) projects. The ideal candidate will be responsible for ensuring scalability, reliability, and optimal performance of AI-powered applications, including Large Language Model (LLM) integrations, conversational AI systems, and Retrieval-Augmented Generation (RAG) pipelines. This role requires expertise in performance engineering, cloud platforms, and testing of AI/ML workloads in production environments.

Key Responsibilities

  • Performance Strategy & Planning:
  • Define and implement performance testing strategies for GenAI and LLM-based applications.
  • Identify performance bottlenecks across APIs, model inference layers, vector databases, and cloud infrastructure.
  • Establish performance benchmarks, SLAs, and scalability targets for AI-driven systems.
  • Performance Testing & Engineering:
  • Design, develop, and execute load, stress, spike, endurance, and scalability tests for GenAI applications.
  • Perform performance testing of LLM-powered APIs (e.g., ChatGPT-like applications) hosted on cloud platforms.
  • Validate latency, throughput, token usage, concurrency handling, and cost-performance trade-offs.
  • Conduct performance validation for RAG pipelines including embedding generation and vector search.
  • Analyze model inference time, GPU/CPU utilization, memory usage, and autoscaling behavior.
  • Tools & Automation:
  • Develop automated performance test scripts using tools such as JMeter, LoadRunner, k6, or Gatling.
  • Monitor system performance using APM tools like Dynatrace, AppDynamics, Azure Monitor, or AWS CloudWatch.
  • Integrate performance testing into CI/CD pipelines using Azure DevOps or similar platforms.
  • Create dashboards and reports for performance metrics and trend analysis.
  • Cloud & Infrastructure Testing:
  • Conduct performance testing on AI solutions deployed on Azure, AWS, or GCP.
  • Validate autoscaling configurations, containerized deployments (Docker, Kubernetes), and serverless architectures.
  • Assess performance of vector databases such as Chroma, Pinecone, Weaviate, or FAISS under load.
  • Collaboration & Optimization:
  • Collaborate with AI engineers, data scientists, DevOps, and architects to optimize model serving and API performance.
  • Recommend improvements in prompt engineering, caching strategies, batching, and parallelization.
  • Support capacity planning and cost optimization for LLM-based applications.
  • Governance & Reporting:
  • Document performance test results, bottlenecks, and optimization recommendations.
  • Ensure compliance with security and data privacy standards in performance environments.
  • Present findings to stakeholders and provide actionable insights.


Key Requirements

  • Technical Skills:
  • 5+ years of experience in Performance Testing and Engineering.
  • Hands-on experience in performance testing GenAI / LLM-based applications.
  • Experience working with LLM platforms such as OpenAI GPT models, Gemini, Llama 2, Claude, or Grok.
  • Understanding of concepts like tokenization, embeddings, vector search, and RAG architecture.
  • Experience testing AI services hosted on Azure AI Services, Azure ML, AWS Bedrock, or Google Vertex AI.
  • Proficiency in performance testing tools such as JMeter, LoadRunner, k6, or Gatling.
  • Knowledge of API testing tools like Postman or Rest Assured.
  • Familiarity with monitoring tools such as Azure Monitor, AWS CloudWatch, Grafana, or Prometheus.
  • Experience with containerization (Docker) and orchestration (Kubernetes).
  • Basic scripting knowledge in Python or Java for test automation.
  • Understanding of CI/CD pipelines and DevOps practices.
  • GenAI-Specific Knowledge:
  • Experience testing conversational AI applications and chatbot performance.
  • Knowledge of inference latency optimization techniques for LLMs.
  • Understanding of GPU-based workloads and performance considerations.
  • Exposure to agentic frameworks like LangChain, Semantic Kernel, AutoGen, or CrewAI (preferred).
  • Experience validating performance of vector databases (Chroma, Pinecone, Weaviate, FAISS).


Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or related field.
  • 5+ years of experience in performance testing, with at least 2 years in AI/ML or GenAI projects.
  • Experience in testing cloud-native, microservices-based applications.
  • Strong analytical and troubleshooting skills.
  • Excellent communication and stakeholder management skills.


Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC And Its Subsidiaries And Its Affiliates (collectively, “Orion,” “we” Or “us”) Are Committed To Protecting Your Privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) Explains

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.


Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.

Share job
Similar Jobs
View All
14 Hours ago
Microsoft Dynamics 365 F&O Functional Consultant
Information Technology
  • 3 - 7 Yrs
  • Pune
Job Summary: The Associate, IT ERP Specialist is responsible for providing support to internal and external users to use CECO’s ERP effectively to fulfill business objectives. The Associate, IT ERP Specialist will assist other IT ERP Specialists with...
decor
1 Day ago
DevOps Engineer
Information Technology
  • 4 - 7 Yrs
  • Chennai
Role Profile We are looking for a DevOps Engineer, this role combines the management application systems, deployment processes to ensure accurate and efficient releases of new features and the maintenance of uptime, performance, and reliability. ...
decor
1 Day ago
Full Stack Developer
Information Technology
  • 260000 - 350000 INR - Yearly
  • Bangalore, Karnataka, India
About the job:As a Full Stack Developer at Serpwize Technologies LLP, you will have the opportunity to work with a cutting-edge tech stack that includes PHP, MySQL, HTML, CSS, JavaScript, Python, MongoDB, AngularJS, Node.js, React, Bubble.io, WordPre...
decor
1 Day ago
Data Scientist - L2 - BLR
Information Technology
  • Bangalore, Karnataka, India
About CoffeeBeans ConsultingCoffeeBeans is a tech-driven software consulting company that helps businesses solve complex problems using modern data, AI, and engineering solutions. We blend deep technical expertise with a product mindset to build scal...
decor
1 Day ago
Full Stack Developer (AI Focus) Internship in Jammu, Himachal Pradesh, Haryana, Sahibzada Ajit Singh Nagar, Chandigarh, Punjab, Uttarakhand, Mohali, Chandigarh, Mohali
Information Technology
  • Bangalore, Karnataka, India
We are looking for a passionate and driven full stack development intern who has a strong interest in AI tools and modern development practices. This internship is ideal for candidates who love building projects, experimenting with AI-powered tools, ...
decor
1 Day ago
Interesting Job Opportunity: PHP Developer - MVC Frameworks
Information Technology
  • Bangalore, Karnataka, India
Mumbai (Andheri East) | 5 Days Work From OfficeWe're looking for an experienced Senior PHP Developer to build scalable, high-performance web applications.Roles And Responsibilities Create clean, robust, well performing, DRY code in PHP/ JavaScript, M...
decor
1 Day ago
Orangebits - Data Engineer - Google Cloud Platform
Information Technology
  • Bangalore, Karnataka, India
DescriptionExperience : 5 to 10 YearsLocation : HyderabadWork Mode : Hybrid (3 days WFO in a week)Key Responsibilities Design, develop, and maintain data pipelines and data processing workflows on GCP. Implement scalable solutions using BigQuery and ...
decor
1 Day ago
Software Engineer
Information Technology
  • Bangalore, Karnataka, India
Job Title: Software EngineerExperience: 2-4 yearsLocation: BangaloreDepartment: Product & TechnologyEmployment Type: Full Time, PermanentAbout Digitap.ai:DIGITAP.AI is an Enterprise SaaS company providing high-tech advanced AI/ML, Alternate Data Solu...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media