Free cookie consent management tool by TermsFeed Senior MLOps Technical Lead | Antal Tech Jobs
Back to Jobs
3 Days ago

Senior MLOps Technical Lead

decor
Chennai, TN, India
Information Technology
Other
HCLTech

Overview

Noida, Uttar Pradesh
Job Summary

Overview Senior‑level role on a five‑person engineering team building a production‑grade healthcare conversational GenAI platform. The stack centers on Python 3.12+ , FastAPI , and Google ADK 2.0 as the primary multi‑agent orchestration and structured‑output toolkit, deployed on Azure . The role emphasizes backend service development, GenAI workflow engineering, secure integrations, and operational ownership for patient‑facing conversational experiences. Key Responsibilities Design, implement, and maintain
backend APIs and services using Python 3.12+ and FastAPI. Develop and operate
multi‑agent GenAI workflows using Google ADK 2.0 as the primary orchestration framework. Integrate and tune
LLM providers and GenAI toolkits (OpenAI, Anthropic, Google). Support and extend
workflows built with other orchestration frameworks (LangGraph, LangChain, PydanticAI) and ensure interoperability. Implement RAG, prompt engineering, structured output validation, and AI guardrails
to ensure safe, reliable model behavior. Build domain task handlers
for healthcare workflows (care tasks, medications, scheduling) and integrate with clinical systems as required. Leverage Azure services
(Cosmos DB, App Configuration, Key Vault, Event Hubs, Application Insights) for data, configuration, secrets, and telemetry. Ensure API security and PHI protection
using JWT/OAuth and security best practices. Contribute to architecture, code reviews, production support, observability, and incident response. Author and maintain automated tests
(pytest) and support CI/CD and containerized deployments. Required Qualifications 7+ years
professional software development experience.Strong production experience in Python and asynchronous frameworks ( FastAPI preferred). Hands‑on experience with Google ADK 2.0
for multi‑agent orchestration and structured LLM outputs.Familiarity with other LLM orchestration frameworks (LangGraph, LangChain, PydanticAI) and ability to work across toolchains.Demonstrated knowledge of prompt engineering , RAG , structured output validation, and AI safety/guardrail patterns.Experience integrating OpenAI, Anthropic, Google, and Google ADK 2.0 .Practical experience with Azure services (Cosmos DB, Key Vault, App Configuration).Familiarity with Docker , CI/CD (e.g., GitHub Actions), and containerized production deployments.Experience with event‑driven architectures and workflow engines.Strong understanding of JWT/OAuth and API security best practices.Proven ability to operate with high ownership in a small, fast‑paced team. Preferred Qualifications Healthcare domain experience (HIPAA, HL7/FHIR, Epic integrations).Experience with PydanticAI or equivalent structured output validation tools.Familiarity with OpenTelemetry and observability tooling.Exposure to React and TypeScript for occasional full‑stack contributions.

Key Responsibilities
1. Implement and optimize ML pipelines using MLflow, Kubeflow Pipelines, and TFX, enabling automated model training, validation, and deployment.
2. Integrate DevOps practices with Python scripting to automate infrastructure provisioning via Terraform, AWS CloudFormation, and Ansible for scalable ML environments.
3. Configure and maintain CI/CD workflows using Jenkins, GitLab CI/CD, CircleCI, and GitHub Actions to streamline code integration and deployment for ML projects.
4. Monitor and analyze ML system performance using Prometheus, Grafana, ELK Stack, and Fluentd, ensuring reliability and rapid issue resolution.
5. Apply advanced proficiency in Git, GitHub, GitLab, and Bitbucket for source code management and collaboration within the development team.
6. Participate in technical reviews, contribute to process compliance, and support feasibility studies by evaluating technical alternatives and risks for ML solutions.
7. Prepare and submit project status reports, collaborating with internal stakeholders to define deliverables and minimize escalation risks.
Skill Requirements
1. Advanced Proficiency In Ml Ops, Including Mlflow, Kubeflow Pipelines, Tfx, And Metaflow.
2. Advanced Proficiency In Devops Tools Such As Terraform, Aws Cloudformation, Ansible, Jenkins, Gitlab Ci/Cd, Circleci, And Github Actions.
3. Advanced Proficiency In Python For Automation, Scripting, And Ml Pipeline Development.
4. Advanced Proficiency In Monitoring And Logging Tools: Prometheus, Grafana, Elk Stack, Fluentd.
5. Advanced Proficiency In Version Control Systems: Git, Github, Gitlab, Bitbucket.
6. Solid Understanding Of Cloud Infrastructure And Deployment Strategies.
7. Solid Ability To Troubleshoot, Optimize, And Maintain Ml Environments.

Other Requirements

Overview Senior‑level role on a five‑person engineering team building a production‑grade healthcare conversational GenAI platform. The stack centers on Python 3.12+ , FastAPI , and Google ADK 2.0 as the primary multi‑agent orchestration and structured‑output toolkit, deployed on Azure . The role emphasizes backend service development, GenAI workflow engineering, secure integrations, and operational ownership for patient‑facing conversational experiences. Key Responsibilities Design, implement, and maintain
backend APIs and services using Python 3.12+ and FastAPI. Develop and operate
multi‑agent GenAI workflows using Google ADK 2.0 as the primary orchestration framework. Integrate and tune
LLM providers and GenAI toolkits (OpenAI, Anthropic, Google). Support and extend
workflows built with other orchestration frameworks (LangGraph, LangChain, PydanticAI) and ensure interoperability. Implement RAG, prompt engineering, structured output validation, and AI guardrails
to ensure safe, reliable model behavior. Build domain task handlers
for healthcare workflows (care tasks, medications, scheduling) and integrate with clinical systems as required. Leverage Azure services
(Cosmos DB, App Configuration, Key Vault, Event Hubs, Application Insights) for data, configuration, secrets, and telemetry. Ensure API security and PHI protection
using JWT/OAuth and security best practices. Contribute to architecture, code reviews, production support, observability, and incident response. Author and maintain automated tests
(pytest) and support CI/CD and containerized deployments. Required Qualifications 7+ years
professional software development experience.Strong production experience in Python and asynchronous frameworks ( FastAPI preferred). Hands‑on experience with Google ADK 2.0
for multi‑agent orchestration and structured LLM outputs.Familiarity with other LLM orchestration frameworks (LangGraph, LangChain, PydanticAI) and ability to work across toolchains.Demonstrated knowledge of prompt engineering , RAG , structured output validation, and AI safety/guardrail patterns.Experience integrating OpenAI, Anthropic, Google, and Google ADK 2.0 .Practical experience with Azure services (Cosmos DB, Key Vault, App Configuration).Familiarity with Docker , CI/CD (e.g., GitHub Actions), and containerized production deployments.Experience with event‑driven architectures and workflow engines.Strong understanding of JWT/OAuth and API security best practices.Proven ability to operate with high ownership in a small, fast‑paced team. Preferred Qualifications Healthcare domain experience (HIPAA, HL7/FHIR, Epic integrations).Experience with PydanticAI or equivalent structured output validation tools.Familiarity with OpenTelemetry and observability tooling.Exposure to React and TypeScript for occasional full‑stack contributions.

#body.unify div.unify-button-container .unify-apply-now: focus, #body.unify div.unify-button-container .unify-apply-#body.unify div.unify-button-container .unify-apply-now: focus, #body.unify div.unify-button-container .unify-apply-

Share job
Similar Jobs
View All
1 Day ago
Network Engineer (WLAN / Switching / Software)
Information Technology
  • Chennai, TN, India
Network Engineer (WLAN / Switching / Software)This role has been designed as ‘Hybrid’ with an expectation that you will work on average 2 days per week from an HPE office. Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company adv...
decor
3 Days ago
Senior Web UI Developer - HTML, JavaScript, React.js
Information Technology
  • Chennai, TN, India
Chennai, Tamil Nadu Job Summary The Senior React.js Developer is responsible for developing, enhancing, and maintaining high-quality web applications that meet both client requirements and organizational standards. This role plays a crucial part in ...
decor
3 Days ago
AWS Devops Engineer
Information Technology
  • Chennai, TN, India
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be ab...
decor
3 Days ago
Senior Automation Test Lead Embedded
Information Technology
  • Chennai, TN, India
Lucknow, Uttar Pradesh Job Summary The Senior Test Lead will be responsible for overseeing the testing activities related to Testing Tools and Test Automation (EMB), selenium, java. The primary objective will be to ensure the quality and efficiency o...
decor
3 Days ago
Technical Lead-Cloud & Infra Engg
Information Technology
  • Chennai, TN, India
Country/Region: IN Requisition ID: 37003 Work Model: Position Type: Salary Range: Location: INDIA - CHENNAI - BIRLASOFT OFFICE Title: Technical Lead-Cloud & Infra Engg Description: Area(s) of responsibility Architecture & Solution Design Arch...
decor
3 Days ago
IT System Administrator Technical Specialist
Information Technology
  • Chennai, TN, India
About Us Ribbon Communications (Nasdaq: RBBN) delivers communications software, IP and optical networking solutions to service providers, enterprises and critical infrastructure sectors globally. We engage deeply with our customers, helping them mode...
decor
3 Days ago
Senior Selenium Automation Tester - Cucumber, Java
Information Technology
  • Chennai, TN, India
Hyderabad, Telangana Job Summary The senior automation tester with expertise in cucumber, selenium, and Java will be responsible for developing, and executing automated tests to ensure the quality of software applications. This role will involve coll...
decor
3 Days ago
Lead Software Engineer
Information Technology
  • Chennai, TN, India
Responsibilities A Senior Deveoper delivers features for a Product in a business chain. As member of a Feature Team, he works in autonomy and in a continuous improvement approach. Generic Skills: Requirement analysis should understand user stories D...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media