Information Technology
Full-Time
Muoro
Role Overview:
We are seeking a highly skilled Software Engineer specializing in Large Language Models (LLMs) to design, develop, and deploy cutting-edge AI solutions leveraging state-of-the-art transformer architectures.
- The ideal candidate will have strong expertise in deep learning, NLP, and model optimization, combined with software engineering best practices for building scalable AI systems in production.
- You'll collaborate with data scientists, ML engineers, and product teams to build intelligent applications powered by advanced generative AI models such as GPT, LLaMA, Falcon, Mistral, Claude, or similar open-source and proprietary models.
- Design, train, fine-tune, and evaluate Large Language Models (LLMs) for specific use cases (e.g., summarization, code generation, chatbots, reasoning, and retrieval-augmented generation).
- Experiment with transformer-based architectures (e.g., GPT, T5, BERT, LLaMA, Mistral).
- Develop parameter-efficient fine-tuning (PEFT) strategies such as LoRA, QLoRA, adapters, or prompt-tuning (a minimal LoRA sketch appears after this list).
- Create and maintain high-quality datasets for pretraining, fine-tuning, and evaluation.
- Optimize model inference using techniques like quantization, distillation, and tensor parallelism for real-time or edge deployment.
- Integrate LLMs into production environments using frameworks like Hugging Face Transformers, PyTorch Lightning, or DeepSpeed.
- Implement scalable model serving solutions using FastAPI, Ray Serve, Triton Inference Server, or similar frameworks (see the serving sketch after this list).
- Build and maintain APIs or SDKs that expose LLM capabilities to other teams and products.
- Evaluate and experiment with open-source and proprietary foundation models.
- Keep up with the latest trends in Generative AI, NLP, and Transformer models.
- Perform benchmarking, ablation studies, and A/B testing to measure performance, cost, and quality improvements.
- Collaborate with MLOps and DevOps teams to design CI/CD pipelines for model training and deployment.
- Manage and optimize GPU/TPU clusters for distributed training and inference.
- Implement robust monitoring, logging, and alerting for deployed AI systems.
- Ensure software follows clean code principles, version control, and proper documentation.
- Partner with product managers, data scientists, and UX teams to identify and translate business problems into AI-driven solutions.
- Contribute to internal research initiatives and help shape the company's AI strategy.
- Mentor junior engineers in AI model development, coding standards, and best practices.
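To make the fine-tuning work above concrete, here is a minimal sketch of wrapping a causal language model with a LoRA adapter via the Hugging Face peft library; the checkpoint name, rank, and target modules below are illustrative assumptions, not project requirements.

# Minimal LoRA sketch (assumed checkpoint and hyperparameters).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, TaskType, get_peft_model

base_model = "meta-llama/Llama-2-7b-hf"  # assumed example checkpoint; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(base_model)

# LoRA trains small low-rank update matrices instead of all base weights.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # rank of the low-rank updates
    lora_alpha=16,                        # scaling factor applied to the updates
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections; model-dependent
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()        # typically well under 1% of base parameters

The wrapped model can then be trained with a standard Trainer or PyTorch Lightning loop, and only the small adapter weights need to be saved and shipped.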
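The serving item is sketched below with FastAPI wrapping a Hugging Face text-generation pipeline; the route, request schema, and demo model are hypothetical placeholders rather than a prescribed design.

# Minimal FastAPI serving sketch (assumed route and demo model).
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

app = FastAPI()
generator = pipeline("text-generation", model="gpt2")  # assumed small demo model

class GenerateRequest(BaseModel):
    prompt: str
    max_new_tokens: int = 64

@app.post("/generate")
def generate(req: GenerateRequest):
    # Synchronous generation; a production setup would add batching, streaming, and auth.
    outputs = generator(req.prompt, max_new_tokens=req.max_new_tokens)
    return {"completion": outputs[0]["generated_text"]}

This could be launched with a standard ASGI server, e.g. uvicorn main:app, assuming the file is named main.py.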
Core Expertise:
- Strong proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, JAX).
- Hands-on experience with transformer architectures and LLM fine-tuning.
- Deep understanding of tokenization, attention mechanisms, embeddings, and sequence modeling.
- Experience with Hugging Face Transformers, LangChain, LlamaIndex, or OpenAI API.
- Experience deploying models using Docker, Kubernetes, or cloud ML services (AWS SageMaker, GCP Vertex AI, Azure ML, OCI Data Science).
- Familiarity with model optimization (quantization, pruning, distillation).
- Knowledge of retrieval-augmented generation (RAG) pipelines and vector databases (FAISS, Pinecone, Weaviate, Chroma); a minimal retrieval sketch follows this list.
- Experience with multi-modal models (text + image, text + code).
- Familiarity with MLOps tools like MLflow, Kubeflow, or Weights & Biases (W&B).
- Understanding of Responsible AI practices: bias mitigation, data privacy, and model explainability.
- Experience contributing to open-source AI projects.
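As a rough illustration of the RAG knowledge listed above, here is a minimal retrieval sketch assuming sentence-transformers embeddings and a FAISS inner-product index; the corpus, embedding model, and query are placeholder assumptions.

# Minimal RAG retrieval sketch (assumed corpus and embedding model).
import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

documents = [
    "LoRA trains low-rank adapters instead of full model weights.",
    "Quantization reduces memory use by lowering numeric precision.",
    "RAG augments prompts with passages retrieved from a vector index.",
]

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
embeddings = encoder.encode(documents, normalize_embeddings=True)

# Inner product over normalized vectors is cosine similarity.
index = faiss.IndexFlatIP(embeddings.shape[1])
index.add(np.asarray(embeddings, dtype="float32"))

query = encoder.encode(["How does retrieval-augmented generation work?"],
                       normalize_embeddings=True)
scores, ids = index.search(np.asarray(query, dtype="float32"), 2)
context = "\n".join(documents[i] for i in ids[0])
# `context` would be prepended to the LLM prompt in the generation step.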