Software Engineer - Large Language Models | Antal Tech Jobs
4 Days ago

Software Engineer - Large Language Models

Information Technology
Full-Time
Muoro

Overview

Description

Role Overview:

We are seeking a highly skilled Software Engineer specializing in Large Language Models (LLMs) to design, develop, and deploy cutting-edge AI solutions leveraging state-of-the-art transformer architectures.

  • The ideal candidate will have strong expertise in deep learning, NLP, and model optimization, combined with software engineering best practices for building scalable AI systems in production.
  • You'll collaborate with data scientists, ML engineers, and product teams to build intelligent applications powered by advanced generative AI models such as GPT, LLaMA, Falcon, Mistral, Claude, or similar open-source and proprietary models.

Key Responsibilities

  • Design, train, fine-tune, and evaluate Large Language Models (LLMs) for specific use cases (e.g., summarization, code generation, chatbots, reasoning, and retrieval-augmented generation).
  • Experiment with transformer-based architectures (e.g., GPT, T5, BERT, LLaMA, Mistral).
  • Develop parameter-efficient fine-tuning (PEFT) strategies such as LoRA, QLoRA, adapters, or prompt-tuning.
  • Create and maintain high-quality datasets for pretraining, fine-tuning, and evaluation.
  • Optimize model inference using techniques like quantization, distillation, and tensor parallelism for real-time or edge deployment.
  • Integrate LLMs into production environments using frameworks like Hugging Face Transformers, PyTorch Lightning, or DeepSpeed.
  • Implement scalable model serving solutions using FastAPI, Ray Serve, Triton Inference Server, or similar frameworks.
  • Build and maintain APIs or SDKs that expose LLM capabilities to other teams and products.
  • Evaluate and experiment with open-source and proprietary foundation models.
  • Keep up with the latest trends in Generative AI, NLP, and Transformer models.
  • Perform benchmarking, ablation studies, and A/B testing to measure performance, cost, and quality improvements.
  • Collaborate with MLOps and DevOps teams to design CI/CD pipelines for model training and deployment.
  • Manage and optimize GPU/TPU clusters for distributed training and inference.
  • Implement robust monitoring, logging, and alerting for deployed AI systems.
  • Ensure software follows clean code principles, version control, and proper documentation.
  • Partner with product managers, data scientists, and UX teams to identify and translate business problems into AI-driven solutions.
  • Contribute to internal research initiatives and help shape the company's AI strategy.
  • Mentor junior engineers in AI model development, coding standards, and best practices.

Required Technical Skills

Core Expertise:

  • Strong proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, JAX).
  • Hands-on experience with transformer architectures and LLM fine-tuning.
  • Deep understanding of tokenization, attention mechanisms, embeddings, and sequence modeling.
  • Experience with Hugging Face Transformers, LangChain, LlamaIndex, or OpenAI API.
  • Experience deploying models using Docker, Kubernetes, or cloud ML services (AWS SageMaker, GCP Vertex AI, Azure ML, OCI Data Science).
  • Familiarity with model optimization (quantization, pruning, distillation).
  • Knowledge of retrieval-augmented generation (RAG) pipelines and vector databases (FAISS, Pinecone, Weaviate, Chroma).

Additional Skills (Good To Have)

  • Experience with multi-modal models (text + image, text + code).
  • Familiarity with MLOps tools like MLflow, Kubeflow, or Weights & Biases (W&B).
  • Understanding of Responsible AI practices: bias mitigation, data privacy, and model explainability.
  • Experience contributing to open-source AI projects.

(ref:hirist.tech)