Free cookie consent management tool by TermsFeed Senior Multimodal Voice AI Engineer | Antal Tech Jobs
Back to Jobs
1 Week ago

Senior Multimodal Voice AI Engineer

decor
2600000 - 3000000 INR - Yearly
Sonipat, Haryana, India
Information Technology
Full-Time
ViH Metaverse

Overview

About Us
We're building the next generation of voice AI — where LLMs don't just read and write, they listen and speak. We need an engineer who deeply understands both LLM architectures and audio systems to build seamless audio-to-audio experiences.

Tech Stack
Core: Python, PyTorch, HuggingFace Transformers
Speech/Audio: Whisper, Wav2Vec2, Coqui TTS, ESPnet, librosa, torchaudio
LLM Infra: vLLM, TensorRT-LLM, ONNX, Triton
Audio Codecs: EnCodec, SoundStream, DAC
Vocoders: HiFi-GAN, Vocos, BigVGAN
Infra: Docker, Kubernetes, AWS/GCP, Redis, Kafka

What You'll Do
LLM Optimization & Integration

Optimize LLM inference for real-time voice applications (latency, throughput, memory)
Integrate audio encoders/decoders with transformer-based language models
Implement streaming inference pipelines for conversational AI
Fine-tune and adapt LLMs for speech-aware tasks

Audio-to-Audio Systems

Build end-to-end speech-to-speech pipelines (ASR → LLM → TTS)
Develop real-time voice transformation and conversion models
Implement neural audio codecs for speech tokenization
Design low-latency (<300ms) duplex conversation systems

Voice Synthesis & Processing

Build/optimize TTS systems for natural, expressive speech
Implement neural vocoders for high-quality audio generation
Design phoneme-level models and G2P systems
Develop voice cloning and speaker adaptation capabilities

What We're Looking For
Must Have

5+ years in ML/AI engineering with focus on speech or audio
Deep understanding of transformer/LLM architectures and how to optimize them
Hands-on experience with speech models (Whisper, Wav2Vec2, or similar)
Experience building TTS or ASR systems in production
Strong Python + PyTorch skills
Understanding of audio fundamentals (spectrograms, mel-filterbanks, sampling)

Good to Have

Experience with neural audio codecs (EnCodec, SoundStream, DAC)
Familiarity with LLM serving (vLLM, TensorRT-LLM)
Background in real-time audio streaming (WebRTC)
Published work or open-source contributions in speech AI
C++/CUDA for performance optimization

Share job
Similar Jobs
View All
2 Hours ago
Software Development Engineer – III (Erlang)
Information Technology
  • 5 - 9 Yrs
  • Gurgaon / Gurugram
About the Role We are seeking a Software Development Engineer – III to design, develop, and optimize high-performance, distributed backend systems that power real-time, large-scale automation and orchestration platforms. This role is ideal for ...
decor
21 Hours ago
MDG Technical Developer
Aerospace & Defense
  • 6 - 10 Yrs
  • Bangalore
Summary role description: Hiring MDG Technical Developer for a top global aerospace and defence innovator offering impactful, cutting-edge work. Company description: Our client is a leading global player in the aerospace and def...
decor
1 Day ago
Engineering Manager
Internet
  • 8 - 13 Yrs
  • Bangalore
Key Responsibilities: ● Leadership & Strategy ○ Lead and grow a team of backend,and FE engineers focused on Search, Ranking, and Product Discovery. ○ Collaborate with Product, Data Engineering, and UX teams to define the long-term search roa...
decor
1 Day ago
Junior Android Developer
Information Technology
  • 800000 - 1200000 INR - Annual
  • 1 - 2 Yrs
  • Pune
Title: Android Developer Location: Pune (Hinjewadi Phase 1 - WFO) Experience: 0 - 2 Years We are hiring fresh graduates from premium engineering colleges for an exciting Android Developer opportunity with a global leader in aviation technolo...
decor
1 Day ago
Software Engineer in Delhi
Space Exploration & Research, Information Technology
  • Mumbai, Maharashtra, India
Key Responsibilities Design and develop computer vision and video analytics modules for real-time traffic and safety applications. Integrate AI/ML models using frameworks like OpenCV, TensorFlow, or PyTorch. Work with live camera feeds, GStreamer pip...
decor
1 Day ago
iOS Developer
Space Exploration & Research, Information Technology
  • Mumbai, Maharashtra, India
We are seeking a talented and passionate iOS Developer to join our growing mobile development team. The ideal candidate will have a strong understanding of the iOS platform, excellent proficiency in Swift and/or Objective-C, and a commitment to writi...
decor
1 Day ago
Senior Data Analyst - R/Python
Space Exploration & Research, Information Technology
  • Mumbai, Maharashtra, India
DescriptionWe are looking for an experienced and dynamic Data Analyst Lead to head our data analytics function. This role requires a blend of hands-on analytics expertise and leadership skills to guide a team of data analysts in delivering high-quali...
decor
1 Day ago
Senior DevOps Engineer - AWS & GCP (On-site)
Space Exploration & Research, Information Technology
  • Mumbai, Maharashtra, India
About us:Working at Tech Holding isn't just a job, it's an opportunity to be a part of something bigger. We are a full-service consulting firm that was founded on the premise of delivering predictable outcomes and high-quality solutions to our client...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media