INR 2,500,000 - 3,000,000 per year
Mumbai, Maharashtra, India
Information Technology
Full-Time
Mowka
Overview
Hiring: AI Engineer
We're building India's most powerful real-time voice AI platform — custom SLMs and Speech-to-Speech agents that already power millions of calls across banking, healthcare, and new economy brands.
We're looking for an ML engineer who wants to own the voice stack end-to-end — not just run experiments, but ship models that real people talk to every day.
The Mission
As our ML Engineer for Speech & Audio, you'll work across the full model lifecycle — training, fine-tuning, evaluation, and inference optimization. You'll build the core models behind our voice agents and take them from research to production.
What You'll Work On
- Training and fine-tuning small language models (SLMs) for voice use cases
- Building turn detection and end-of-utterance models for real-time voice conversations
- Training and improving TTS models for natural, low-latency speech synthesis
- Data preparation, model evaluation, and inference optimization as a core, ongoing part of every project (better data always wins)
What We're Looking For
- 2+ years of hands-on ML experience
- Strong Python and PyTorch skills
- Experience with model training, fine-tuning, and evaluation pipelines
- Familiarity with speech/audio pipelines, sequence models, or transformers
- Solid instincts for datasets, experiment design, and debugging model behaviour
Good to Have
- Experience with ASR, TTS, or speech models
- Exposure to low-latency inference or production ML systems
- Understanding of multilingual or Hindi/English speech systems
Why Join
- Your models will directly power millions of real voice interactions — not a side experiment
- Early-stage, angel-backed company scaling fast — front-row seat to building a category-defining product