Overview
About Neoflo
Neoflo.ai is building an AI-first platform to automate document-heavy enterprise workflows. We work on intelligent document processing, knowledge graphs, RAG systems, and AI agents that help enterprises reduce manual effort and scale operations.
Role
We are looking for a hands-on engineer who can take Data Science models and AI prototypes and turn them into reliable, scalable production services.
You will work at the intersection of AI, backend engineering, and cloud infrastructure, owning deployment, scalability, observability, and operational excellence of AI systems.
Responsibilities
Deploy and manage ML/LLM models in production.
Build FastAPI-based services and AI workflows.
Design and maintain CI/CD pipelines for AI applications.
Develop and operate RAG systems using Vector Databases and Knowledge Graphs.
Manage AWS infrastructure, containers, and model-serving environments.
Work closely with Data Scientists to productionize models and optimize inference performance.
Monitor, troubleshoot, and scale AI workloads.
Required Skills
3+ years of software engineering experience.
Strong Python and FastAPI experience.
Experience deploying ML/AI models in production.
Hands-on experience with AWS, Docker, and Kubernetes.
Experience with MongoDB, Redis/cache, and Vector Databases (Qdrant, Weaviate, Milvus, etc.).
Strong understanding of CI/CD, system design, and production operations.
Nice to Have
Experience with vLLM, Triton, Ray Serve, or similar model-serving frameworks.
Experience with LLMs, embeddings, RAG, and Knowledge Graphs.
Startup experience and ability to operate with high ownership.