Overview
About Us
SimplAI is redefining how enterprises adopt AI by offering the most intuitive and rapid platform to build, deploy, and monitor intelligent applications. Our mission is to simplify innovation for businesses through scalable, secure, and reliable AI infrastructure.
Job Description: DevOps Engineer
We are looking for a passionate DevOps Engineer with 1-3 years of industry experience to join our team in building, managing, and automating highly available, secure cloud infrastructure. You’ll be responsible for delivering fault-tolerant architectures that run with optimal capacity and cost-efficiency across AWS, Azure, and GCP. This role requires deep technical understanding, problem-solving ability, and a strong foundation in computer science principles.
Key Responsibilities:
- Manage scalable, secure infrastructure across AWS, Azure, and GCP with a focus on performance tuning, high availability, and cost optimization.
- Automate provisioning and configuration using tools like Terraform, Ansible, and Chef.
- Build and maintain CI/CD pipelines using Jenkins, Git, Helm, and Docker.
- Operate and optimize Kubernetes clusters (EKS, AKS, GKE) and container-based environments.
- Implement monitoring, alerting, and logging with Prometheus, Grafana, ELK, New Relic, and other observability tools.
- Write and maintain infrastructure automation scripts using Python or Shell.
- Support hybrid environments with tools like Istio for service mesh and traffic control.
- Conduct system patching, upgrades, audits, and enforce security best practices (VPN, SSH, encryption).
- Troubleshoot production issues, perform root cause analysis, and ensure operational continuity.
Required Qualifications:
- 1–3 years of experience in Linux-based system administration and troubleshooting.
- Hands-on experience with cloud infrastructure provisioning in AWS, Azure, or GCP.
- Familiarity with AIOps, MLOps, and GPU-based workloads and deployment environments.
- Proficiency in Python, Shell, or any mainstream development language for automation (Java, Ruby, Node.js).
- Familiarity with managing application infrastructure including NGINX, HAProxy, Apache, and load balancers.
- Experience in CI/CD pipeline design and implementation using Jenkins, Git, Artifactory, and container technologies.
- Knowledge of container orchestration and management using Kubernetes and related tools.
- Experience with databases like PostgreSQL, MySQL, MongoDB, and Redis, including backup, scaling, and performance tuning.
- Understanding of infrastructure monitoring using tools like Prometheus, Zabbix, CloudWatch, and New Relic.
- Strong grasp of secure infrastructure practices, including encryption, VPNs, and SSH.
Join SimplAI to work on cutting-edge cloud infrastructure supporting enterprise AI solutions—with a focus on automation, observability, cost-efficiency, and security.
Why Join Us?
- Exposure to real-world AI product development
- Work with passionate and brilliant minds
- Be part of building cutting-edge AI solutions with purpose