Overview
Company: Foundit
Website: Visit Website
Business Type: Enterprise
Company Type: Product & Service
Business Model: B2B
Funding Stage: IPO/Public
Industry: HR Tech
Salary Range: ₹ 20-26 Lacs PA
Job Description
About foundit.in
foundit.in (formerly Monster India) is one of India’s leading talent platforms, offering AI-driven job matching and a large database of active job seekers. The platform enables employers to post jobs, search candidates using 35+ smart filters, and run targeted hiring campaigns. Job seekers can explore opportunities across roles, industries, and experience levels through web and mobile apps. foundit focuses on improving hiring efficiency through technology, data insights, and integrated career services.
About Us / Team Context
We are building a robust, scalable, and cloud-native infrastructure backbone to support our applications and services. We are seeking experienced and motivated DevOps / SRE professionals to join our infrastructure team. This is a critical role: you will be responsible for ensuring reliability, automation, scalability, and efficient deployment workflows, working across cloud environments, containers, orchestration, and infrastructure-as-code.
Shift timing: 12:00 PM to 9:00 PM
Role Overview
As part of the infrastructure team, you will:
- For SRE / DevOps Engineer: ensure system reliability, uptime, scalability, automated deployments, and robust cloud-native operations.
- For Cloud Infrastructure DevOps Engineer: design, build, manage and maintain cloud infrastructure and CI/CD pipelines; manage container orchestration and deploy applications; automate deployments and infrastructure provisioning.
You will collaborate closely with development, QA, security, and operations teams to deliver production-grade systems, enforce best practices, drive automation, and support deployments and releases.
Key Responsibilities
- Manage and provision cloud infrastructure (on AWS, Azure or GCP) using Infrastructure-as-Code (IaC) tools such as Terraform. Simply
- Manage container orchestration and deployments using Kubernetes; use packaging tools such as Helm charts for application deployments.
- Implement GitOps / CI/CD workflows using tools such as Argo CD (or similar), Git, and version-control to enable automated, reproducible, reliable deployments.
- Write automation and scripting (e.g. in Python, Bash or similar) to support infrastructure tasks, deployment pipelines, configuration management, and operational workflows.
- Setup, manage, and maintain CI/CD pipelines that support continuous integration, delivery and deployment, ensuring fast and reliable releases.
- Ensure high availability, scalability, performance, security, and reliability of production systems and cloud infrastructure. Monitor, troubleshoot, do root-cause analysis (RCA) for incidents, and ensure minimal downtime.
- Collaborate with development, QA, security and operations teams to streamline deployment processes, enforce DevOps / SRE best practices, and support cross-team workflows.
- Maintain documentation, runbooks, best practices, and standards for infrastructure, deployments, and operational procedures.
- Participate in on-call rotations or shift-based operations (given the shift timing), ensuring coverage for production environments, incident response, and post-mortem analyses.
- Total experience: 6–10 years in DevOps / SRE / Cloud Infrastructure / Platform Engineering roles with production-grade environments.
- Solid hands-on experience with cloud platforms: AWS, Azure or GCP.
- Proficiency in Infrastructure-as-Code (IaC) tools: Terraform (mandatory).
- Strong container orchestration and containerization experience: Kubernetes (production-grade), Helm charts, Docker or analogous technologies.
- Experience with Git / version control, and CI/CD / GitOps workflows — including Git, Argo CD (or other GitOps), CI/CD tools (e.g. Jenkins, GitLab CI, GitHub Actions, etc.).
- Scripting / programming skills — Python (preferred) or Bash / shell scripting for automation tasks.
- Good understanding of system reliability, monitoring, logging, observability, incident response, and cloud/wrapper security best practices.
- Strong problem-solving, troubleshooting skills, ability to perform root-cause analysis, and ensure production stability under shift-based working model.
- Excellent collaboration and communication skills, ability to work with cross-functional teams, and adapt to dynamic environments.