Overview
About the Role
We are looking for a skilled DevOps Engineer / Site Reliability Engineer to join our infrastructure team. You will be responsible for building, maintaining, and scaling our cloud infrastructure while ensuring high availability, reliability, and performance of our systems. The ideal candidate has hands-on experience with AWS services, Kubernetes orchestration, and modern infrastructure practices.
Key Responsibilities
- Infrastructure & Cloud Management
- Design, implement, and manage scalable infrastructure on AWS (EC2, EKS, RDS, S3, Lambda, VPC, IAM, CloudWatch, etc.)
- Build and maintain Kubernetes clusters for container orchestration and microservices deployment
- Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or Pulumi
- Manage networking, security groups, load balancers, and DNS configurations
- Reliability & Monitoring
- Define and track SLIs, SLOs, and error budgets to ensure system reliability
- Set up comprehensive monitoring, alerting, and logging using tools like Prometheus, Grafana, ELK Stack, or Datadog
- Conduct incident response, root cause analysis, and post-mortems
- Implement disaster recovery strategies and ensure business continuity
- CI/CD & Automation
- Design and maintain CI/CD pipelines using Jenkins, GitLab CI, GitHub Actions, or AWS CodePipeline
- Automate repetitive tasks through scripting (Python, Bash, Go)
- Implement GitOps practices using ArgoCD or Flux
- Security & Compliance
- Implement security best practices across infrastructure (secrets management, encryption, access controls)
- Conduct regular security audits and vulnerability assessments
- Ensure compliance with industry standards and organizational policies
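To illustrate the SLO and error-budget work mentioned under Reliability & Monitoring, here is a minimal sketch of the arithmetic involved. The function names, the 30-day window, and the 99.9% target are assumptions chosen for illustration; they do not describe our actual targets or tooling.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction such as 0.999 (hypothetical example value).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent, based on request counts.

    good_events / total_events is the measured SLI. A negative result
    means the budget has been overspent.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_events <= 0 or not 0 <= good_events <= total_events:
        raise ValueError("event counts are inconsistent")
    allowed_failures = (1.0 - slo_target) * total_events
    actual_failures = total_events - good_events
    return 1.0 - actual_failures / allowed_failures


if __name__ == "__main__":
    # A 99.9% target over 30 days allows roughly 43.2 minutes of downtime.
    print(round(error_budget_minutes(0.999), 1))
    # 500 failed requests out of 1,000,000 spends about half the budget.
    print(round(budget_remaining(0.999, 999_500, 1_000_000), 2))
```

In practice these numbers would typically come from a monitoring system such as Prometheus or Datadog rather than being computed by hand.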
Required Qualifications
- Experience: 2-4 years in DevOps, SRE, or Cloud Infrastructure roles
- AWS Expertise: Strong hands-on experience with core AWS services (EC2, EKS, RDS, S3, Lambda, VPC, IAM, CloudWatch, Route53, ALB/NLB)
- Kubernetes: Proficient in deploying, managing, and troubleshooting Kubernetes clusters (EKS preferred)
- IaC Tools: Experience with Terraform, CloudFormation, or similar tools
- Scripting: Proficiency in Python, Bash, or Go
- CI/CD: Hands-on experience building and maintaining pipelines
- Linux: Strong Linux system administration skills
- Networking: Solid understanding of TCP/IP, DNS, HTTP/HTTPS, load balancing, and firewalls
Arivihan App Link: https://play.google.com/store/apps/details?id=arivihan.technologies.doubtbuzzter2
Website: https://arivihan.com/
YouTube: http://www.youtube.com/@mpboardarivihan
Instagram: https://www.instagram.com/mpboard.arivihan/
LinkedIn: https://www.linkedin.com/company/arivihan/posts/?feedView=all