Overview
Job SummaryAs a Senior DevOps Engineer, you'll design, automate, and optimize cloud infrastructure to empower development teams and ensure the reliability, scalability, and security of our distributed systems. You'll have major hands-on work along with collaborating closely with engineering, security, and product teams to streamline CI/CD pipelines, monitor production environments, and drive operational excellence.
Key Responsibilities
- Cloud Infrastructure Management
Architect, deploy, and manage multi-cloud environments (AWS/GCP/Azure/OnPrem/Edge) for scalability, cost-efficiency, and high availability.
Implement infrastructure-as-code (IaC) using Terraform/CloudFormation to provision and manage resources across development, staging, and production.
- CI/CD Pipeline Automation
Build and maintain CI/CD pipelines (GitHub Actions, Jenkins, ArgoCD) for seamless, zero-downtime deployments of microservices and applications.
Automate testing, security scans, and compliance checks within pipelines.
- Containerization & Orchestration
Setup/manage Kubernetes clusters (EKS/GKE/AKS) and optimize Docker workflows for containerized applications for both on-prem and hybrid cloud environments.
Design autoscaling strategies (Horizontal Pod Autoscaler, Cluster Autoscaler) to handle variable workloads.
- Monitoring, Logging & Observability
Implement end-to-end monitoring with Prometheus/Grafana, ELK Stack, or Datadog to ensure system health and performance.
Configure centralized logging and distributed tracing (Jaeger, OpenTelemetry) for debugging and root-cause analysis.
Drive SRE practices (SLIs/SLOs, error budgets, incident management) to maintain 99.9%+ uptime.
- Security & Compliance
Enforce security best practices: IAM policies, secrets management (Hashicorp Vault, AWS Secrets Manager), and network security (VPCs, firewalls).
Ensure compliance with GDPR, SOC2, HIPAA, or other industry standards.
- Collaboration & Optimization
Partner with developers to troubleshoot performance bottlenecks and optimize resource utilization (CPU, memory, storage).
Mentor junior engineers on DevOps/SRE best practices and tooling.
Technical Qualifications
Experience: 5+ years in DevOps, SRE, or Cloud Engineering roles.
Proven track record managing large-scale, production-grade cloud environments.
Core Skills
- Cloud Platforms: AWS/GCP/Azure (certifications preferred).
- CI/CD Tools: Jenkins, GitHub Actions, GitLab CI, ArgoCD.
- Containerization: Docker, Kubernetes, Helm.
- Infrastructure-as-Code: Terraform, Ansible, Puppet, Chef.
- Monitoring: Prometheus, Grafana, Nagios, Datadog, ELK.
- Scripting: Python, Bash, or Go for Skills:
- Experience with database management (PostgreSQL, Redis, MongoDB, Elastic).
- Knowledge of networking (CDNs, load balancers, DNS).
- Familiarity with serverless architectures (AWS Lambda, Azure Functions).
- Certifications in cloud technologies (AWS/GCP).
- Expertise in performance tuning, security, or advanced system design.
- Experience with edge computing for video processing
- Deploy microservices to automate AI/ML pipelines (training, inference, monitoring)
- Experience with edge computing for video processing
(ref:hirist.tech)