
Overview
BETSOL is a cloud-first digital transformation and data management company offering products and IT services to enterprises in over 40 countries. The BETSOL team holds several engineering patents, has been recognized with industry awards, and maintains a net promoter score that is 2x the industry average.
BETSOL’s open source backup and recovery product line, Zmanda (Zmanda.com), delivers up to 50% savings in total cost of ownership (TCO) and best-in-class performance.
BETSOL Global IT Services (BETSOL.com) builds and supports end-to-end enterprise solutions, reducing time-to-market for its customers.
BETSOL offices are set against the vibrant backdrops of Broomfield, Colorado and Bangalore, India.
We take pride in being an employee-centric organization, offering comprehensive health insurance, competitive salaries, a 401(k) plan, volunteer programs, and scholarship opportunities. Office amenities include a fitness center, cafe, and recreational facilities.
Learn more at betsol.com
Job Description
Job Summary:
As a Cloud Engineer (Sr), you will be responsible for Cloud infrastructure architecture, resource provisioning, automation, and building observability on the Cloud platform. You will work closely with the Application and Architecture teams to build new solutions and automations, and help maintain a secure and high-performing Cloud platform.
Responsibilities:
Engineering, Provisioning, Changes
- Build new Cloud solutions for resource provisioning and observability.
- Provision Cloud resources (Day 1).
- Work on tickets submitted by the Application Team for Cloud support.
Troubleshooting and Support
- Must have good troubleshooting skills for Cloud and OS issues.
- Join P1/P2 Incident/outage calls as needed.
- Collaborate with other teams to diagnose and resolve infrastructure problems.
Working Hours
- Flexible working hours are required, as you will be part of a 24/7 team.
- Be available for on-call work.
Architecture Meetings
- Join architecture discussions with the Application teams and the Cloud Architecture team, and provide recommendations.
Technical Skills
Engineering/Development
Must Have
- Experience in engineering activities on AWS services (EC2/RDS/S3/LB/CloudFront/Kafka/EMR/EKS/ECS/Route53/SFTP/CloudWatch/ElastiCache, etc.).
- Experience automating AWS tasks using AWS Lambda.
- Expertise in writing Terraform & CloudFormation scripts.
- Expertise in container platforms such as AWS EKS/Kubernetes and AWS ECS, and in API management platforms such as AWS API Gateway.
- Good understanding of running and maintaining container workloads.
- Experience in disaster recovery and data replication processes.
- Experience in data backup and recovery activities using AWS Backup.
Troubleshooting Skills
- Strong skills in troubleshooting AWS service-related issues. Experience with CloudWatch Logs Insights queries is a must.
- Experience in Data Recovery & Restore activities.
- Good knowledge of DNS and AD.
- Experience troubleshooting EKS using K9s.
Configuration Management
- Experience in Configuration Management using Ansible.
- Experience in AMI & Container Image builds.
DevOps
- Experience using any CI/CD tool (Jenkins/Spinnaker preferred) and a good understanding of DevOps concepts.
- Familiar with DevOps technologies and frameworks.
Security
- Experience with Cloud security, specifically running Cloud resources securely.
- Good experience with SSL certificate provisioning and management.
- Good understanding of AWS service policies (IAM, SCP, KMS, S3, VPC endpoints).
- Remediate OS & container image vulnerabilities.
Automation
- Identify automation opportunities.
- Python or Node.js development experience.
- Bash scripting.
Good To Have
- Experience in migrating resources to AWS Cloud.
- AWS Application Architecture experience.
- OS experience (Linux/Windows).
- Experience in ReactJS.
Metrics to maintain
Business Impact Metrics
- Time to Market: Duration to deploy new applications, features, or services to production.
Automation Metrics
- Automation Coverage: Percentage of manual tasks replaced with automation (e.g., using AWS Lambda, Step Functions, etc.).
- Number of Terraform modules built.
Infrastructure Build Metrics
- Number of new application infrastructures built.
- Time taken to deliver Star Program apps.
Documentation
- Number of processes and solutions documented.
Original Solutions
- Number of original solutions contributed to support the Cloud Platform.
Incident Management Metrics
- Number of issues troubleshot and fixed.
- Number of issues escalated to AWS support (support tickets raised).
Image Management
- Number of Golden AMIs published.
- Number of Container Images provisioned.
Qualifications
Educational Qualifications
- Bachelor's degree in any Technology stream.
- Any AWS certification (SysOps Administrator/DevOps Professional/SA Associate/SA Professional/any AWS Specialty certification).
- A completed Terraform certification.
Soft Skills
- Strong communication and collaboration skills.
- Proactive and self-motivated with a strong work ethic.
Work Experience
- At least 3 years of hands-on experience in a similar role is preferred.