Overview
Company: Urbint
Website: Visit Website
Business Type: Startup
Company Type: Product
Business Model: B2B
Funding Stage: Series C
Industry: Software Development
Salary Range: ₹ 45-50 Lacs PA
Job Description
Company Overview:
Urbint is an AI-powered software company that helps utilities, energy, and infrastructure organizations predict and prevent operational risks. Its cloud-based platform combines real-world data with advanced analytics to improve safety, protect critical assets, and optimize field operations. Trusted by major North American energy and infrastructure companies, Urbint empowers teams to make smarter, faster decisions to enhance reliability and resilience.
Job Summary
We're seeking an experienced CloudOps Engineer III to help evolve and strengthen Urbint's cloud infrastructure and reliability practices. You'll be part of a high-performing Site Reliability Engineering (SRE) and Cloud Operations team responsible for building and maintaining hybrid cloud environments with a focus on uptime, performance, security, and cost efficiency. This is a hands-on, technical role ideal for someone who thrives on automation, scalability, and solving complex infrastructure challenges.
What You'll Do
- Cloud Infrastructure & Operations: Build, maintain, and scale reliable cloud systems across AWS, GCP, and hybrid environments.
- Infrastructure as Code (IaC): Design and implement repeatable, automated infrastructure deployments using Terraform, Kubernetes, and CI/CD pipelines.
- Cloud Infrastructure & Operations: Build, maintain, and scale reliable cloud systems across AWS, GCP, and hybrid environments.
- Monitoring & Incident Response: Develop and maintain monitoring, alerting, and observability using Prometheus, Grafana, and DataDog. Participate in incident response, troubleshooting, and root cause analysis.
- System Reliability & Performance: Partner with engineering teams to improve system resilience, scalability, and performance of distributed microservices.
- Security, Governance & Compliance: Ensure infrastructure meets compliance and security standards (SOC 2, ISO 27001, HIPAA). Implement IAM, encryption, and network security best practices.
- Cost Optimization (FinOps): Analyze cloud resource usage and collaborate with teams to optimize cost and performance.
- Collaboration: Work closely with Product, Engineering, and Security to align infrastructure needs with business goals.
- Continuous Improvement: Contribute to infrastructure design discussions, propose improvements, and champion DevOps best practices across the organization.
Required Skills & Expertise
- 7–12 years in CloudOps, SRE, or DevOps supporting large-scale distributed systems.
- 4+ years hands-on experience with AWS, GCP, or Azure.
- 3+ years practical experience with Kubernetes, Docker, Terraform, and CI/CD tools.
- Strong scripting and automation skills (Python, Go, or Shell).
- Experience with monitoring and logging tools (Prometheus, Grafana, DataDog, ELK).
- Familiarity with distributed systems, microservices, and cloud-native architectures.
- Understanding of IAM, VPCs, encryption, and security best practices.
- Exposure to regulated environments (SOC 2, HIPAA, ISO 27001).
- Self-driven, detail-oriented, and passionate about operational excellence.
- Comfortable in collaborative, cross-functional teams.
- Curious, continuously learning, and focused on improving systems and processes.