Overview
Role DescriptionWe are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI/CD automation. The ideal candidate will be capable of independently designing, building, and maintaining reliable, scalable, and secure cloud infrastructure while closely collaborating with development teams to ensure seamless delivery of applications.
Key Responsibilities
CI/CD & Automation
- Design, develop, and optimize end-to-end CI/CD pipelines using Jenkins, with a strong focus on Declarative Pipeline syntax
- Automate build, test, and deployment workflows to improve release velocity and reliability
Cloud Infrastructure & Operations
- Automate deployment, scaling, and management of applications across GCP services, including:
- Google Kubernetes Engine (GKE)
- Cloud Run
- Compute Engine
- Cloud SQL
- Cloud Storage
- Virtual Private Cloud (VPC)
- Cloud Functions
Collaboration & Integration
- Work closely with development teams to ensure smooth integration of applications into CI/CD pipelines and the GCP ecosystem
- Provide guidance on best practices for cloud-native application deployment
Observability & Reliability
- Implement and manage logging, monitoring, and ing solutions for cloud infrastructure and applications
- Proactively identify performance bottlenecks and reliability risks
Security & Compliance
- Ensure adherence to security best practices and organizational policies within the GCP environment
- Support secure CI/CD pipelines, access control, and infrastructure hardening
Documentation & Continuous Improvement
- Document processes, configurations, and architectural decisions
- Stay up to date with the latest GCP services, SRE principles, and DevOps best practices
- 8+ years of experience as an SRE or DevOps Engineer (preferably 9+ years)
- Strong hands-on experience with Google Cloud Platform (GCP)
- Expertise in Jenkins, specifically Declarative Pipelines
- Solid understanding of CI/CD concepts and automation
- Ability to work independently and deliver end-to-end solutions with minimal supervision
- Strong problem-solving and troubleshooting skills
Good to Have
- Experience with Kubernetes (GKE) and container-based deployments
- Knowledge of Infrastructure as Code (Terraform or similar)
- Familiarity with SRE practices such as SLIs, SLOs, and error budgets
Key Skills
GCP, Site Reliability Engineering, Jenkins (Declarative Pipeline), CI/CD, GKE, Cloud Run, Monitoring & Logging, Cloud Security, DevOps Automation
Skills
cloud infrastructure,gcp,site reliability engineering,ci/cd pipelines,jenkins,