Overview
You're the engineer who stabilizes 50+ SaaS products when everyone else is improvising. We need DevOps professionals who can explore unfamiliar AWS environments, restore order, and drive uptime beyond 99.9% using genuine monitoring, genuine automation, and rigorous RCAs. You'll break complex projects into daily deliverables, deliver production-ready Python or JavaScript, and leverage AI as your assistant.
Most organizations claim "cloud-native" while manually managing snowflakes. We're systematizing reliability across dozens of acquired offerings where original developers have departed and documentation is incomplete. That's where it gets interesting: you'll apply agents and contemporary tooling to understand new systems 5–10x more quickly, document your findings, and automate solutions so repeat incidents become impossible. Rather than evaluating you on certifications and vendor badges, we'll observe you troubleshoot in real time, author a genuine 5-Whys that identifies one preventable root cause, and create automations that withstand production conditions.
This is not a tier-two "follow the script" position. Here, you author the scripts, architect the deployment from development through staged environments to 10% rollout to full production with soak intervals and rollback conditions, and implement the monitors that detect corner cases. You reject dangerous changes before execution. You distinguish infrastructure failures you control from application bugs Engineering controls, and you route permanent remediation to the appropriate team.
You'll operate at the engineering center of reliability, driving infrastructure initiatives, incident management and RCAs, and change requests with copy-paste-executable documentation. If you've already managed a substantial SaaS offering and want to extend that expertise across a portfolio, join us. Bring expert-tier AWS knowledge, production-quality coding ability, uncompromising scope discipline, and daily, essential use of AI tooling. If you're prepared to maintain operational continuity, please apply.
What You Will Be Doing
- Sophisticated infrastructure migrations, consolidations, production-quality automations, monitoring implementations
- Diagnosing production incidents, deploying immediate remediation, and documenting root cause analyses with permanent corrections assigned to the owning teams
- Drafting, validating, and applying changes in production environments, including assessing whether a proposed change is safe for execution
- Spending hours in Jira and repetitive status meetings - we reward people who can deliver solutions, not simply document problems
- Supporting legacy systems forever - you'll be authorized to pursue substantial improvements
- Waiting through bureaucratic approval processes - you'll have the autonomy to deploy immediate fixes to address incidents
- Advance reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.
- Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
- Experience owning large production infrastructure and troubleshooting production outages independently (not just following a runbook)
- Experience scripting with Python and Bash for day-to-day administration operations
- Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
- Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)
- Linux systems administration expertise
Hundreds of software businesses run on the Trilogy Business Platform. For three decades, Trilogy has been known for 3 things: Relentlessly seeking top talent, Innovating new technology, and incubating new businesses. Our technological innovation is spearheaded by a passion for simple customer-facing designs. Our incubation of new businesses ranges from entirely new moon-shot ideas to rearchitecting existing projects for today's modern cloud-based stack. Trilogy is a place where you can be surrounded with great people, be proud of doing great work, and grow your career by leaps and bounds.
There is so much to cover for this exciting role, and space here is limited. Hit the Apply button if you found this interesting and want to learn more. We look forward to meeting you!
Working with us
This is a full-time (40 hours per week), long-term position. The position is immediately available and requires entering into an independent contractor agreement with Crossover as a Contractor of Record. The compensation level for this role is $50 USD/hour, which equates to $100,000 USD/year assuming 40 hours per week and 50 weeks per year. The payment period is weekly. Consult www.crossover.com/help-and-faqs for more details on this topic.
Crossover Job Code: LJ-5236-IN-Bengalur-DevOpsEngineer.002