Overview
As a DevOps engineer, you are a highly skilled and experienced subject matter expert with a deep
understanding of public, private and hybrid cloud platforms. Reporting to the DevOps Architect,
you will play a key technical role in the life cycle of projects, with significant interaction with
Architects, Tech Leads and Project Managers. Your in-depth knowledge of DevOps best practices,
concepts and tooling, and your practical experience in DevOps, SecOps and NetOps, are
the key qualifying attributes for this position.
What does success look like?
A successful DevOps Engineer, Infrastructure & Platforms will have mastered the arts of:
- Implementing solutions provided by Cloud Architects and Solution Architects
- Bridging the gap between development teams, infrastructure SMEs and other teams to
understand business requirements and deliver high-quality outcomes
- Maintaining and improving the stability, integrity and efficiency of Material’s DevOps tools and
Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) platforms through
monitoring, optimization and proactive maintenance
- Resolving incidents and problems by directing and coordinating the response during
business-critical incidents or outages
Responsibilities
Implementation
- Deploy & manage highly scalable, secure, and reliable cloud-based infrastructure using
AWS, Azure, and GCP.
- Use Infrastructure as Code to define & deploy cloud infrastructure.
- Translate logical designs into physical designs. Produce detailed designs and
document all work using required standards, methods, and tools, including prototyping
tools where appropriate.
- Implement & manage automation and containerization technologies like Docker and
Kubernetes to streamline application deployment and scaling.
- Implement cloud-based network infrastructure, ensuring high availability, security, and
performance.
- Review and optimize cloud environments to ensure adherence to Information Security
policies and industry best practices.
- Contribute to the Business Continuity Plan by implementing or improving disaster recovery (DR),
high availability (HA) and site reliability engineering (SRE) practices for critical cloud workloads.
- Deploy and manage on-premises and SaaS source code repositories, including Azure
DevOps, GitLab, GitHub and Bitbucket.
- Implement systems characterized by managed levels of risk, manageable business and
technical complexity, and meaningful impact, using well-understood technology design
patterns.
- Manage availability of different service components to ensure they meet business needs
and performance targets.
- Create and interpret automation scripts for cloud-based CI/CD pipelines to
deploy/upgrade/patch web applications.
- Deploy & promote applications across environments (e.g., Dev, Quality, Staging,
Production)
- Deliver tasks based on project objectives; technically support projects to completion.
- Ensure deliverables are completed within target timeframes and to a consistently high quality.
- Prepare documentation and support the transition of operational activities.
Collaboration
- Collaborate with Material IT groups (DWT, INF, EA) to implement baselines for IaaS
and PaaS platforms, adopting best practices and policies that support and optimize their
respective environments
- Collaborate with the DevOps Architect to support development teams with a full set of
DevOps ALM tools by leading the establishment of the right tooling and processes, resulting
in a fully automated release management pipeline that includes the automated build
process, environment setups, testing scripts, deployments, and production operational
metrics/debugging information (targeted at developers).
- Act as an influential and effective advisor and contributor to change management, minimizing
risks that could be introduced to new or existing environments
- Safely execute changes in an interruption-free manner using secure modern infrastructure
capabilities
- Promote quality work by bringing a multidisciplinary skill set to help connect the dots with IT
service owners, partners and vendors.
- Work closely with developers to ensure that applications are deployed, monitored, and
supported in a highly scalable and automated fashion.
- Document the processes and tasks being taught so they can be standardized and
cross-trained across the team
Operations
- Streamline DevOps processes throughout the organization, decreasing deployment time,
optimizing monitoring and reducing interruptions.
- Monitor & optimize costs with stakeholders to ensure IaaS & PaaS operations are within
budget.
- Participate in incident response activities on private and public cloud platforms.
- Innovate in the automation & reliability engineering space, making new solutions available
for use by our development community
- Optimize resource utilization on existing cloud workloads.
- Estimate costs for future cloud workloads based on project requirements.
- Assist with cloud ops for existing workloads (scaling, migration, decommission).
- Troubleshoot and identify problems across systems, including compute, storage,
networking, software, COTS and open-source packages and solutions, and cloud
computing (IaaS, PaaS, SaaS).
- Bring an automation mindset to eliminate mundane or repetitive tasks by providing
objective, practical and relevant ideas, insights and advice on DevOps initiatives.
- Analyze current processes, then identify and implement opportunities to optimize them.
- Cultivate a Continuous Integration/Continuous Delivery mindset by creating and
managing release pipelines for next-generation application development efforts and
ongoing production operations.
- Evaluate and establish requirements for the implementation of changes by setting policy
and standards.
- Report on established KPIs to ensure performance is measured against expected
business outcomes.
Requirements
Education & Certifications:
- Bachelor's or master's degree in Computer Science or Engineering (B.E./B.Tech/
M.E./M.Tech) or a related field, or equivalent experience.
- Certification in AWS cloud technologies is mandatory
- Good to have: certification in GCP or Azure; Certified Kubernetes Administrator
Done it:
- 4 to 5 years of relevant experience in cloud-based infrastructure architecture,
administration and monitoring.
- In-depth knowledge of and hands-on experience with AWS.
- Proven experience as a DevOps engineer.
- Strong understanding of networking, cloud, compute, and storage.
- Strong knowledge of Cloud Governance, focusing on application security, risk
management and collaboration.
- Experience in inter-cloud migration.
- Experience with scripting languages like Python, Ruby, PowerShell or Bash.
- Advanced experience using and managing Git version control
technologies (GitHub, GitLab, AWS CodeCommit, etc.)
- Proven experience applying best practices for frontend and backend
infrastructure (web engines, relational databases, storage, network, disaster
recovery, high availability)
- Experience troubleshooting complex issues ranging from system resources to
application stack traces
Know It
- Hands-on experience with GCP/Azure
- Experience with continuous integration tools (TeamCity, Azure DevOps, GitLab
CI/CD, AWS CodePipeline)
- Working knowledge of automated service provisioning.
- Hands-on experience with IaC tools such as Ansible, Chef, Terraform,
CloudFormation, Bicep, TypeScript, Azure ARM, etc.
- Hands-on experience with containerization technologies (Docker, Kubernetes, Docker Swarm,
AWS Fargate)
- Strong knowledge of Identity and Access Management
- Secure management of systems and networks through automation, capacity planning,
monitoring, performance tuning and testing
- Awareness of information security at large, including network, Docker and
Kubernetes security.
- Strong hardware and software diagnostic skills
Shown it
- Familiarity with web development and web server setup: Apache, Nginx, etc.
- Familiarity with Agile/Scrum process using platforms like Jira/Confluence
- Experience with Grafana, Prometheus, ELK/EFK stack
- Demonstrable experience in converting complex business & security needs into
fit-for-purpose, straightforward solutions, with well-managed execution and
non-interruptive transition to operations, as well as effective troubleshooting &
escalation handling.
- Demonstrated competencies in system administration, DevOps or systems
knowledge, and security protocols for client/server environments.
- Experience supporting at least one of the following databases: Microsoft SQL Server,
MySQL, MongoDB, PostgreSQL, etc.
- Working knowledge of related technologies including encryption, IPsec, VLANs,
VPNs, routing, firewalls, proxy services, LAN/WAN connectivity
Do it
- Experience with SCM tools like GitHub, GitLab, Bitbucket, AWS CodeCommit,
Azure Repos, etc.
- Experience in application and systems performance monitoring (Splunk, New
Relic, Azure Monitor, AWS CloudWatch).
- Excellent communication, interpersonal, analytical and problem-solving skills.
- Continuous application of skill sets in cloud and hybrid environments, adding
resiliency when introducing new and upcoming technologies.
- Experience with Apigee, Kong, or Kafka from an operational perspective is
good to have.
- Ability to read and understand release notes and follow deployment instructions
precisely to ensure zero defects in the deployment.
- Experience with open source/Linux is good to have.
- A humble and patient communicator who works effectively with distributed
teams and is equally a mentor and a mentee
- A creative mindset that brings ideas to life by leveraging technologies in new ways that
could have high impact in every aspect of operations
- A technical player who is trusted to identify challenges and define how to overcome
them.
- A multi-tasker who is flexible about working hours and works in a fast-paced environment
without being easily overwhelmed.
- Detail-oriented, always ready to put in the extra effort to deliver polished, well-formed
IT solutions for Material.
- Mindful of the employee experience: articulate, confident & well-spoken in all forms
of communication, comfortable being the voice of our customer to Material IT, and the
face of Material IT to our customers.