Hyderabad, Telangana, India
Information Technology
Full-Time
Jio
Overview
Skills:
monitoring platforms, Linux\ Windows OS, SQL/NoSQL, Shell or python scripting, aws, Google Cloud Platform (GCP),
Job Overview
Location - Bhubaneshwar and Pune
Education - B.Tech, BE, MCA, MTech
Experience: 6-8 years
As a Site Reliability (SRE)/DevOps Engineer, you will be responsible for the availability, automation, performance, efficiency, Scaling, monitoring and emergency response for any incidents/issues in Applications. You will use your deep understanding of platforms, architecture, people, systems, and processes to both establish and continuously improve SLIs and SLOs for uptime, performance, deployment, monitoring, and troubleshooting. You are interested in setting direction and leading the day to day processes that shape our vision for reliability.
Responsibilities and Duties
Maintain and support the Product and Data systems: proactively monitor events, investigate issues, analyze solutions, and drive problems through to resolution.
Experience with configuration management tools like Chef, Puppet, Salt or equivalent
Experience in Administration of AWS, Google or Azure Cloud
Define requirements and develop tools and reporting as needed by projects and operations.
Participate in 24x7 on-call rotation for after-hours emergencies
Use operational tools and monitoring platforms to gain in-depth knowledge, understanding, and ongoing monitoring of system availability, performance, and capacity.
Implement alerting strategy that makes alerts actionable and unique.
Provide follow-through to ensure issues are resolved to satisfaction
Drive continuous improvement and innovation within the team.
A sense of ownership, initiative and drive.
Qualifications
Bachelor's degree in Computer Science, or a related technical field involving software or systems engineering, or equivalent practical experience
Hands on Experience with Linux\ Windows OS
Hands on experience on managing Web servers, Application servers, Databases (SQL/NoSQL)
Experience on Scripting ( Shell or python) / Docker\Kubernetes
Knowledge of monitoring tools and strategy
Experience with incident management, running incident post-mortems
Solid understanding of automated deployment processes
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
Systematic problem-solving approach, coupled with effective communication skills and a sense of drive
monitoring platforms, Linux\ Windows OS, SQL/NoSQL, Shell or python scripting, aws, Google Cloud Platform (GCP),
Job Overview
Location - Bhubaneshwar and Pune
Education - B.Tech, BE, MCA, MTech
Experience: 6-8 years
As a Site Reliability (SRE)/DevOps Engineer, you will be responsible for the availability, automation, performance, efficiency, Scaling, monitoring and emergency response for any incidents/issues in Applications. You will use your deep understanding of platforms, architecture, people, systems, and processes to both establish and continuously improve SLIs and SLOs for uptime, performance, deployment, monitoring, and troubleshooting. You are interested in setting direction and leading the day to day processes that shape our vision for reliability.
Responsibilities and Duties
Maintain and support the Product and Data systems: proactively monitor events, investigate issues, analyze solutions, and drive problems through to resolution.
Experience with configuration management tools like Chef, Puppet, Salt or equivalent
Experience in Administration of AWS, Google or Azure Cloud
Define requirements and develop tools and reporting as needed by projects and operations.
Participate in 24x7 on-call rotation for after-hours emergencies
Use operational tools and monitoring platforms to gain in-depth knowledge, understanding, and ongoing monitoring of system availability, performance, and capacity.
Implement alerting strategy that makes alerts actionable and unique.
Provide follow-through to ensure issues are resolved to satisfaction
Drive continuous improvement and innovation within the team.
A sense of ownership, initiative and drive.
Qualifications
Bachelor's degree in Computer Science, or a related technical field involving software or systems engineering, or equivalent practical experience
Hands on Experience with Linux\ Windows OS
Hands on experience on managing Web servers, Application servers, Databases (SQL/NoSQL)
Experience on Scripting ( Shell or python) / Docker\Kubernetes
Knowledge of monitoring tools and strategy
Experience with incident management, running incident post-mortems
Solid understanding of automated deployment processes
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
Systematic problem-solving approach, coupled with effective communication skills and a sense of drive
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in