Overview
Location: Hyderabad
Department: IT-Infra/Systems
Reports to: Head of Tech & Production Engineering
About Mihira Visual Labs
Mihira Visual Labs is a research-driven CGI and VFX studio redefining filmmaking through AI- and ML-powered workflows. We specialize in the development and production of full-length animated films, empowering creators with cutting-edge tools to accelerate high-quality storytelling and IP creation. Our mission is to make world-class storytelling faster, more efficient, and more cost-effective — where human imagination is the only true differentiator.
Role Overview
The IT Manager will be responsible for designing, managing, and maintaining the studio’s advanced IT infrastructure, ensuring reliable, secure, and high-performance systems to support cutting-edge VFX and animation production. This role requires strategic leadership in network design, security compliance, IT service management, and team mentorship, with a focus on System Reliability & Performance Monitoring and Infrastructure Automation & Optimization.
Key Responsibilities
IT Infrastructure & Operations
- Design, install, configure, and maintain the studio IT infrastructure, including high-performance workstations, servers, storage (NAS/SAN), and virtualization infrastructure (Proxmox VE HA).
- Manage critical production systems including Render Farm Management and large-scale storage systems (NetApp, Isilon/Dell-PowerScale, TrueNAS).
- Drive Non-Disruptive Infrastructure Operations Management for maximum uptime, availability, and resiliency without performance degradation.
- Implement and maintain comprehensive System Reliability & Performance Monitoring and observability solutions (Prometheus + Grafana, LibreNMS, Beszel Pulse for Proxmox VE).
- Oversee Incident Response & Management and conduct Capacity Planning & Scaling for future production needs.
- Ensure efficient operations with regular Configuration and Change Management processes.
- Manage and support high-performance remote access technologies for artists and production staff, including platforms such as Teradici/HPAnyware (PCoIP), Parsec, Jump Desktop, and NICE DCV.
- Develop and maintain critical infrastructure dashboards and reporting for system performance, security posture, and compliance status.
- Design, implement, and maintain the studio's secure network architecture, covering Network Design, Security, Firewall, Switching, Routing, Segmentation, WiFi, and Network Access Control (NAC).
- Implement and manage advanced security operations including EDR/XDR, SEIM/SOAR, Observability, SASE, SSE, and SD-WAN.
- Enforce Data & Pipeline Security and secure file transfer protocols with tools like OPSWAT MetaDefender and SIGNIANT MediaShuttle.
- Ensure compliance with industry security standards, including working and execution experience with TPN, ISO/IEC 27001:2022, ISMS, and CIS benchmarks.
- Demonstrate expert, vendor-agnostic understanding of networking technologies, including the 7 OSI layers, network segmentation, and deep knowledge of Port, Protocol, Socket, Session, Packet, Frame, Payloads, PacketFence, and network analysis tools like TCPDUMP/WireShark.
- Implement and manage advanced network capabilities including the aggregation, load balancing, and Quality of Service (QoS) of multiple ISP links, as well as protocols and technologies such as BGP, LACP, P2P, MPLS, and SD-WAN.
- Manage user accounts, permissions, and access control across all OS platforms: Windows 11 Pro, macOS, and Linux (Debian, SuSE, RHEL), including integration with SSSD-AD and Kerberos.
- Implement and manage Directory Services (Active Directory and GW EPP-GCDS) for centralized IdP, IAM, SSO, FIM, SCHIM, DNS, DHCP, and KDC.
- Administer and secure the core communication and collaboration suite using Google Workspace Enterprise + Context-Aware/Conditional Access (CAA) MDM.
- Support the pipeline team by ensuring all VFX software (Maya, Substance Painter, etc.) is properly installed and maintained.
- Drive Infrastructure Automation & Optimization using scripting and version control, including GitHub + PowerShell + Python + Shell Scripting.
- Support DevOps and Deployment toolchains (e.g., Chocolatey, Ansible, Salt, Puppet, CI/CD - Jenkins) and Infrastructure as Code (IaC) practices (e.g., Terraform, Pulumi, OpenTofu).
- Oversee Docker/Container environments.
- Oversee all aspects of IT Service Management & Delivery, utilizing platforms like Atlassian Jira Service Management and GLPI/ITAM for Asset Management.
- Lead, mentor, and train the IT support team, fostering an environment of continuous learning and growth.
- Develop and maintain thorough Documentation & Knowledge Sharing platforms and provide Teams Training on new tools and processes.
- Perform deep-dive Troubleshooting and Debugging at all stack and layers of infrastructure for performance management, uptime, and availability.
- Establish and maintain comprehensive hardware and software inventory management (ITAM) processes, including tracking, auditing, and generating regular reports for compliance and planning.
- Manage key vendor relationships and coordination for technical support, procurement, and ongoing service delivery.
- Define, measure, and ensure adherence to Service Level Agreements (SLAs) for critical production systems, particularly the render farm and high-speed storage.
- Maintain a foundational understanding of Power cooling (HVAC) and UPS systems and their requirements for the studio’s data center and high-performance computing environment.
- Collaborate effectively with Facilities or Building Management Systems (BMS) teams to plan and support physical infrastructure requirements.
- 10 to 12 years of experience in IT infrastructure or systems administration, preferably in a Studio infrastructure environment.
- Expert-level experience managing Linux and Windows-based environments.
- Deep expertise in Network Design, Security Architecture, Firewall Management, Switching, Routing, Segmentation, WiFi, and NAC.
- Proven experience with TPN, ISO/IEC 27001:2022, ISMS, and CIS implementation and execution.
- Strong knowledge of Enterprise and Opensource NAS tech (NetApp, Dell-PowerScale, TrueNAS etc.) and virtualization (Proxmox VE HA).
- Strong troubleshooting and problem-solving skills, with the ability to debug at all stack and layers of infrastructure.
- Experience working in VFX, animation, gaming, or post-production studios.
- Familiarity with Render Farm Management and Pipeline & Tooling Support.
- Knowledge of GPU-based computing environments and practical experience with AWS or GCP (or Azure, IBM Softlayer) for Compute, Backup, Archival, and render bursting.
- Hands-on experience with advanced monitoring (Prometheus + Grafana) and IT service management tools (Atlassian Jira Service Management, GLPI).
- Proficient in GitHub + PowerShell + Python + Shell Scripting for automation.
- Experience implementing Disaster Recovery and System Stability plans.
- Hands-on experience implementing and maintaining remote desktop/WFH solutions for high-fidelity creative workflows (e.g., Jump Desktop, Teradici, Parsec, NICE DCV).
- Problem-solving mindset
- Strong communication and collaboration
- Attention to detail
- Ability to work in fast-paced production environments
Ready to push boundaries and leave your creative mark? Join a team that values bold ideas, collaboration, and growth, let’s create something extraordinary together.