Free cookie consent management tool by TermsFeed HPC & Cloud Engineer | Antal Tech Jobs
Back to Jobs
3 Days ago

HPC & Cloud Engineer

decor
Chennai, TN, India
Information Technology
Full-Time
Sandisk

Overview

Company Description


Sandisk understands how people and businesses consume data and we relentlessly innovate to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions have become the beating heart of the digital world we’re living in and that we have the power to shape.

Sandisk meets people and businesses at the intersection of their aspirations and the moment, enabling them to keep moving and pushing possibility forward. We do this through the balance of our powerhouse manufacturing capabilities and our industry-leading portfolio of products that are recognized globally for innovation, performance and quality.

Sandisk has two facilities recognized by the World Economic Forum as part of the Global Lighthouse Network for advanced 4IR innovations. These facilities were also recognized as Sustainability Lighthouses for breakthroughs in efficient operations. With our global reach, we ensure the global supply chain has access to the Flash memory it needs to keep our world moving forward.


Job Description


Cloud Architecture & Operations

  • Build and operate HPC environments on cloud platforms such as:
    • Amazon Web Services (AWS)
    • Microsoft Azure
    • Google Cloud Platform
  • Design hybrid-cloud and multi-cloud architectures for HPC workloads.

  • Implement cloud-native storage, networking, security, and disaster recovery solutions.

Infrastructure Automation & DevOps

  • Develop Infrastructure as Code (IaC) using:
    • Terraform
    • CloudFormation
    • Ansible

    • Python code

  • Build CI/CD pipelines for infrastructure and platform deployments.
  • Automate cluster provisioning, configuration management, monitoring, and patch management.
  • Develop self-service provisioning frameworks for research and engineering teams.

AI & Data Engineering

  • Design and implement scalable AI/ML data pipelines.
  • Build data ingestion, transformation, and orchestration frameworks.
  • Support distributed AI training and inference workloads.
  • Optimize GPU utilization for deep learning applications.
  • Collaborate with Data Scientists and ML Engineers to deploy production AI solutions.

Platform Monitoring & Reliability

  • Implement observability solutions using: Prometheus, Grafana, ELK Stack, OpenTelemetry
  • Monitor system performance, capacity planning, and SLA compliance.
  • Troubleshoot performance bottlenecks across compute, storage, network, and AI frameworks.

HPC Infrastructure Engineering

  • Design, deploy, and manage large-scale HPC clusters across on-premises and cloud environments.
  • Administer compute, storage, networking, and GPU resources for AI/ML and data-intensive workloads.
  • Optimize cluster performance, scheduling, and resource utilization using workload managers such as: Slurm, LSF, PBS Pro, Kubernetes

Security & Governance

  • Implement security best practices for HPC and cloud environments.
  • Manage IAM, secrets management, encryption, and compliance controls.
  • Support regulatory requirements and enterprise governance standards.

Qualifications


5+ years of experience in DevOps and Cloud infrastructure management

Technical Skills

  • Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or related field.
  • Strong experience with Linux system administration (RHEL, Rocky Linux, Ubuntu).
  • Experience managing HPC clusters and distributed computing environments.
  • Proficiency in Python, Bash, or Go.
  • Hands-on experience with: Terraform, Ansible, Git, Jenkins/GitHub Actions
  • Experience with container technologies: Docker, Kubernetes, Singularity/Apptainer
  • Knowledge of AI/ML frameworks: TensorFlow, PyTorch, Ray, Spark
  • Experience with GPU technologies and accelerator platforms.

Cloud Skills

  • AWS, Azure, or GCP architecture and operations.
  • Cloud networking, storage, and security services.
  • Hybrid cloud and HPC workload migration experience.

Additional Information


Sandisk thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution.

Sandisk is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at jobs.accommodations@sandisk.com to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

Share job
Similar Jobs
View All
1 Day ago
Network Engineer (WLAN / Switching / Software)
Information Technology
  • Chennai, TN, India
Network Engineer (WLAN / Switching / Software)This role has been designed as ‘Hybrid’ with an expectation that you will work on average 2 days per week from an HPE office. Who We Are: Hewlett Packard Enterprise is the global edge-to-cloud company adv...
decor
3 Days ago
Senior Web UI Developer - HTML, JavaScript, React.js
Information Technology
  • Chennai, TN, India
Chennai, Tamil Nadu Job Summary The Senior React.js Developer is responsible for developing, enhancing, and maintaining high-quality web applications that meet both client requirements and organizational standards. This role plays a crucial part in ...
decor
3 Days ago
AWS Devops Engineer
Information Technology
  • Chennai, TN, India
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be ab...
decor
3 Days ago
Senior Automation Test Lead Embedded
Information Technology
  • Chennai, TN, India
Lucknow, Uttar Pradesh Job Summary The Senior Test Lead will be responsible for overseeing the testing activities related to Testing Tools and Test Automation (EMB), selenium, java. The primary objective will be to ensure the quality and efficiency o...
decor
3 Days ago
Technical Lead-Cloud & Infra Engg
Information Technology
  • Chennai, TN, India
Country/Region: IN Requisition ID: 37003 Work Model: Position Type: Salary Range: Location: INDIA - CHENNAI - BIRLASOFT OFFICE Title: Technical Lead-Cloud & Infra Engg Description: Area(s) of responsibility Architecture & Solution Design Arch...
decor
3 Days ago
IT System Administrator Technical Specialist
Information Technology
  • Chennai, TN, India
About Us Ribbon Communications (Nasdaq: RBBN) delivers communications software, IP and optical networking solutions to service providers, enterprises and critical infrastructure sectors globally. We engage deeply with our customers, helping them mode...
decor
3 Days ago
Senior MLOps Technical Lead
Information Technology
  • Chennai, TN, India
Noida, Uttar Pradesh Job Summary Overview Senior‑level role on a five‑person engineering team building a production‑grade healthcare conversational GenAI platform. The stack centers on Python 3.12+ , FastAPI , and Google ADK 2.0 as the primary multi...
decor
3 Days ago
Senior Selenium Automation Tester - Cucumber, Java
Information Technology
  • Chennai, TN, India
Hyderabad, Telangana Job Summary The senior automation tester with expertise in cucumber, selenium, and Java will be responsible for developing, and executing automated tests to ensure the quality of software applications. This role will involve coll...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media