Senior Database Infrastructure Engineer (Cassandra / DataStax / Big Data Pipelines) | Antal Tech Jobs
2 Days ago

Senior Database Infrastructure Engineer (Cassandra / DataStax / Big Data Pipelines)

Bangalore, Karnataka, India
Information Technology
Full-Time
HEROIC Cybersecurity

Overview

HEROIC Cybersecurity (HEROIC.com) is seeking a Senior Data Infrastructure Engineer with deep expertise in DataStax Enterprise (DSE) and Apache Cassandra to help architect, scale, and maintain the data infrastructure that powers our cybersecurity intelligence platforms.

You will be responsible for designing and managing fully automated big data pipelines that ingest, process, and serve hundreds of billions of breached and leaked records sourced from the surface, deep, and dark web. You'll work with DSE Cassandra, Solr, and Spark, helping us move toward a 99% automated pipeline for data ingestion, enrichment, deduplication, and indexing, all built for scale, speed, and reliability.

This position is critical in ensuring our systems are fast, reliable, and resilient as we ingest thousands of unique datasets daily from global threat intelligence sources.

KEY RESPONSIBILITIES:

  • Design, deploy, and maintain high-performance Cassandra clusters using DataStax Enterprise (DSE)
  • Architect and optimize automated data pipelines to ingest, clean, enrich, and store billions of records daily
  • Configure and manage DSE Solr and Spark to support search and distributed processing at scale
  • Automate dataset ingestion workflows from unstructured surface, deep, and dark web sources
  • Own cluster management, replication strategy, capacity planning, and performance tuning
  • Ensure data integrity, availability, and security across all distributed systems
  • Write and manage ETL processes, scripts, and APIs to support data flow automation
  • Monitor systems for bottlenecks, optimize queries and indexes, and resolve production issues
  • Research and integrate third-party data tools or AI-based enhancements (e.g., smart data parsing, deduplication, ML-based classification)
  • Collaborate with engineering, data science, and product teams to support HEROIC’s AI-powered cybersecurity platform

REQUIREMENTS:

  • 5+ years of experience with Cassandra / DataStax Enterprise in production environments
  • Hands-on experience with DSE Cassandra, Solr, Apache Spark, CQL, and data modeling at scale
  • Strong understanding of NoSQL architecture, sharding, replication, and high availability
  • Advanced knowledge of Linux/Unix, shell scripting, and automation tools (e.g., Ansible, Terraform)
  • Proficient in at least one programming language: Python, Java, or Scala
  • Experience building large-scale automated data ingestion systems or ETL workflows
  • Solid grasp of AI-enhanced data processing, including smart cleaning, deduplication, and classification
  • Excellent written and spoken English communication skills
  • Prior experience with cybersecurity or dark web data (preferred but not required)

WHAT WE OFFER:

  • Position Type: Full-time
  • Location: Pune, India (Remote – Work from anywhere)
  • Compensation: Competitive salary based on experience
  • Professional Growth: Amazing upward mobility in a rapidly expanding company.
  • Innovative Culture: Fast-paced, innovative, and mission-driven. Be part of a team that leverages AI and cutting-edge technologies.
About Us: HEROIC Cybersecurity (HEROIC.com) is building the future of cybersecurity. Unlike traditional cybersecurity solutions, HEROIC takes a predictive and proactive approach to intelligently secure our users before an attack or threat occurs. Our work environment is fast-paced, challenging and exciting. At HEROIC, you’ll work with a team of passionate, engaged individuals dedicated to intelligently securing the technology of people all over the world.

Keywords & Technologies Used: DataStax Enterprise (DSE), Apache Cassandra, Apache Spark, Apache Solr, AWS, Jira, NoSQL, CQL (Cassandra Query Language), Data Modeling, Data Replication, ETL Pipelines, Data Deduplication, Data Lake, Linux/Unix Administration, Bash, Docker, Kubernetes, CI/CD, Python, Java, Distributed Systems, Cluster Management, Performance Tuning, High Availability, Disaster Recovery, AI-based Automation, Artificial Intelligence, Big Data, Dark Web Data