Overview
Pune
About Us
We empower enterprises globally through intelligent, creative, and insightful services for data integration, data analytics and data visualization.
Hoonartek is a leader in enterprise transformation, data engineering and an acknowledged world-class Ab Initio delivery partner.
Using centuries of cumulative experience, research and leadership, we help our clients eliminate the complexities & risk of legacy modernization and safely deliver big data hubs, operational data integration, business intelligence, risk & compliance solutions and traditional data warehouses & marts.
At Hoonartek, we work to ensure that our customers, partners and employees all benefit from our unstinting commitment to delivery, quality and value. Hoonartek is increasingly the choice for customers seeking a trusted partner of vision, value and integrity
How We Work?
Define, Design and Deliver (D3) is our in-house delivery philosophy. It’s culled from agile and rapid methodologies and focused on ‘just enough design’. We embrace this philosophy in everything we do, leading to numerous client success stories and indeed to our own success.
We embrace change, empowering and trusting our people and building long and valuable relationships with our employees, our customers and our partners. We work flexibly, even adopting traditional/waterfall methods where circumstances demand it. At Hoonartek, the focus is always on delivery and value.
Job Description
We are seeking a proactive and technically strong Site Reliability Engineer (SRE) to ensure the stability, performance, and scalability of our Data Engineering Platform. You will work on cutting-edge technologies including Cloudera Hadoop, Spark, Airflow, NiFi, and Kubernetes—ensuring high availability and driving automation to support massive-scale data workloads, especially in the telecom domain. Key Responsibilities • • Ensure platform uptime and application health as per SLOs/KPIs • • Monitor infrastructure and applications using ELK, Prometheus, Zabbix, etc. • • Debug and resolve complex production issues, performing root cause analysis • • Automate routine tasks and implement self-healing systems • • Design and maintain dashboards, alerts, and operational playbooks • • Participate in incident management, problem resolution, and RCA documentation • • Own and update SOPs for repeatable processes • • Collaborate with L3 and Product teams for deeper issue resolution • • Support and guide L1 operations team • • Conduct periodic system maintenance and performance tuning • • Respond to user data requests and ensure timely resolution • • Address and mitigate security vulnerabilities and compliance issues Technical Skillset • • Hands-on with Spark, Hive, Cloudera Hadoop, Kafka, Ranger • • Strong Linux fundamentals and scripting (Python, Shell) • • Experience with Apache NiFi, Airflow, Yarn, and Zookeeper • • Proficient in monitoring and observability tools: ELK Stack, Prometheus, Loki • • Working knowledge of Kubernetes, Docker, Jenkins CI/CD pipelines • • Strong SQL skills (Oracle/Exadata preferred) • • Familiarity with DataHub, DataMesh, and security best practices is a plus
SHIFT - 24/7
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in