Free cookie consent management tool by TermsFeed Data Scraping Engineer | Antal Tech Jobs
Back to Jobs
1 Week ago

Data Scraping Engineer

decor
Chennai, Tamil Nadu, India
Information Technology
Other
Futurism

Overview

ID: 849 | 2-5 yrs | India | careers

Job Title: Data Scraping Engineer

Job Location: Baner, Pune

Experience: 2 to 5 Years

Shift: Monday to Friday (10:00 AM to 7:00 PM IST)

Qualification: BTech, MBA


Job Objective:

Futurism Technologies is looking for Data Scraping Engineer with 2+ years of experience scraping data from high-security websites. The ideal candidate will be proficient in traditional and AI-driven scraping techniques, capable of bypassing complex anti-bot systems, and skilled in filtering, modifying, and storing large-scale structured data. Strong command of Python, Excel and Screaming Frog SEO Spider is also essential for data analysis and website auditing.


Key Responsibilities:

    Develop robust scraping pipelines for websites with advanced bot protection (CAPTCHA, Cloudflare, rate limiting).
    Implement and leverage AI/ML techniques (e.g., visual DOM parsing, content classification, anomaly detection) to enhance scraping capabilities where traditional methods fall short.
    Use Screaming Frog SEO Spider for comprehensive crawling, data extraction, and SEO-focused analysis.
    Use Python to Scrap the high-security websites.
    Work with headless browsers (Playwright, Puppeteer) to render and extract dynamic JavaScript content.
    Clean, transform, and structure raw data for business-ready consumption using Excel (advanced formulas, pivot tables, lookups, macros, etc.).
    Store and manage scraped data in databases like MongoDB, PostgreSQL, or structured file formats (CSV, JSON).
    Create automated, fault-tolerant scraping jobs with retry logic, proxy rotation, and alerting systems.
    Stay up to date with scraping trends, legal compliance, and AI tools to optimize workflows.


Required Skills:

    Proficiency in Python and scraping libraries (Scrapy, Selenium, Playwright, BeautifulSoup).
    Hands-on experience with anti-bot bypass techniques (proxy rotation, CAPTCHA solving, header spoofing).
    Strong Excel knowledge – including advanced data manipulation and automation (macros, formulas, VBA optional).
    Working knowledge of Screaming Frog SEO Spider for crawling and extracting structured website data.
    Exposure to AI-based scraping enhancements, such as:
    Visual DOM recognition using ML/computer vision
    NLP for parsing unstructured content
    Content-type classifiers or dynamic selector generators
    Experience with structured data handling in MongoDB, MySQL/PostgreSQL, and flat file formats.
    Familiarity with XPath, CSS selectors, regex, and dynamic content handling.


Nice to Have:

    Familiarity with Docker, CI/CD pipelines, and cloud environments (AWS, GCP).
    Experience integrating with external APIs or handling real-time data feeds.
    Bash scripting or task automation (Airflow, Cron jobs).
    Understanding of ethical/legal considerations around scraping.


What We’re Looking For:

    An engineer who thinks outside the box and solves scraping challenges creatively.
    Passion for automation and data accuracy.
    Someone who’s hands-on, detail-focused, and eager to work with cutting-edge scraping and AI tech.

Share job
Similar Jobs
View All
1 Day ago
TrueFan - Senior Machine Learning Engineer
Information Technology
  • Thiruvananthapuram, Kerala, India
About UsTrueFan is at the forefront of AI-driven content generation, leveraging cutting-edge generative models to build next-generation products. Our mission is to redefine content generation space through advanced AI technologies, including deep ge...
decor
1 Day ago
Salesforce commerce cloud consultant
Information Technology
  • Thiruvananthapuram, Kerala, India
Salesforce Commerce Cloud consultant  5+ Years of Experience 6 to 12 months Mode - Remote 1.1LPM - 1.2LPM Max Key Responsibilities Translate business requirements into scalable Salesforce Service Cloud solutions, in collaboration with CAE's technic...
decor
1 Day ago
Cloud Infrastructure Engineer
Information Technology
  • Thiruvananthapuram, Kerala, India
DescriptionInvent the future with us. Recognized by Fast Company’s 2023 100 Best Workplaces for Innovators List, Ampere is a semiconductor design company for a new era, leading the future of computing with an innovative approach to CPU design focuse...
decor
1 Day ago
Devops Engineer- Intermetiate
Information Technology
  • Thiruvananthapuram, Kerala, India
BackJD: Dev ops Engineer:As a DevOps Specialist- should be able to take ownership of the entire DevOps process, including Automated CI/CD pipelines and deployment to production.They should also be comfortable with risk analysis and prioritization.Le...
decor
1 Day ago
Sr Data Scientist (London)
Information Technology
  • Thiruvananthapuram, Kerala, India
AryaXAI stands at the forefront of AI innovation, revolutionizing AI for mission-critical, highly regulated industries by building explainable, safe, and aligned systems that scale responsibly. Our mission is to create AI tools that empower research...
decor
1 Day ago
Software Test Engineer
Information Technology
  • Thiruvananthapuram, Kerala, India
By clicking the “Apply” button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Takeda’s Privacy Notice and Terms of Use. I further att...
decor
1 Day ago
Software Developer 5 (Java Fullstack)
Information Technology
  • Thiruvananthapuram, Kerala, India
Job DescriptionBuilding off our Cloud momentum, Oracle has formed a new organization - Oracle Health Applications & Infrastructure. This team focuses on product development and product strategy for Oracle Health, while building out a complete platfo...
decor
1 Day ago
Java Developer - Spring Frameworks
Information Technology
  • Thiruvananthapuram, Kerala, India
Java DescriptionWe are looking for a passionate and talented Java Developer with 2-3 years of hands-on experience to join our growing development team.The ideal candidate should have a strong foundation in Java technologies and the ability to develo...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media