Chennai, Tamil Nadu, India
Information Technology
Full-Time
Netscribes
Overview
Responsibilities
The core requirements for the job include the following:
- Design, build, and maintain web scraping applications to ingest competitor data from various online retail platforms.
- Develop RESTful APIs in Python (FastAPI / Flask / Django) to serve extracted data for downstream consumption.
- Containerize and deploy API services using Docker and orchestrate deployments via Azure DevOps / GitHub Actions.
- Utilize Azure services such as Azure Functions, Azure App Services, Azure Data Lake, Azure Key Vault, and Azure Monitor for end-to-end deployment and monitoring.
- Implement logging, exception handling, and retry mechanisms to build robust scraping and ingestion pipelines.
- Integrate open-source libraries (e. g., BeautifulSoup, Scrapy, Playwright, Puppeteer, Selenium) for scalable web data extraction.
- Collaborate with Data Scientists and Analysts to make the scraped data queryable and usable for analytics.
- Build dashboards or simple frontends (if needed) to monitor scraping coverage and API health.
- Work with Git-based version control, agile tools (JIRA/Confluence), and participate in code reviews and CI/CD cycles.
The core requirements for the job include the following:
- Strong proficiency in Python (3 x) - with production experience
- REST API development using FastAPI, Flask, or Django REST Framework
- Experience with Web Scraping tools - BeautifulSoup, Scrapy, Playwright/Selenium
- JSON handling, data transformation, and exception management
- Azure Cloud services experience: Azure App Services, Azure Functions, Azure Key Vault, Azure Blob / Data Lake, Azure Monitor / Application Insights. CI/CD using Azure DevOps or GitHub Actions.
- Docker-based development and deployment.
- Good understanding of data formats (JSON, CSV, Parquet).
- Experience working with relational databases (SQL) and awareness of NoSQL (MongoDB/Cosmos DB).
- Basic knowledge of data pipelines, data validation, and ETL design patterns.
- Working knowledge of open-source libraries and tools for scraping, APIs, or lightweight frontends.
- Git and collaborative development workflows.
- Basic front-end skills: HTML/CSS/JavaScript (for monitoring dashboards or UI components).
- Knowledge of Azure Data Factory, Event Grid, or Azure Logic Apps.
- Familiarity with retail/fashion data models like product attributes, pricing, and catalog taxonomy.
- Experience with ElasticSearch or BigQuery for storing and querying semi-structured data.
- Exposure to DataOps/MLOps pipelines.
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in