Hiring Data Analyst (Web Scraping using Python) | Antal Tech Jobs
17 Weeks ago

Hiring Data Analyst (Web Scraping using Python)

214318 - 1051540 INR - Annual
Pune, India
Information Technology
Full-Time
Texila Educare Healthcare and Technology Enterprise Pvt Ltd

Overview

Experience: 2 Years

Key Responsibilities:

  • Develop and Maintain Web Scraping Scripts: Build efficient, scalable, and robust web scraping tools using Python and relevant libraries (e.g., BeautifulSoup, Scrapy, Selenium).
  • Data Extraction: Extract structured and unstructured data from websites and APIs, focusing on gathering high-quality and clean datasets.
  • Data Processing and Storage: Process, clean, and store extracted data in databases (SQL/NoSQL) or data warehouses, ensuring it's ready for analysis and reporting.
  • Website Parsing and HTML Manipulation: Parse complex HTML structures and interact with websites that require JavaScript rendering.
  • Error Handling and Logging: Develop error handling and logging mechanisms to ensure scripts run reliably and provide useful diagnostics when failures occur.
  • Automation and Scheduling: Automate scraping jobs to run on a regular basis using task schedulers (e.g., cron jobs) and ensure minimal downtime.
  • Ensure Compliance: Implement scraping systems that comply with website Terms of Service, robots.txt directives, and applicable laws (e.g., GDPR and copyright law).
  • Optimize Performance: Optimize scraping performance for speed and reliability. Handle rate limits, CAPTCHAs, and IP blocking mechanisms to ensure smooth operations.
  • Documentation and Reporting: Maintain clear documentation of scraping processes, data flows, and any issues encountered. Provide status updates and reports to stakeholders.
  • Collaboration: Work closely with data analysts, product teams, and engineers to ensure data quality and availability for decision-making processes.
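
As an illustration of the compliance and logging responsibilities above, here is a minimal sketch that checks URLs against robots.txt rules before fetching and logs anything it skips. It uses only the Python standard library. The robots.txt content, the "example-bot" user agent, the example.com URLs, and the helper names build_parser and allowed_urls are all hypothetical, and a real scraper would load robots.txt from the target site rather than from a string.

```python
import logging
from urllib.robotparser import RobotFileParser

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("scraper")

# Hypothetical robots.txt content, inlined so the sketch runs offline.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 2
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Build a robots.txt parser from raw text (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


def allowed_urls(parser: RobotFileParser, user_agent: str, urls: list[str]) -> list[str]:
    """Return the URLs the user agent may fetch, logging the ones skipped."""
    allowed = []
    for url in urls:
        if parser.can_fetch(user_agent, url):
            allowed.append(url)
        else:
            logger.warning("Skipping %s: disallowed by robots.txt", url)
    return allowed


parser = build_parser(ROBOTS_TXT)
urls = [
    "https://example.com/jobs",
    "https://example.com/private/data",
]
to_fetch = allowed_urls(parser, "example-bot", urls)

# Crawl-delay, if present, can be used to pace requests between fetches.
delay = parser.crawl_delay("example-bot")
```

The crawl delay read at the end could feed a sleep between requests, which is one simple way to respect rate limits.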

Required Skills and Qualifications:

  • Proficiency in Python: Strong experience with Python, especially in libraries like BeautifulSoup, Scrapy, Requests, Selenium, and Pandas.
  • Web Scraping Frameworks: Experience with scraping tools such as Scrapy, Selenium, or Puppeteer.
  • HTML, CSS, JavaScript: Deep understanding of web technologies, including HTML, CSS, and JavaScript to navigate websites and handle dynamic content.
  • Data Manipulation and Storage: Experience with SQL and NoSQL databases (e.g., MySQL, PostgreSQL, MongoDB) and data processing libraries (e.g., Pandas).
  • APIs: Experience working with RESTful APIs to extract or push data.
  • Data Formats: Knowledge of data formats like JSON, XML, CSV, and how to parse/handle them.
  • Error Handling and Debugging: Strong skills in troubleshooting, debugging, and optimizing web scraping operations.
  • Networking and HTTP Protocols: Familiarity with HTTP requests, headers, cookies, and web scraping proxies (e.g., rotating proxies, IP management, VPNs).
  • Version Control: Experience using version control systems like Git.
  • Problem Solving and Critical Thinking: Ability to handle complex scraping challenges like dynamic content, CAPTCHA, JavaScript rendering, etc.

Preferred Qualifications:

  • Experience with Cloud Technologies: Familiarity with cloud platforms such as AWS, Google Cloud, or Azure for scalable scraping and storage solutions.
  • Distributed Systems: Experience with managing distributed web scraping jobs using tools like Celery, RabbitMQ, or Kubernetes.
  • Data Quality and Validation: Experience in data validation, cleaning, and transforming data for downstream processes.
  • Knowledge of Machine Learning: Familiarity with applying machine learning techniques to parse and extract data from semi-structured or unstructured sources.

Job Type: Full-time

Pay: ₹214,318.07 - ₹1,051,539.21 per year

Schedule:

  • Day shift

Experience:

  • Total work experience: 2 years (Preferred)

Work Location: In person
