Overview
Job Title: Data Scientist
Location: Vadodara (preferred) or Remote
Experience Level: Entry to Mid-level (0–5 years)
Employment Type: Full-Time
Job Summary:
We are seeking a skilled and driven Data Scientist with experience in Big Data, Large Language Models (LLMs), data organization, data analytics and advanced Excel. The ideal candidate will have a strong analytical background, a passion for uncovering insights from complex datasets and the ability to communicate data-driven findings to both technical and non-technical stakeholders. You will be instrumental in designing and implementing data solutions that support business decision-making and innovation.
Key Responsibilities:
· Collect, clean and organize large and complex datasets from multiple sources.
· Develop and deploy predictive models and machine learning algorithms, including applications involving LLMs (e.g., GPT, BERT).
· Analyse structured and unstructured data to generate actionable insights for business strategies.
· Collaborate with cross-functional teams to identify data-related opportunities and deliver data-driven solutions.
· Build scalable data pipelines and contribute to data architecture best practices.
· Design and maintain advanced Excel dashboards, models and reports to support various departments.
· Apply statistical and data mining techniques to interpret and visualize trends, patterns and correlations.
· Present findings clearly through reports, visualizations and presentations tailored to different audiences.
· Stay current on emerging data technologies, tools and methodologies.
Required Qualifications:
· Bachelor’s or Master’s degree in Data Science, Computer Science, Statistics, Mathematics or a related field.
· 0 to 5 years of experience in a data science or analytics role.
· Advanced skills in Microsoft Excel, including Power Query, pivot tables, VBA/macros and complex formulas.
· Solid understanding of relational databases and SQL.
· Strong communication and problem-solving skills.
Preferred Qualifications:
· Hands-on experience with Big Data technologies (e.g., Hadoop, Spark, Hive).
· Experience with cloud platforms (e.g., AWS, Azure, GCP) for data processing and model deployment.
· Working knowledge of, or experience with, Large Language Models (LLMs) and NLP techniques.
· Proficiency in Python, R or another data science programming language.
· Expertise in data analytics, data visualization (e.g., Tableau, Power BI, Matplotlib) and data wrangling.
· Familiarity with MLOps and model lifecycle management.
· Knowledge of version control tools such as Git.
· Experience with APIs and integrating external data sources.