
Overview
About CoffeeBeans Consulting
CoffeeBeans is a fast-growing tech company delivering high-quality software and AI solutions to clients across industries. We combine deep technical expertise with an agile mindset to solve complex business problems. Our data science and GenAI teams work at the frontier of ML, NLP, and product development to build intelligent, scalable, and user-centric solutions.
Role Overview
As an L1 Data Scientist, you will work alongside experienced engineers and data scientists to solve real-world problems using ML and GenAI. In addition to classical data science work, we expect you to actively contribute to building and fine-tuning large language model (LLM)–based applications, including chatbots, copilots, and automation workflows.
This role is ideal for early-career professionals with a strong analytical mindset, a passion for ML/AI, and curiosity around the fast-evolving world of generative AI.
Key Responsibilities
Work with business stakeholders to understand problem statements and assist in translating them into data science tasks.
Perform data collection, cleaning, feature engineering, and exploratory data analysis.
Develop and evaluate machine learning models using Python and ML libraries (e.g., scikit-learn, XGBoost).
Assist in building LLM-based workflows such as RAG (Retrieval-Augmented Generation), prompt engineering, and fine-tuning for use cases like document summarization, query resolution, and task automation.
Contribute to developing GenAI-powered applications using tools like LangChain, OpenAI APIs, or similar LLM ecosystems.
Collaborate with engineers to build and test API endpoints, integrate models into applications, and monitor model performance post-deployment.
Maintain reproducible notebooks and clear documentation for both traditional ML and LLM experiments.
Stay up to date with trends in machine learning, NLP, and GenAI ecosystems and contribute insights to team discussions.
Required Skills & Qualifications
Bachelor’s or Master’s degree in Computer Science, Engineering, Mathematics, Statistics, or a related field.
0–3 years of experience in data science, ML, or AI-focused roles (internships or projects count).
Proficiency in Python, with exposure to libraries such as pandas, NumPy, scikit-learn, and matplotlib.
Basic experience working with LLMs (e.g., OpenAI, Cohere, Mistral, Hugging Face) or an interest and ability to learn quickly.
Understanding of NLP fundamentals and vector-based retrieval techniques is a plus.
Familiarity with SQL and working with structured data sources.
Clear communication, curiosity, and a willingness to take initiative.
Good-to-Have (Not Mandatory)
Exposure to building GenAI prototypes using LangChain, LlamaIndex, or similar frameworks.
Understanding of REST APIs and basics of integrating models into backend systems.
Experience with cloud platforms (AWS/GCP/Azure), Docker, or Git.
Why Join CoffeeBeans?
Work on cutting-edge GenAI and ML use cases across fintech, retail, healthcare, and more.
Fast learning curve with mentorship from senior data scientists, engineers, and architects.
Flat, collaborative culture that rewards ownership and curiosity.
Strong emphasis on product thinking, experimentation, and impact.