Overview
We are looking for a Data Scientist for PB Health, the innovative healthcare arm of Policybazaar, to join us on our mission to revolutionize the patient healthcare journey through a cutting-edge digital hospital ecosystem.
PB Health, a subsidiary of PB Fintech, is creating an integrated healthcare platform to connect patients, healthcare providers, and insurers. The platform aims to simplify the healthcare experience, reduce paperwork, and improve access to care and insurance benefits using advanced AI technologies. PB Health is also expanding its hospital network, starting with a 1,000-bed facility in the National Capital Region (NCR), and has raised $218 million to accelerate growth and innovation.
About the AI Lab at PB Health
The AI Lab at PB Health is at the forefront of innovation, tackling advanced challenges in processing and analyzing audio and text data at scale. Our work spans several domains, including classical ML models, speech recognition, natural language processing (NLP), and large language models (LLMs). We are dedicated to transforming theoretical research into practical, cutting-edge products that enhance the customer experience.
Job Title: Data Scientist
Key Responsibilities:
∙ Collect, pre-process, and annotate datasets for training large language models.
∙ Develop, fine-tune, and evaluate conversational AI models.
∙ Conduct independent and collaborative research to explore new techniques in AI/LLM development.
∙ Stay current with the latest advancements in AI, machine learning, and NLP, and apply relevant findings to ongoing projects.
∙ Collaborate cross-functionally with data scientists, engineers, and product teams to integrate models into production systems.
Qualifications:
∙ Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Data Science, or a related field. A PhD is a plus.
∙ Experience working with large language models such as GPT, BERT, T5, etc.
∙ Strong programming skills in Python and familiarity with machine learning frameworks like PyTorch or TensorFlow.
∙ Proven experience in dataset preparation and annotation for NLP tasks.
∙ Strong understanding of natural language understanding, generation, and transfer learning techniques.
∙ Excellent problem-solving skills and attention to detail.