Senior Data Scientist/LLM Engineer
Elixi
Description
Elixi is an AI-driven HealthTech startup with a mission to democratize long-term health by transforming scattered data into insights and recommendations people can trust and act on. We’re led by a successful tech entrepreneur who has built a global decacorn ($10B+) before — and has the ambition and resources to do it again.
This is a chance to join early and directly influence what we build, how we build it, and the culture we create along the way. We operate as a small, relentless team with strong ownership and high standards — if you’re driven to achieve greatness and grow alongside ambitious peers, you’ll feel at home.
About the role
We are seeking a highly skilled and motivated Senior Data Scientist with expertise in Large Language Models (LLMs) to join our dynamic health tech startup. As a Data Scientist, you will play a crucial role in analysing user data, developing predictive models, and creating innovative algorithms that drive our platform's functionalities.
What you'll be doing
- Gather, clean, and preprocess user data to ensure accuracy and relevance in health assessments.
- Design, create, and implement machine learning models, particularly LLMs and RAGs, to generate personalized health advice tailored to individual user profiles.
- Collaborate with health practitioners and engineering teams to translate medical insights into effective data-driven applications.
- Analyze testing results and user feedback to refine recommendation algorithms, improving overall accuracy and user satisfaction.
- Support in developing user-friendly tools that gamify health checklists and tracking, enhancing user engagement and adherence to recommendations.
- Present findings and analytical insights to stakeholders clearly, ensuring effective communication of complex data concepts.
What you'll need
- Proven experience in Data Science or a similar role, with a focus on machine learning and predictive modeling.
- Bachelor’s degree in a quantitative discipline (Mathematics, Statistics, Computer Science, Engineering); Master’s or PhD is a significant plus.
- Hands-on experience with Large Language Models (LLMs), especially Retrieval Augmented Generation models (RAGs), including training and deployment of models like GPT or BERT.
- Proficiency in Python and experience with data manipulation libraries (Pandas, NumPy) and machine learning frameworks (TensorFlow, PyTorch).
- Strong SQL skills for querying and managing large datasets.
- Solid understanding of statistics with the ability to evaluate model performance effectively.
- Ability to communicate complex data findings to non-technical stakeholders clearly.
- Experience with data visualization tools (e.g., Tableau, Matplotlib) to present insights.
Nice to have
- Experience in the health tech industry or knowledge of healthcare data regulations (e.g., HIPAA).
- Familiarity with cloud platforms (AWS, Google Cloud, Azure) for deploying models and managing data pipelines.
- Understanding of DevOps practices, including version control (Git) and CI/CD.
- Knowledge of gamification principles for designing engaging user experiences.
- Strong problem-solving skills with a proactive approach to data challenges.