Data Science Fellow - NLP

Data Science Fellow - NLP

Company Description

Our focus is to help Biotech/BioPharma, MedTech, Digital Health, Healthcare, Insurance, and Tech organizations accelerate innovation, enable data-driven decisions, and create AI/ML applications. We leverage AI to solve today’s hardest challenges. We are driven to make the world a better place by providing consulting services and building applications to improve outcomes and help our clients stay ahead of the competition.

Our team is composed of creative and results-focused individuals who excel at solving real-world problems. Our diverse backgrounds bring technology and expertise from various disciplines including neuroscience, physics, engineering, computational biology, genomics, mathematics, and computer science.

Our culture is vibrant, connected, rooted in our core values statement that:
Our work matters. Our clients are partners. Our work is our reputation. We own our choices. We are always learning. We support and challenge each other.

This position is part-time and may convert to a permanent position for the right candidate. 

Job Description

MDS is seeking a data science fellow to contribute to the development and validation of Natural Language Processing (NLP) based approaches to accelerate the development of therapeutics and the discovery of biomarkers. The primary sources of data for this project will be biomedical literature.

This role is ideal for a results-focused individual who excels at solving problems at the cutting-edge of data science, with strong technical abilities to take on challenges in data mining, machine learning, data extraction/analysis, and algorithm development in life sciences and healthcare. This fellowship will be focused on the intersection of data science and NLP to extract meaningful insights from biomedical, scientific, and clinical text.

Strong candidates will be prepared to leverage techniques from NLP to address challenges in healthcare and life sciences. Strong candidates will have a deep understanding of how to extract entities and relationships from biomedical literature and biomedical data sets to accelerate drug and biomarker discovery and development. Experience in creating intuitive data visualizations is a plus. Background in bioinformatics, drug development, electronic medical records systems, or NLP is a plus, but not essential.

Read more about our ERGO platform ( that accelerates NLP solutions for science, medicine and technology.

This position is part-time and may convert to a permanent position for the right candidate. 


  • Explore data to generate hypotheses, research similar problems for ideas, brainstorm with teammates to strategize potential solutions, test their feasibility, and contribute to their development
  • Prepare for a data science career with guidance from professionals with a wide variety of backgrounds


  • Currently pursuing a Masters, PhD, or a Postdoctoral researcher in Computer Science, Math, Statistics, Physics, Computational Biology, Biomedical Informatics, or a closely related field
  • Passion for solving complex data science challenges in NLP focused on biomedical text
  • ​​​​​​​Machine Learning and Python experience
  • Comfortable working in a fast-paced, collaborative team environment

Additional Information

  • Background in bioinformatics, drug development, electronic medical records systems, or NLP
  • Expertise in designing and implementing efficient algorithms
  • Familiarity with large open source biomedical datasets and ontologies for drugs/chemicals, genes/proteins, diseases/conditions, etc.
  • Familiarity with NLP software libraries/models including spaCy, scispaCy, BERT, GPT3
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.