Senior NLP Data Scientist

Senior NLP Data Scientist

This job is no longer open
An NLP Data Scientist at Lightcast works with the Product, Data, and Classifiers & Extractors teams to produce state-of-the-art models and enriched datasets that address our customers’ most pressing problems about the future of work. This is a senior, tech leadership role and will have a strong emphasis on mentoring the team and leading the team to deliver state-of-the-art document understanding capabilities. A successful candidate will have a strong knowledge of current NLP capabilities and a track record of delivering NLP capabilities at scale.

Responsibilities:

    • Work cross-functionally to define problem statements, collect data, build models, and make recommendations to stakeholders and technical leaders.
    • Keep abreast of advancements in the field and deliver state-of-the-art solutions. 
    • Lead efforts to expand, augment, and enhance labeled datasets.
    • Work closely with Classifiers and Extractors Dev team and Data Engineering team to integrate models and deliver to production environments.
    • Spearhead model development for new and existing classifiers and extractors that process billions of documents at a time.
    • Champion the incorporation of new tools/methods/techniques.
    • Work with and manage very large (and messy) data sets; build ML pipelines that scale to work with this data effectively. Experience working with and designing continuous machine learning pipelines is required.
    • Manage large and/or complex projects to completion. Make business/technology trade-offs when appropriate.
    • Mentor a team of NLP data scientists and machine learning engineers with varying levels of experience. 
    • Contribute to positive team culture by participating in team-building activities and being a supportive and empathetic colleague.

Knowledge, Skills, Abilities:

    • Proven expertise building NLP models with deep learning as well as traditional methods. 
    • Awareness of and experience with current state-of-the-art language modeling architectures. 
    • Experience with self-supervised and semi-supervised learning.
    • Proficiency with Python. Proficiency in the C programming language or similar (D, C++) a plus.
    • Experience delivering accurate, robust, and performant NLP capabilities at scale. 

Credentials & Experience:

    • MS/PhD in Computational Linguistics, Engineering Mathematics, Statistics, Physics, Operations Research, or related field
    • 5+ years of industry experience solving analytical problems and building models using quantitative, statistical or machine learning approaches.
    • Strong experience with current NLP capabilities and delivering NLP capabilities at scale.
Lightcast is proud to be an equal opportunity workplace and is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. Lightcast has always been, and always will be, committed to diversity, equity, and inclusion. We seek dynamic professionals from all backgrounds to join our team, and we encourage our employees to bring their authentic, original, and best selves to work.

#LI-Remote
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.