Senior Data Scientist

Senior Data Scientist

This job is no longer open

Company Summary

Come join First American's Digital Title Group, newly formed to re-imagine and digitize the title search and examination process through Big Data, AI, document automation and modern, cloud-native application development. As a market leading title insurance company, powered by the nation's largest and most complete property information, ownership and recorded document database, First American is committed to advancing title automation and removing friction from the real estate closing process. Our modern title decisioning solutions create certainty and speed through data and analytics, delivered to real estate agents, lenders, title agents and homebuyers. Join a team that puts its People First! Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and empowered to be innovative and reach their full potential. Our inclusive, people-first culture has earned our company numerous accolades, including being named to the Fortune 100 Best Companies to Work For® list for seven consecutive years. We have also earned awards as a best place to work for women, diversity and LGBTQ+ employees, and have been included on more than 50 regional best places to work lists. First American will always strive to be a great place to work, for all. For more information, please visit www.careers.firstam.com.

Job Summary

First American'sDigital Title Groupis newly formed to drive a generational paradigm shift in the way real estate is transacted. Gone will be the days of time consuming manual Title search & examination for every property.  We are re-architecting how our industry works by leveraging our unique living title data, automated search & examination processes, and data driven risk decisioning.  Our approach will remove friction from the real estate closing process, create certainty and speed through data and analytics, and offer a seamless delivery experience to real estate agents, lenders, title agents and homebuyers. This will not only transform our industry, but also directly impact >90% of First Americans $9B+ in annual revenues, a mission and team we will invest hundreds of millions of dollars behind over the next 3-5 years.

We are looking for aSenior Data Scientistto build and deploy Natural Language processing (NLP) models utilizing a variety of Machine learning and deep learning techniques. The role will present opportunities to work on large datasets and the ability to use innovative techniques in Artificial Intelligence ranging from various NLP methods, computer vision, and deep learning to enable solutions that will be directly impactful to our customers.

ABOUT THE JOB

  • Perform exploratory analysis, construct data pipelines, build machine learning models end-to-end from POC to deployment for large scale production systems

  • Monitor, maintain, optimize and continuously improve the deployed machine learning solutions during day to day operations

  • Deploy models through docker containers on AWS/GCP/Azure that serve real time and batch prediction results for various business functions

  • Design and implement scalable models with continuous monitoring and feedback collection system to enableautomatic model training

  • Optimize model and data performance in terms of reduced computation time and cost

  • Establish MDM system to track model performances and generate alerts through MLFlow, AWS Sagemaker, etc.

ABOUT YOU

  • PhD in quantitative field such as Mathematics, Statistics, Computer Science with 2+ years related work experience / MS with 5+ years of related work experience.

  • Extensive experience developing end-to-end machine learning solutions and leading solution diagnosis, including designing & architecting machine learning models that solve business problems & fit into the overall engineering framework, experimentation, model pipeline build, performance optimization, integration and deployment

  • Proficiency in machine learning, NLP & deep learning techniques for tasks involving Named Entity Recognition, document/sentence embeddings, classification. 

  • Proficiency with programming languages such as Python & SQL as well as toolkits/frameworks including spaCy, gensim, PyTorch/Tensorflow.

  • Strong experience in building end-to-end data pipelines, model performance monitoring processes and continuous model delivery. Knowledge ofAWS Lambda, AWS Glue is a big plus.

  • Familiarity with MLOps & common MLOps toolkits, e.g. MLflow, Sagemaker.

  • Knowledge and experience working with engineering toolkits that are frequently used with machine learning model deployment. e.g., Git,GitHub Actions,Docker, AWS EC2, AWS ECS, AWS ECR etc.

  • Familiarity with large scale data processing techniques & tools, e.g., multi-threaded computing, GPU computing, distributed computing in Ray, PySpark, etc.

Pay Range: $94,800 - $241,800 annually

This hiring range is a good faith and reasonable estimate of the salary range of possible compensation at the time of the posting, and is subject to change. The actual compensation offered will be determined by various factors, which may include a candidate’s education, training, experience, and geographic location.

#LI-EL1

#LI-REMOTE

First American invests in its employees' development and well-being, empowers them to provide superior customer service and encourages them to serve the communities where they live and work. First American is committed to diversity and inclusion. We are an equal opportunity employer.

Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan.

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.