Sr. Data Engineer - ML Pipelines

Sr. Data Engineer - ML Pipelines

This job is no longer open

Sr Data Engineer - ML Pipelines 

Overjet is on a mission to improve oral healthcare for all. 

Our cutting-edge artificial intelligence technology encodes dentist-level training and analysis into scalable software tools. Today, our flagship products are used by some of the country’s largest insurance companies, dental support organizations, and dental practices to enable the best patient care. 

We’re looking for a Data Engineer with strong Python coding skills to join our growing ML Infrastructure team. This hire will be responsible for expanding and optimizing our data and machine learning pipeline architecture. 

Responsibilities

  • Responsible for building and running end-to-end data pipelines and operations from ingestion and integration through delivery for the data products.
  • Guide the development of data resources, support new product launches and improve product runtime performance.
  • Setting up data and cloud environments to make data science more efficient
  • Build tools to automate data import pipeline for different data systems
  • Build dashboards to empower business insights and actions

 

Qualifications

  • Exceptional skills in Java, Scala, Go, Python or equivalent.
  • Experience with SQL (ex. PostgreSQL) and NoSQL (ex. MongoDB)
  • ETL and ELT pipelines
  • Data processing and job orchestration
  • Experience using messaging platforms like Google Pub/Sub, AWS SQS, Kafka, etc.
  • Experience with data validation and data schema tools like Avro, ProtoBuf, etc.
  • Quickly learning new tools
  • Experience with GCP, AWS or other major public cloud
  • Experience with machine learning lifecycle management tools like MLFlow and Kubeflow is a plus
  • Experience with Docker and Kubernetes is a plus

 

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.