Baton Health

2-10 employees
With the first Universal Primary Source, Baton brings fragmented provider information into one place to make primary source verification lightning fast.

Senior Data Engineer

About the company 

Baton Health is on a mission to modernize healthcare credentialing by eliminating costly and manual processes. 

Healthcare credentialing can be summarized as “background checks for doctors” and requires verification of data from hundreds of sources. Baton Health is creating a single Universal Primary Source that federates these disparate data sources into one. In our short time, we’ve built a rapidly scalable data ingestion system, a schema normalization engine, and a powerful data model for federating and delivering data from upwards of a thousand sources. Our existing stack includes Python, dbt, Snowflake, Prefect, and Postgres, amongst other tools and infrastructure.

About the role

We are looking for a Senior Data Engineer to join and directly shape our early-stage team. You will extend the build-out of our ingestion infrastructure and data pipelines from end to end. You will help configure and maintain a data integration service based on the modern data stack, including a cloud data warehouse, a data transformation layer, and workflow orchestration. You will need to think deeply about the details of our various data sources, be creative in how we assess and ensure data quality, and be critical in how we evaluate technologies that serve our use cases. This role will play a crucial part in powering our product and building core business lines.

You will report to the CEO and work remotely along with a Director of Product and a Data Engineer, with opportunities to meet and work in person in NYC. 

What you’ll do

  • Steer the build of data pipelines to external and internal data sources and work with both structured and unstructured data to efficiently extract and load various sources into our data warehouse. 
  • Design and implement data orchestration using tools like Prefect to manage ingestion, processing, and data flow across all the components of the data infrastructure. 
  • Oversee the processing, deduplication, and reconciliation of data from different sources using Snowflake and dbt. 
  • Develop and oversee a state-of-the-art entity resolution system that links all primary source data associated with an individual practitioner to a single record identifier for that person.
  • Ensure the preservation of original data formatting and create an auditable chain of custody for all data points ingested by Baton.
  • Work with the Product and Engineering teams to understand how the output data can be used to enrich the workflows of our customers.
  • Develop an understanding of the credentialing world and innovate new ways to fulfill credentialing needs with the data our sources provide.
  • Integrate our data warehouse with those of our customers using tools like the Snowflake Marketplace, Fivetran, or others in order to support new revenue streams.

What you’ll need


  • 5+ years of experience as a hands-on Data Engineer specializing in data ingestion, extraction, and modeling using modern data stacks. 
  • Deep experience building dbt models to transform and connect disparate raw data sources. 
  • Experience with ETL tools like Fivetran or Airbyte and rETL tools like Census or Hightouch.
  • Experience with a cloud data warehouse like Snowflake, Redshift, or BigQuery. 
  • Experience with workflow orchestration tools such as Airflow, Prefect, or Argo. 


Manages Complexity. You ask the right questions to accurately analyze situations and uncover root causes to difficult issues. Through acquiring data from multiple and diverse sources, you are able to make sense of complex, high-quantity, and sometimes contradictory information to solve problems.

Drives Results. Has a strong bottom-line orientation. Persists in accomplishing objectives despite obstacles and setbacks. Has a track record of exceeding goals. Pushes self and helps others achieve results. Consistently achieves results, even under tough circumstances.

Communicates & Collaborates. Is effective in a variety of communication settings: one-on-one, small and large groups, or among diverse styles and position levels. Attentively listens to others. Adjusts to fit the audience and the message. Provides timely and helpful information to others across the organization. Encourages the open expression of diverse ideas and opinions. Develops and delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences.


Additional Information:

Full-time base salary range of $120,000 to $150,000 plus equity

