Senior Data Engineer

This job is no longer open

Our unique, rapidly growing data streams are creating new opportunities to manage clinical trials more efficiently and predictably. The Data Engineering division is looking for talented Data Engineers to build, expand, and support a cutting-edge, globally distributed data architecture that is the analytical backbone of our company. If you are empathetic, business-driven, and want to use your data engineering and data architecture skills to make a tangible impact in the clinical research community, then this may be the role for you.

As a fast-growing company, we're looking for people who can balance rapid execution and delivery with sustainable, scalable architectural initiatives to serve the business most effectively. You have strong opinions, weakly held, and while well-versed technically, you know how to choose the right tool for the right job at the right level of complexity. You will work closely with our Data Science & Analytics and Data Products divisions and the Product and Product Engineering departments to help collect, stream, transform, and manage data for integration into critical reporting, data visualizations, and data science/machine learning-driven data products.

What You'll Be Working On

  • Supporting the development and international expansion of modern, privacy-aware data warehouse and data mesh architectures
  • Helping to build, manage, orchestrate, and integrate streaming data sources, data lakes, ELT processes, columnar storage systems, and distributed query execution solutions
  • Establishing proactive data quality/freshness dashboards, monitoring, alerting, and anomaly remediation systems
  • Building practical data onboarding tooling and process automation solutions
  • Learning to understand and deftly navigate the global compliance ecosystem (HIPAA, GDPR, etc.) to ensure your work respects the rights, regulations, and consent preferences of all stakeholders, including historically underserved or underrepresented populations.
  • Developing a deep understanding of the clinical ecosystem, our products, and our business and how they all uniquely interact to help people.

What You Bring To Reify Health

  • 4+ years of experience successfully developing and deploying data pipelines and distributed architectures, ideally in a space similar to ours (startup, healthcare, regulated data).
  • Deep practical experience or familiarity with a good portion of our stack, including: AWS services (Redshift, MSK, Lambda, ECS, ECR, EC2, Glue, QuickSight, Spectrum, S3, etc.), Postgres, dbt, Kafka, Prefect, Docker, Terraform
  • Experience or interest in developing and managing enterprise-scale, distributed data architectures
  • Excellent programming skills in Clojure or Python and deep comfort with SQL.
  • Solid software testing, documentation, and debugging practices in the context of distributed systems.
  • Great communication skills and the ability to work comfortably with technical and non-technical stakeholders to develop requirements.