Sr. Data Products & Engineering Architect

Sr. Data Products & Engineering Architect

Join one of the nation’s leading and most impactful health care performance improvement companies. Over the years, Health Catalyst has achieved and documented clinical, operational, and financial improvements for many of the nation’s leading healthcare organizations. We are also increasingly serving international markets. Our mission is to be the catalyst for massive, measurable, data-informed healthcare improvement through:

  • Data: integrate data in a flexible, open & scalable platform to power healthcare’s digital transformation​

  • Analytics: deliver analytic applications & services that generate insight on how to measurably improve​

  • Expertise: provide clinical, financial & operational experts who enable & accelerate improvement​

  • Engagement: attract, develop and retain world-class team members by being a best place to work​

Role:  Sr. Data Products & Engineering Architect 

Team: Data Products/Enterprise Architecture

Travel: Approximately 10%, US

Location: US Remote

Job Summary

The Sr. Data Products & Engineering Architect is responsible for leading the vision and technology architecture for Health Catalyst’s data products and data processing operations. They will lead and influence software patterns, cloud pipeline strategies, and data management best practices for Health Catalyst. In doing so they will make technology, software and analytic architecture decisions ranging from data warehousing, data pipelines, operational quality, data accuracy and infrastructure, to machine learning and NLP. The right candidate will have direct experience leading a team to develop a commercial-grade data products.

Duties & Responsibilities:

  • Collaborate, design and coordinate within Enterprise Architecture team and across engineering and product teams, to create meaningful and efficient data products for healthcare provider clients.
  • Guide teams building data products towards data engineering best practices, injecting SDLC principles, promoting re-usability, fault tolerance & graceful recovery, alerting and monitoring, and data quality.
  • Design highly scalable, distributed enterprise data solutions with focus on cloud data platforms and enterprise data lake houses with mesh and fabric components for reporting & analytics.
  • Design and lead in the development of highly scalable and flexible data product offerings.
  • Designing and documenting architecture at multiple levels (high-level to detailed) and across multiple views (conceptual, logical, physical, data flow and sequence diagrams).
  • Translate business requirements into executable technical specifications.
  • Develop and lead proof-of-concepts projects.
  • Create strategies and plans for data transformation and enrichment including but not limited to normalization, standardization, categorical encoding, and feature engineering.
  • Integrate and leverage new technologies and techniques into the current platform for greater functional capability and seamless operational impacts.
  • Provide active “hands-on” architectural guidance and leadership through the entire lifecycle of development projects.
  • Performing third-party vendor assessments

What you'll bring to the role:

  • Experience with Spark, Databricks, Snowflake, Hadoop, Kafka, AWS, Azure, cloud data architectures, and tools in healthcare.
  • Extensive experience designing and implementing scalable healthcare data processing patterns and storage strategies.
  • Experience with one or more modern advanced analytics tools for machine learning / AI applications (for example Spark, Databricks, Python, R, etc.).
  • Demonstrated experience in handling large data sets in both relational and non-relational data stores.
  • Knowledge and experience working with ELT processes.
  • Detailed knowledge of both clinical and claims data.
  • Experience with software for reliable, scalable, distributed computing, specifically in healthcare.
  • Strong communication skills, particularly those relevant to translating highly technical information for non-technical audiences,
  • Demonstrated ability to manage and see-through long-term projects.

Preferred Education & Experience

  • Bachelor’s degree in computer science, computer engineering, or a related area (data science, mathematics, statistics, economics, etc.)
  • 8+ years’ experience in healthcare data, analytics, and technology

The above statements describe the general nature and level of work being performed in this job function.  They are not intended to be an exhaustive list of all duties, and indeed additional responsibilities may be assigned by Health Catalyst.

Studies show that candidates from underrepresented groups are less likely to apply for roles if they don’t have 100% of the qualifications shown in the job posting. While each of our roles have core requirements, please thoughtfully consider your skills and experience and decide if you are interested in the position. If you feel you may be a good fit for the role, even if you don’t meet all of the qualifications, we hope you will apply. If you feel you are lacking the core requirements for this position, we encourage you to continue exploring our careers page for other roles for which you may be a better fit.

At Health Catalyst, we appreciate the opportunity to benefit from the diverse backgrounds and experiences of others. Because of our deep commitment to respect every individual, Health Catalyst is an equal opportunity employer.

Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.