Sr Data Engineer

This job is no longer open

By joining Invitae, you’ll work alongside some of the world’s experts in genetics and healthcare at the forefront of genetic medicine. We’ve crafted a culture that empowers our teams and our teammates to have the biggest impact and to explore their interests and capabilities. We prize freedom with accountability and offer significant flexibility, along with excellent benefits and competitive compensation in a fast-growing organization!

We are looking for a reliable and motivated Senior Data Engineer to join our Patient Data Network team. You will support our Data Solutions Team in developing the data ingestion pipelines and data platform architecture that support the analytical and reporting needs of internal stakeholders, data scientists, and our machine learning team, as well as externally facing products.

 

What you’ll do:

Understand our complex data ecosystem

Be hands-on with the technical design and implementation of a reliable, scalable, and efficient data processing framework (batch and streaming), data-driven products, and software solutions for external and internal customers

Identify, prioritize, and solve ambiguous, open-ended problems

Collaborate with multiple teams; own and deliver high-quality data solutions end to end

What you bring:

  • Typically requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or a PhD with 3 years of experience; or any equivalent combination of training, education, and experience that provides the required skills, knowledge, and abilities.
  • Extensive hands-on experience working with large datasets, pipelines, and modern warehouse technologies 
  • Self-starter attitude and ability to work towards a larger goal with minimal guidance
  • Advanced experience in SQL queries and performance tuning
  • Understanding of functional programming paradigms
  • Proficiency in Scala, Java, and Python, plus a demonstrable ability to learn quickly
  • Focus on high-quality code, including automated testing and coding best practices
  • Experience with messaging/queuing systems or stream processing systems
  • Experience in building distributed systems with infrastructure automation, monitoring and alerting
  • Track record of working with cross-functional teams and stakeholders

Additional Preferred but not Required Skills:

  • Experience with Snowflake as a warehouse technology
  • Experience using Kafka to implement streaming applications
  • Experience with CI/CD pipelines (e.g. GitHub Actions)
  • Experience with maintaining and administering Kubernetes clusters
  • Interest in working on related but separate projects in parallel
  • Experience with dbt as a data transformation tool
  • Experience with data lineage/data governance tools like Atlan
  • Experience with data modeling/dimensional modeling


