Staff Data Engineer

Staff Data Engineer

This job is no longer open

About the role

Our Data Platform Engineering group builds and maintains the platform that delivers accessible data to power decision-making at Shopify for over a million merchants. We’re hiring high-impact developers across teams:

  • The Engine group organizes all merchant and Shopify data into our data lake in highly-optimized formats for fast query processing, and maintaining the security and quality of our datasets.

  • The Analytics group leverages the Engine primitives to build and deliver simple and useful products that power scalable transformation of data at Shopify in batch, streaming, or for machine learning. This group is focused on making it really simple for our users to answer three questions: What happened in the past? What is happening now? And, what will happen in the future?  

  • The Data Experiences group builds end-user experiences for experimentation, data discovery, and business intelligence reporting.

  • The Reliability group operates the data platform in a consistent and reliable manner. They build tools for other teams on Data Platform to leverage and encourage consistency as they champion reliability across the platform.

Qualifications

  • An experienced technical leader with a proven track record of delivering impactful results.

  • Technical engineering background in one or more areas in the next section.

  • Experience with technical mentoring, coaching, and improving the technical output of the people around you.

  • Exceptional communication skills and ability to translate technical concepts into easy to understand language for our stakeholders. 

  • Excitement for working with a remote team; you value collaborating on problems, asking questions, delivering feedback, and supporting others in their goals whether they are in your vicinity or entire cities apart.

A Staff Data Developer would typically have 6-10 years of experience in one or more of the following areas:

  • Experience with the internals of a distributed compute engine (Spark, Presto, DBT, or Flink/Beam)

  • Experience in query optimization, resource allocation and management, and data lake performance (Presto, SQL)

  • Experience with cloud infrastructure (Google Cloud, Kubernetes, Terraform)
    Experience with security products and methods (Apache Ranger, Apache Knox, OAuth, IAM, Kerberos)

  • Experience deploying and scaling ML solutions using open-source frameworks (MLFlow, TFX, H2O, etc.)

  • Experience building full-stack applications (Ruby/Rails, React, TypeScript)

  • Background and practical experience in statistics and/or computational mathematics (Bayesian and Frequentist approaches, NumPy, PyMC3, etc.)

  • Modern Big-Data storage technologies (Iceberg, Hudi, Delta)

At Shopify, we are committed to building and fostering an environment where our employees feel included, valued, and heard. Our belief is that a strong commitment to diversity and inclusion enables us to truly make commerce better for everyone. We strongly encourage applications from Indigenous people, racialized people, people with disabilities, people from gender and sexually diverse communities and/or people with intersectional identities.

Shopify is now permanently remote and working towards a future that is digital by design. Learn more

#LI-REMOTE

How we hire

At Shopify, we put a lot of care and time into who we hire. We believe that in order to build the best products, we need to build high impact teams. Our recruitment process centres around what we call the Life Story interview, a conversational-style interview where we get to learn more about you.

Learn more about our hiring process 

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.