Senior Data Platform Engineer

This job is no longer open

The Team and Role

Upstart’s Data Engineering team builds the data infrastructure and platform for our AI lending products. The data platform team, a sub-squad within the broader data engineering squad, creates developer-friendly products that enable data discovery, governance, analytics, and reporting. The team’s mission is to promote data-driven decisions by creating the tooling and frameworks that data engineering needs. You will be instrumental in defining and implementing our core infrastructure and will help us realize our data-driven mission of affordable credit.

You’ll work closely with Product Managers, Engineers, Designers, and others in a cross-functional environment to ship developer-friendly products. You will also have the opportunity to lead projects involving data pipelines, data governance, machine learning infrastructure, and API design and development, to name just a few, all within our cloud-native application stack.

As a Senior Data Platform Engineer, you’ll organize development efforts, mentor junior engineers, and set and maintain high standards of engineering excellence. You’ll also work with exceptionally talented peers and architects, and you will never run out of opportunities to keep learning, hone your skills, and grow your career.

Position Location - This role is available in the following locations: Remote (#LI-REMOTE) 

Time Zone Requirements - This team operates on the West Coast time zone.

Travel Requirements - This team has regular on-site collaboration sessions. These occur a few days per quarter at the San Mateo office. If you need to travel to attend these meetups, Upstart will cover all travel-related expenses.

How you’ll make an impact: 

  • Design the data infrastructure architecture for driving large-scale data science projects
  • Create the software that enables our industry-leading machine learning insights to be delivered at enterprise scale
  • Build APIs and interfaces to deliver key analytic and data science insights
  • Write scalable, robust, and fully-tested software for deployment in mission-critical production environments
  • Work with cross-functional stakeholders to identify and prioritize use cases that support the company’s imperatives and objectives and improve Upstart engineers’ experience
  • Identify opportunities for data-driven applications that could measurably improve Upstart engineers’ experience and the business’s efficiency through automation and machine learning
  • Participate in defining cross-functional data governance
  • Identify and provide tools and training to help others leverage data

What we're looking for:

  • 8+ years of industry experience working with distributed data technologies (e.g., Hadoop, MapReduce, Spark) to build efficient, large-scale data pipelines.
  • A strong background in distributed data processing and software engineering, with the ability to build high-quality, scalable data products.
  • Strong knowledge of data architectures, databases, data modeling, and the data infrastructure ecosystem.
  • Experience with relational databases (such as MySQL, Postgres) and analytics databases (such as Druid, Apache Pinot).
  • Strong software engineering experience and proficiency in at least one high-level programming language (Java, Scala, Python, or equivalent).
  • BS in computer science, engineering, mathematics, statistics or a related field OR equivalent practical experience in data engineering.

Preferred Skills: 

  • Experience with cloud computing platforms such as AWS or Google Cloud
  • Experience building stream-processing applications using Apache Flink, Spark Streaming, Apache Storm, Kafka Streams, or others
  • Experience with search systems (such as Elasticsearch, Solr) and NoSQL datastores (such as HBase, Cassandra, MongoDB)
  • Thorough understanding of data lake/warehouse architectures (BigQuery, Snowflake, Redshift)
  • Experience building distributed, high-volume data services

 

