Data Engineer, Product

Data Engineer, Product

This job is no longer open

The Team and Role

Upstart’s Data Engineering team builds the data infrastructure and platform for our AI lending products. Data engineering is part of our engineering organization, as we believe great data engineering relies on solid software engineering fundamentals. However, we also recognize that each Data Engineer has a unique blend of skills. Whether your strength is in data modeling or in stream processing, we want to talk to you.

As a Senior Data Engineer, you may work cross functionally with product, software engineering, machine learning, finance, growth, and other stakeholders, driving projects from conception to launch, in order to help teams leverage data in their decision making. Thus, helping accomplish our mission of enabling effortless credit based on true risk.

Position Location - This role is available in the following locations: Remote, Columbus. 

Time Zone Requirements - This team operates on the East/West Coast time zones.

Travel Requirements - This team has regular on-site collaboration sessions. These occur quarterly at your home office. If you need to travel to make these meetups, Upstart will cover all travel related expenses.

How you’ll make an impact:

  • Develop and automate large scale, high-performance data processing systems

  • Design data models for optimal storage and retrieval to meet critical business requirements

  • Collaborate with data scientists to build data pipelines for training Upstart’s Machine Learning models

  • Ensure delivery of critical data with expected data quality reliably, following Minimum Viable Product (MVP) principles

  • Contribute to shared Data Engineering tooling & standards to improve the productivity of Data Engineers and data consumers

  • Influence cross-functional teams of business stakeholders, data scientists, and team members to derive maximum value from data

  • Maintain high standards of engineering excellence through code reviews, unit tests, and robust alerting

 

What we’re looking for: 

  • Minimum requirements:
    • 3+ year(s) of experience as a Data Engineer or Software Engineer with a focus in Data Engineering

    • Proficiency in at least one high-level programming language (Java, Scala, Python, or equivalent)

    • Expertise building real-time or batch data pipelines using Spark, Flink or other open source frameworks

    • Experience with relational databases (such as MySQL, PostgreSQL) and strong knowledge of data structures and relational data modeling

    • Excellent communication skills, particularly with explaining technical or complex matters to non-technical audiences

    • MS or BS in Computer Science, Engineering, Mathematics, Statistics, Bioinformatics or a related field OR equivalent practical experience in Data Engineering

  • Preferred qualifications:
    • Experience with cloud computing platforms like AWS, Azure, or Google Cloud

    • Experience with workflow management and data pipeline orchestration tools such as Airflow or Oozie

    • Experience with data lake/warehouse architectures (BigQuery, Snowflake, Redshift)

    • Experience building reliable, distributed, high-volume data services

  • Our tech stack:

    Python, SQL, AWS, Airflow, Spark, Kafka, PostgreSQL, Redshift, Looker, Kubernetes, and Docker

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.