The Team and Role
Upstart’s Data Engineering team builds the data infrastructure and platform for our AI lending products. Data engineering is part of our engineering organization, as we believe great data engineering relies on solid software engineering fundamentals. However, we also recognize that each Data Engineer has a unique blend of skills. Whether your strength is in data modeling or in stream processing, we want to talk to you.
As a Senior Data Engineer, you may work cross-functionally with product, software engineering, machine learning, finance, growth, and other stakeholders, driving projects from conception to launch to help teams leverage data in their decision making. This work helps accomplish our mission of enabling effortless credit based on true risk.
Position Location - This role is available in the following locations: Remote, Columbus.
Time Zone Requirements - This team operates in East/West Coast time zones.
Travel Requirements - This team has regular on-site collaboration sessions. These occur quarterly at your home office. If you need to travel to make these meetups, Upstart will cover all travel-related expenses.
How you’ll make an impact:
Develop and automate large-scale, high-performance data processing systems
Design data models for optimal storage and retrieval to meet critical business requirements
Collaborate with data scientists to build data pipelines for training Upstart’s Machine Learning models
Reliably deliver critical data at the expected quality, following Minimum Viable Product (MVP) principles
Contribute to shared Data Engineering tooling & standards to improve the productivity of Data Engineers and data consumers
Influence cross-functional teams of business stakeholders, data scientists, and team members to derive maximum value from data
Maintain high standards of engineering excellence through code reviews, unit tests, and robust alerting
What we’re looking for:
3+ years of experience as a Data Engineer or Software Engineer with a focus on Data Engineering
Proficiency in at least one high-level programming language (Java, Scala, Python, or equivalent)
Expertise building real-time or batch data pipelines using Spark, Flink, or other open-source frameworks
Experience with relational databases (such as MySQL, PostgreSQL) and strong knowledge of data structures and relational data modeling
Excellent communication skills, particularly in explaining technical or complex matters to non-technical audiences
MS or BS in Computer Science, Engineering, Mathematics, Statistics, Bioinformatics or a related field OR equivalent practical experience in Data Engineering
Experience with cloud computing platforms like AWS, Azure, or Google Cloud
Experience with workflow management and data pipeline orchestration tools such as Airflow or Oozie
Experience with data lake/warehouse architectures (BigQuery, Snowflake, Redshift)
Experience building reliable, distributed, high-volume data services
Our tech stack:
Python, SQL, AWS, Airflow, Spark, Kafka, PostgreSQL, Redshift, Looker, Kubernetes, and Docker