Data Engineering Intern - Graduate

Data Engineering Intern - Graduate

This job is no longer open

About the Role:

At Tubi, data plays a vital role in keeping viewers engaged and the business thriving. Every day, data engineering pipelines analyze the massive amount of data generated by millions of viewers, turning it into actionable insights. In addition to processing TBs a day of 1st party user activity data, we manage a petabyte scale data lake and data warehouses that several hundred consumers use daily. We have two openings on two different teams.

Core Data Engineering (1): In this role, you will join a team focused on Core Data Engineering, helping build and analyze business-critical datasets that fuel Tubi's success as a leading streaming platform.

  • Use SQL and SQL modeling to interact with and create massive sets of data
  • Use DBT and its semantic modeling concept to build production data models
  • Use Databricks as a data warehouse and computing platform
  • Use Python/Scala in notebooks to interact with and create large datasets

Streaming Analytics (1): In this role you will join a small and nimble team focused on Streaming Analytics that power our core and critical datasets for machine learning, helping improve the data quality that fuels Tubi's success as a leading streaming platform.

  • Use SQL to explore and analyze the data quality of our most critical datasets, working with different technical stakeholders across ML & data science 
  • Work with engineers to implement a near-time data quality dashboard
  • Use Python/Scala in notebooks to transform and explore large datasets
  • Use tools like Airflow for workflow management and Terraform for cloud infrastructure automation

Qualifications: 

  • Fluency (intermediate) in one major programming language (preferably Python, Scala, or Java) and SQL (any variant)
  • Familiar with big data technologies (e.g., Apache Spark, Kafka) is a plus
  • Strong communication skills and a desire to learn!

Program Eligibility Requirements:

  • Must be actively enrolled in an accredited college or university and pursuing an undergraduate or graduate degree during the length of the program
  • Current class standing of sophomore (second-year college student) or above
  • Strong academic record (minimum cumulative 3.0 GPA)
  • Committed and available to work for the entire length of the program

About the Program:

  • Application Deadline: April 19, 2024 
  • Program Timeline: 10-week placement beginning on 6/17
  • Weekly Hours: Up to 40 hours per week (5 days)
  • Worksite:  Remote or Hybrid (SF or LA)
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.