Data Engineer


Data Engineer

605 is an independent TV measurement and analytics firm that offers advertising and content measurement, full-funnel attribution, media planning, optimization and analytical solutions. Comprised of engineers, analysts, data scientists, media experts and marketing strategists, 605 forges new paths using groundbreaking innovations that set industry standards for audience targeting and measurement.

Our engineers are expected to wear a number of hats and have the opportunity to touch all parts of the stack. As an engineer at 605, you might work to improve our optimization algorithms, write code using Apache Spark, participate in architectural decisions for new components, or improve performance in collaboration with DevOps. In the same week, you could work on user-facing interfaces and reports with frontend developers, write code to import, process and QC terabytes of new data, and work with analysts and statisticians to ensure the validity of our processes. Responsibilities include:

  • Interpreting data sets and defining processes and procedures to ensure data is received to spec and in a timely and accurate manner
  • Identifying and addressing issues with data sets
  • Establishing clear communications both across the data provider relationships as well as within 605
  • Ensuring appropriate procedures are in place to meet SOC 3 compliance procedures
  • Ensuring excellent communication procedures are in place via daily/monthly/weekly reporting to advise status of data sets and that end users are informed of data status
  • Working with business and product owners to develop procedures, methods, code to provide efficient tools and products to meet the needs of the end users
  • Supporting all groups across 605 as needed to address questions regarding data quality, status, ETL and other platform related questions and concerns that may arise
  • Assisting junior analysts with the facilitation of reporting and analytics
  • Bachelor's degree in Computer Science or a related field (or 4 additional years of relevant work experience)
  • A strong understanding of data structures, algorithms, and effective software design
  • Significant development experience with a major modern language (e.g. Java, Scala, Python, Ruby, C/C++, etc.)
  • Significant experience working with structured and unstructured data at scale and comfort with a variety of different stores (key-value, document, columnar, etc.) as well as traditional RDBMSes and data warehouses
  • Experience writing unit and functional tests
  • Comfort with version control systems (e.g. Git, SVN)
  • Comfortable working in a Cloud environment specifically AWS
  • Excellent verbal and written communication skills; must work well in an agile, collaborative team environment

Preferred Qualifications

  • Master's Degree in Computer Science or a related field
  • Basic understanding of statistics and experience with statistical packages such as R, Matlab, SPSS, etc.
  • Practical experience with supervised machine learning techniques
  • Practical experience with Apache Spark and Apache Airflow
  • Experience with AWS products (Redshift, EMR, S3, IAM, RDS, Managed Airflow, etc)
  • Experience wrangling terabytes of big, complicated, imperfect data
  • Experience with front-end languages, especially Javascript, HTML5, and CSS3
  • Strong background with test-driven development
  • Comprehensive medical, dental and vision insurance for employees and their families
  • Company-paid life insurance, short and long term disability
  • 401k with match after six months of employment
  • Pre-tax flexible compensation plan for medical, transit, parking, or dependent care expenses
  • Generous vacation, personal and sick days
  • Remote work model with on-demand office space in Pasadena, CA and New York, NY
  • 100% paid parental and family caregiving leave
  • Educational reimbursement program
  • A kitchen stocked with sodas, snacks, yogurt and other goodies
  • A tight-knit startup community who likes to eat! We celebrate everyone’s birthdays, have frequent team lunches, and do events in and out of the office
  • 605 is an active participant in conferences
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.