Senior Data Engineer

This job is no longer open

Paper is looking for a Senior Data Engineer to join the growing R&D and analytics team. The Senior Data Engineer will be responsible for expanding and optimizing our data and data pipeline architecture, as well as optimizing data flow and collection for cross-functional teams. The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimizing data systems and building them from the ground up. The Senior Data Engineer will support our data analysts and data scientists on data initiatives and will ensure that our data delivery architecture is optimized and consistent throughout ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products. The right candidate will be excited by the prospect of designing and optimizing our data architecture to support the company's growth.

Responsibilities

  • Create and maintain an optimal data pipeline architecture.
  • Extend the current ELT pipeline to help assemble complex data sets to address a diverse set of business and data analytics requests.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Develop real-time data pipelines.
  • Maintain a product API for data requests.
  • Assist in developing a privacy-aware data catalogue.
  • Develop secure and anonymized solutions for sharing data with external partners.
  • Participate in ensuring code and data quality within the department.
  • Create data tools, such as an experimentation platform, for analytics and data science team members.

Qualifications

  • Advanced working knowledge of SQL, including query authoring, and experience with relational databases, as well as working familiarity with a variety of other databases.
  • Experience building and optimizing cloud data pipelines, architectures and data sets.
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Strong analytic skills related to working with unstructured datasets.
  • Experience building processes that support data transformation, data structures, metadata, dependency and workload management.
  • Strong project management and organizational skills.
  • Experience supporting and working with cross-functional teams in a dynamic environment.
  • We are looking for a candidate with 5+ years of experience in a Data Engineering role, who has attained a degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field. They should also have experience using the following software/tools:
    • Strong background in object-oriented and functional programming languages: Python, Java, C++, Scala, etc.
    • Extensive experience with relational SQL and NoSQL databases
    • Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, Prefect, etc.
    • Experience with Google Cloud services is a plus
    • Experience with big data tools (Hadoop, Spark, Kafka, etc.) is a plus
    • Experience with stream-processing systems (Storm, Spark Streaming, etc.) is a plus

This position can be located anywhere in the US or Canada.
