Data Engineer

This job is no longer open

Locations - Austin or Remote US


About the team  

The BI team at Cloudflare is responsible for building and managing the cloud data analytics platform. Our focus is on developing a centralized cloud data analytics platform using open source technologies for internal Business Partners and Machine Learning teams. The team aims to make data more accessible, meet Cloudflare's critical business needs, and provide self-service reporting and analytics tools to support current and new business initiatives.

About the role

As a Data Engineer, your primary responsibility is to help build and improve a petabyte-scale data lake and Cloud Enterprise Data Warehouse. This involves building a modern tech stack from scratch using open source technologies. Success in this position requires a strong background in data engineering, combined with product and business acumen, to deliver scalable data pipelines and analytics solutions that enable advanced analytics through a user-friendly self-service interface.

What you will do

  • Partner closely with internal stakeholders to gain a strong understanding of business and product data needs
  • Design, build and support scalable and reliable data solutions that can enable self-service reporting and advanced analytics using open source technologies
  • Develop tools and programs that leverage machine learning and big-data techniques to cleanse, organize and transform data, and to maintain and update data structures and data integrity automatically
  • Design application components and evolve architecture: API/Services, data access, integration, application components, etc.
  • Analyze and support platform requirements for the Data Science team
  • Implement automation tools and frameworks (CI/CD pipelines)
  • Build tools to automate the monitoring of workloads and take proactive measures to scale the platform or fix problems
  • Mentor junior data engineers

Examples of desired skills, knowledge, and experience

  • Proven ability to work closely with business and product teams to ensure data solutions are aligned with business initiatives and are of high quality
  • 2-5 years of development experience in the Big Data space, working with petabytes of data and building large-scale data solutions using any Cloud Platform, Apache Spark, Airflow, Scala, Golang, Python, etc.
  • Experience with API design and development of RESTful web services or GraphQL is a plus
  • Working experience with Kubernetes, Docker, etc. is a plus
  • Bachelor’s or Master’s Degree in Computer Science or Engineering or related experience required
