Head of Data Engineering

This job is no longer open

Golden is looking for a Head of Data Engineering to architect, recruit and build our data team in order to grow entity count, semantic triples and prose in the Golden Knowledge Base as part of our mission to organize and map the world’s knowledge. The company is venture-backed by a16z, Founders Fund, Giga fund and other top-tier investors, and is led by Jude Gomila, a founder of Heyzap (YC ’09, acquired for $45 million in 2016) and an investor in 200+ startups.

We are looking for a data engineering lead to both manage data ingestion projects and develop tooling for increased scale, accuracy, and automation in our data pipeline. This person will lead the building of a team of data engineers, NLP experts, ML experts and ML-related DevOps engineers. You will build and work closely with Golden’s AI/NLP/data team. Successful candidates will be able to demonstrate a history of thoughtfulness and curiosity in data ingestion, generation, and pipelining. We are open to anything from a full-time manager to a hybrid manager / individual contributor.

As an early team member of Golden, you would be working in a startup environment with high autonomy and responsibility. This role - engineering the ingestion and generation of data at scale - is a crucial building block of Golden’s success. Apply below.

The person in this role will need to:

  • Build and scale our data engineering team
  • Have strong architectural knowledge of how to go about solving such a problem set
  • Have strong experience at scale with data-oriented products
  • Have experience in data ingestion and creation
  • Have experience with data at web scale
  • Have experience with NLP and management of an NLP team
  • Help architect our approach, infrastructure and team to attack the overall problem of building a knowledge graph from public data
  • Deliver our objectives of semantic triple ingestion speed and entity creation

You will need to build a team that can:

  • Make thoughtful judgements on data quality to clean data sources for import
  • Identify and proactively create new data ingestion and processing tooling to eliminate manual processes, inefficient or repetitive work, or address quality issues
  • Build and work with the AI/NLP team to scale and embed techniques they’ve developed or prototyped
  • Have strong team leadership/management experience
  • Use Python, Jupyter notebooks, and Pandas to inspect and analyze data sources

Bonus points:

  • Experience with ingesting public data sources
  • Specific experience with any of the following: extraction of triples, topic prediction, taxonomic detection, event detection, clustering, relevancy, deduction and inference of data, generation of text with NLP
  • Strong experience with probability and statistics

Advantages of working for us:

  • A meaningful and important mission
  • High autonomy and ownership
  • Work with a hard-working, creative team
  • Sunny office with natural lighting in the heart of SF
  • Work closely with an experienced founder and founding team
  • Lunch provided daily
  • Medical, vision, dental, 401k, commuter benefits and more
  • Equity in a fast-growing startup
