Senior Data Engineer

Senior Data Engineer

This job is no longer open
Our mission is to make biology easier to engineer. Ginkgo is constructing, editing, and redesigning the living world in order to answer the globe’s growing challenges in health, energy, food, materials, and more. Our bioengineers make use of an in-house automated foundry for designing and building new organisms. Today, our foundry is developing over 40 different organisms to make different products across multiple industries.
 
We're creating the codebase, compiler, and debugger for biology. We have built a strong set of internal software tools, automation, and processes that enable high-throughput genetic engineering across multiple species. We want to make them better, more powerful, more scalable, and more effective, while making them easier to use, manage, and deploy.
 
As a Senior Data Engineer, you’ll join in architecting our platform to support analytics and machine learning that will ultimately help to define how our bioengineering is performed at scale.  Ginkgo's programming languages of choice are Python and SQL, and DNA, but you are someone who loves writing elegant code in any language.  Plus, you're an experienced data wrangler who enjoys building systems from the ground up. Most importantly, you will be passionate about making biology the next engineering discipline.

Note: The current list of tools we utilize includes RDS Postgres, Snowflake, Airflow, AWS DMS, Spark on EMR, and Python. Extensive experience with the tools we use is not required, but rather a working understanding of the Desired Software and Tools listed below is preferred.

Desired Software and Tools Working Knowledge

    • Data pipeline and workflow management tools: Airflow, Luigi, etc.
    • Big Data tools: Snowflake, Hive, Spark.
    • AWS cloud services: EC2, EMR, RDS, Redshift, S3.
    • Languages: Python, Java, Scala, etc.
    • Linux

Responsibilities

    • Expanding and optimizing our data pipeline architecture, as well as flow and collection for cross functional teams. This includes: automating manual processes, ETL, re-designing infrastructure for greater scalability, and improving reliability and accuracy.
    • Supporting our software engineering initiatives to ensure optimal delivery architecture is consistent throughout on-going projects.
    • Using appropriate tools to analyze the data pipeline and provide actionable insights into operational efficiency, data accuracy, and other KPI’s.
    • Working with various stakeholders to assist with related technical issues and infrastructure needs.
    • Keeping our data secure.
    • If remote, must be able to start workday at 10am eastern standard time.

Desired Experience and Capabilities

    • BS, MS, or PhD in computer science or related quantitative field
    • 5+ years of data engineering experience, with advanced knowledge of database design best practices
    • Experience working with relational databases, data warehouses, and big data platforms.
    • Demonstrated ability performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
    • Strong analytical skills in relation to working with large datasets.
    • Experience building processes that support data transformation, data structures, metadata, dependency, and workload management.
    • Working knowledge of message queuing, stream processing, and highly scalable big data stores.
    • Analytical, highly motivated self-starter, with strong project management and organizational skills.


To learn more about Ginkgo, check out some recent press:
What is it really like to take your company public via a SPAC? One Boston biotech shares its journey (Fortune)
Ginkgo Bioworks resizes the definition of going big in biotech, raising $2.5B in a record SPAC deal that weighs in with a whopping $15B-plus valuation (Endpoints News)
Ginkgo Bioworks CEO on scaling up Covid-19 testing: ‘If we try, we can win’ (CNBC)
Ginkgo raises $70 million to ramp up COVID-19 testing for employers, universities (Boston Globe)
Ginkgo Bioworks Redirects Its Biotech Platform to Coronavirus (Wall Street Journal)
Ginkgo Bioworks Provides Support on Process Optimization to Moderna for COVID-19 Response (PRNewswire)
The Life Factory: Synthetic Organisms From This $1.4 Billion Startup Will Revolutionize Manufacturing (Forbes)
Synthetic Bio Pioneer Ginkgo Raises $290 Million in New Funding (Bloomberg)
Ginkgo Bioworks raises $350 million fund for biotech spinouts (Reuters)
Can This Company Convince You to Love GMOs? (The Atlantic)

We also feel that it’s important to point out the obvious here – there’s a serious lack of diversity in our industry, and that needs to change. Our goal is to help drive that change. Ginkgo is deeply committed to diversity, equity, and inclusion in all of its practices, especially when it comes to growing our team. Our culture promotes inclusion and embraces how rewarding it is to work with people from all walks of life.  

We’re developing a powerful biological engineering platform, so we must remain mindful of the many ways our technology can – and will – impact people around the world. We care about how our platform is used, and having a diverse team to build it gives us the best chance that it’s something we’ll be proud of as it continues to grow. Therefore, it’s critical that we incorporate the diverse voices and visions of all those who play a role in the future of biology.

It is the policy of Ginkgo Bioworks to provide equal employment opportunities to all employees and employment applicants.
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.