Principal Engineer, Machine Learning Acceleration

Principal Engineer, Machine Learning Acceleration

This job is no longer open

The Machine Learning Infrastructure team builds the industrial scale machine learning platform used by our Machine Learning engineers developing software for our self-driving car. We provide essential tools and frameworks to support the entire lifecycle of machine learning, from data processing, large scale training and evaluation frameworks to efficient neural net inference runtimes for onboard execution and simulation. 

What You'll Be Doing

  • Manage and monitor ML training, visualize and compare ML training, evaluate models on different hardware accelerators and track cost, etc.
  • Own training cluster and provide technical support to internal customers.
  • Work closely with machine learning researchers to identify the bottlenecks and optimize the codebase by: better parallelism and data caching, fully utilizing the hardware: CPU, GPU, memory and disk/network IO, distributed/multi-machine training.
  • Act as a sounding board and technical architect for your teams, help define technical vision and strategy
  • Provide coding and performance guidelines and best practices to machine learning researchers/engineers.

What We're Looking For

  • Degree in Computer Engineering, Computer Science, Applied Mathematics, or a related field
  • 5+ years of industry experience
  • Strong programming skills in Python.
  • Familiar with DL frameworks like PyTorch, Tensorflow, Keras, etc.
  • Experience working with cloud infrastructure like AWS, GCP or Azure.
  • Experience working with large scale dataset.

Bonus Points

  • Strong skills in C++.
  • Experience in numba, cython, CUDA, etc.
  • Experience in distributed computing.
  • Strong machine learning/deep learning background.

 Why you should join us:

  • You’ll have the opportunity to work on cutting-edge technology and some of the most exciting problems within engineering including: robotics, infrastructure, visualization, etc.
  • As part of the Machine Learning Infrastructure team, you’ll be collaborating and working closely with a tight knit team and some of the industry’s top researchers.

Why you should join us:

Led by Oscar Beijbom. Our Machine Learning team, authors of nuScenes (https://www.nuscenes.org), PointPillars (https://arxiv.org/abs/1812.05784), and PointPainting (http://arxiv.org/abs/1911.10150). The ML Team is doubling in size in 2021, and we are adding engineers and researchers to join us in our mission to launch a level 5 autonomous taxi system.

(Colorado only*) Minimum salary of $208,620/year + bonus + equity + benefits

*Note: Disclosure as required by sb19-085(8-5-20)

This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.