Senior Manager, Machine Learning Engineering

Senior Manager, Machine Learning Engineering

This job is no longer open
System1 is looking for an experienced engineer to join our team as a Senior Manager of Machine Learning Engineering.  In this role, you will be responsible for leading a talented team of ML engineers as well as contributing significant individual contributor work towards architecting and building MLOps capabilities and infrastructure abstractions that will accelerate development speed of new cutting-edge machine learning models that power our large scale marketing platform and support them throughout the entire machine learning model production lifecycle.  

The ideal candidate will not only have experience leading a team, either as a people manager or technical lead, but also be an expert software engineer with previous experience developing tooling to support the building, training, testing and deployment of machine learning models.

System1’s products touch 100s of millions of users per month and our platform processes 5B+ data points every day.  If working in a dynamic environment to build cutting-edge MLOps capabilities that empower our advanced machine learning models to scale to billions of predictions per day driving millions in revenue is something you find exciting, then you will love System1!

The Role You Will Have

    • Lead and work alongside a small team leveraging state-of-the-art tools and frameworks to build MLOps capabilities that enable/support the full ML model production lifecycle (data transformation/preparation, model training & development, model validation, model serving, model monitoring) 
    • Build scalable and efficient distributed ML pipelines that enable training and serving of large-scale cutting-edge machine learning models
    • Develop systems, tools, & processes to govern ML models for compliance, bias, versioning, traceability, and audit-ability
    • Work collaboratively with data scientists to accelerate ML model development and deployment 
    • Evaluate, recommend, and perform proof-of-concepts for new ML model lifecycle management services
    • Take projects through the full engineering lifecycle: designing, ticketing, building, testing, deploying, and debugging tools and products

What You Will Bring

    • Demonstrated experience in supporting ML model lifecycle management and/or building distributed ML pipelines
    • Previous experience leading highly technical engineering teams (either as people manager or technical lead) 
    • Well versed with Python and Python libraries/frameworks​​
    • Strong software design and implementation skills with a general-purpose programming language (e.g. Rust, Scala, C++/C#, Java, Haskell, etc) 
    • Experience in building distributed microservices in a cloud environment, working with large scale databases such as Cassandra, Redis or other caching systems, and ensuring that the system is able to serve data at very high levels of throughput with millisecond level latencies
    • Previous experience with deploying and maintaining advanced real-time scoring ML models in production (e.g. PyTorch, Tensorflow, Triton Inference Server)
    • Strong functional programming development skills
    • Passion for leveraging technology to solve problems and maximize productivity
    • Ability to execute ideas and influence decision making in a clear and effective manner
    • Ability to thrive in a collaborative environment

What We Have to Offer

    • Competitive salary + bonus + equity
    • Generous PTO + 11 Company Holidays
    • Untracked sick time
    • 100% covered Medical, Dental, Vision for employees
    • 401k w/match
    • Paid professional development
    • Leadership & growth opportunities
    • Virtual company and team building events
    • #BI-Remote
    • #LI-Remote
This job is no longer open
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.