Neural Magic

Somerville, MA
51-200 employees
Neural Magic offers high-performance inference serving solutions for you to deploy leading open-source LLMs on your private CPU and GPU infrastructure.

Machine Learning Engineer

Description

Neural Magic is an early-stage AI software company democratizing high performance for deep learning models. Our goal is to reduce cost and increase performance for end users deploying deep learning applications. Based on decades of research at MIT, Neural Magic has developed a software platform that allows developers to sparsify deep learning models to minimize footprint and run on CPUs at GPU speeds. Please look through our website and GitHub repos to get a feel for what we are about.

Founded by an award-winning team of computer scientists and researchers out of MIT, we are a venture-backed company headquartered in Davis Square, Somerville, MA. Our investors include Amdocs, Andreessen Horowitz, Comcast Ventures, NEA, and Pillar VC.

We are seeking a machine learning engineer to work closely with our product and research teams to develop state-of-the-art (SOTA) deep learning software. In this role, you will develop training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you want to help solve challenging technical problems at the forefront of deep learning, this is the role for you!


Responsibilities

  • Use your understanding of machine learning to tackle meaningful technical problems
  • Collaborate with research and product development teams to build machine learning products
  • Prototype and implement appropriate ML algorithms, tools, and pipelines
  • Create and manage training and deployment pipelines
  • Collaborate with a cross-functional team on market requirements and best practices
  • Keep abreast of developments in the field

Requirements

  • Proven experience as a machine learning engineer or in a similar role
  • Solid knowledge of machine learning and deep learning fundamentals, with experience in one or more of computer vision, NLP, speech, reinforcement learning, generative models, etc.
  • Knowledge of common ML frameworks (like PyTorch or Keras) and libraries (like NumPy and scikit-learn)
  • Strong programming skills with proven experience implementing Python-based machine learning solutions
  • Experience engineering and supporting ML pipelines in a popular ML framework such as PyTorch, TensorFlow, or JAX
  • Experience engineering and maintaining training and/or deployment pipelines for generative models, NLG, or LLMs
  • Ability to interpret and implement research ideas and algorithms
  • Creative, collaborative, and innovation-focused
  • Strong sense of project ownership and personal responsibility
  • Bachelor's degree in Computer Science, Mathematics, or a similar field

Benefits

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Training & Development
  • Work From Home
  • Free Food & Snacks
  • Wellness Resources
  • Stock Option Plan

We are an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
