LOOKING FOR A NEW ADVENTURE?

Speedinvest and our portfolio of startups are always hiring exceptional talent!
Browse open jobs below to find your next career move.
243
companies
1,112
Jobs

MLOps Engineer (PyTorch/DevOps)

Deep Render

Deep Render

Software Engineering
London, UK
Posted on Wednesday, January 3, 2024
Deep Render is a Deep Tech startup founded to liberate the world of all bandwidth constraints by pioneering AI-Compression technology. Our compression codecs are based on a fundamental technology shift, representing over 100 years of progress in the industry.

We're leading the way in AI-Compression, having the world's first AI-Codec running in real-time on mobile devices. Our team comes with 50+ years of combined research experience with over 80+ filed patents.

Last year, Deep Render raised its Series A funding round from top-tier investors and is looking to grow its current 35-person team in 2024. We're in commercial engagements with some of the largest Big Tech companies in the world and expect hundreds of millions of people to be using the Deep Render AI Codec very soon.

Role:

The MLOps team at Deep Render builds novel and efficient pipelines, platforms and tools to accelerate and simplify the complex machine-learning workflow of our researchers. As an MLOps Engineer, you will play a pivotal role in the seamless operation of machine learning infrastructure and pipelines. Your responsibilities will involve maintaining and optimising critical infrastructure components. Your expertise in PyTorch will be crucial, as you troubleshoot and resolve PyTorch-related issues and bottlenecks. Implementing DevOps practices, including Kubernetes and Docker, will be second nature to you, enhancing quality control and efficient model distribution. Supporting the existing ML stack, you'll collaborate with researchers and engineers to enhance operational efficiency and create custom tooling to streamline the model development process. Your role is at the intersection of infrastructure, PyTorch, DevOps, automation, and custom tooling, making you the linchpin for a robust and efficient ML ecosystem.

Responsibilities:

  • Infrastructure code: Maintain and optimise infrastructure components critical to research and production pipelines. Collaborate with cross-functional teams to ensure the scalability and reliability of the infrastructure.
  • PyTorch maintenance: Be the subject matter expert in PyTorch, ensuring the smooth operation of PyTorch-based models. Troubleshoot and debug PyTorch-related issues and bottlenecks.
  • DevOps: Implement DevOps practices to ensure quality control, testing, and the efficient distribution of machine learning models. Utilise DevOps tools and frameworks, including Kubernetes and Docker, to streamline deployment processes.
  • Automation: Build automations to enhance efficiency in the development and deployment of machine learning models.
  • ML Stack Support: Support the existing ML stack by providing technical expertise and ensuring its operational efficiency. Collaborate with research scientists and engineers to enhance the ML infrastructure and stack.
  • Custom Tooling: Develop custom tools and utilities to streamline the model development and deployment process.

Must have:

  • MSc in Computer Science or a related field (Mathematics, Physics, Engineering)
  • Experience writing production code in Python/PyTorch
  • Knowledge of DevOps technologies and stacks, including Kubernetes and Docker.
  • Experience in building and maintaining data pipelines for production-ready systems.
  • Familiarity working with machine learning models.

Preferred skills:

  • A minimum of 3 years of experience, ideally in a machine learning start-up environment