Leading the Way in Generative AI Production
We are seeking a seasoned Senior Machine Learning Engineer to join our team and help drive innovation in generative AI.
The Role:
* Deployment at Scale: Design and implement robust production pipelines that deploy large-scale models, including diffusion-based systems, across multiple services, each with hundreds of GPUs.
* Powering a Global Platform: Contribute to one of the world's highest-throughput GenAI systems, generating millions of images and videos daily.
* Optimising for Speed and Efficiency: Utilise advanced techniques such as quantisation, compilation, caching, distillation, and multi-GPU parallelism to enhance throughput, latency, and stability.
* Pushing Model Innovation into Production: Collaborate closely with researchers to productionise new capabilities, such as LoRAs, ControlNets, and custom architectures.
Key Skills Include:
* Generative AI Production Expertise: Proven experience deploying diffusion-based models (e.g. latent diffusion, LoRA, ControlNet) into production environments, ideally across dozens or hundreds of GPUs.
* High-Performance ML Engineering: Proficiency in Python and PyTorch, with a focus on optimised inference, model tuning, and memory-efficient execution.
* MLOps Fluency: Familiarity with model deployment tools and practices (e.g. model registries, workflow orchestration, CI/CD for ML).
* Real-World Systems Thinking: Comfort with performance trade-offs, debugging large-scale systems, and delivering improvements fast.
* Team Collaboration: Experience working in fast-moving, cross-functional teams shipping real-world AI products.
* Adaptability: Ability to pivot quickly between deep technical work, product needs, and cross-functional alignment.