Cloud Infrastructure Team Lead in Islington
As the DevOps Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and processes that enable continuous integration, delivery, and deployment of software applications. You will collaborate with development and operations teams to streamline workflows, automate repetitive tasks, and ensure the reliability, scalability, and security of the production environment. Availability for on‑call duties may be required as part of a scheduled roster.
What you'll do
* Lead and manage a team of DevOps engineers responsible for the development, deployment, and maintenance of infrastructure and automation systems
* Oversee the design, implementation, and optimization of DevOps products and workflows
* Collaborate with cross‑functional teams, including product management, operations, video engineering, and monitoring, to ensure seamless delivery of infrastructure services
* Monitor and troubleshoot infrastructure delivery issues, ensuring high availability and quality of service
* Implement and manage cost optimisation strategies to ensure efficient use of resources and budget control
* Stay current with industry trends and emerging technologies, incorporating them into the team's projects as appropriate
* Provide mentorship and professional development opportunities for team members, fostering a culture of continuous learning and improvement
* Manage team WIP limits, roadmaps, timelines, and resources to ensure successful delivery of multiple projects
What you'll bring
* Experience working on multiple projects as part of a cross‑functional team
* Working with architecture teams to design scalable, fault‑tolerant, and cost‑efficient solutions
* Passion for researching and implementing new technologies
* Experience with mentoring/knowledge‑sharing
* Proven experience in IaC frameworks (e.g. Terraform, Ansible, Pulumi)
* Proven experience with the GitOps approach and related tools (e.g. ArgoCD, FluxCD)
* Proven experience working with databases (RDBMS or NoSQL)
* Proven experience in a containerised environment (K8s, Docker) and related tools (kubectl, Helm, kustomize, docker‑compose)
* Proven experience in networking and security standards, protocols and best practices
* Proven experience in tracing systems (e.g. OpenTelemetry, Jaeger)
* Experience in performance optimisation and resource management
* Understanding of Agile methodologies
* Ability to diagnose and resolve service‑affecting issues in a broadcast or livestream environment
#J-18808-Ljbffr