Job Description
We are seeking a highly skilled and experienced System Reliability Engineer to join our team. As a key member of our organization, you will be responsible for designing and implementing large-scale solutions that ensure seamless execution of award-winning digital experiences.
Responsibilities
* Design and implement large-scale solutions, collaborating with senior stakeholders on best practices for improving reliability across the software development lifecycle.
* Partner with senior stakeholders, leading a data-driven reliability culture, monitoring and automation aligned to SRE principles.
* Scale distributed systems in public, private or hybrid cloud environments, creating tools for operational management and security of software applications and systems.
Qualifications and Experience
* Experts in partnering with senior stakeholders, leading a data-driven reliability culture.
* Deep experience designing, developing, testing and supporting applications and systems.
* Passionate about managing and scaling distributed systems in cloud environments.
* Experience creating tools for operational management, including security, of software applications and systems.
* Able to identify technology limitations and deficiencies, using software engineering to develop scalable and sustainable improvements.
Skills
* Software engineering expertise in at least one programming language.
* Modern software development practices and CI/CD tools, strong public cloud experience.
* Observability tools and extensive knowledge of Linux internals, networking, containers and troubleshooting.
* Applying SRE practices in large organisations, with strong communication and problem-solving skills.
Working with Us
We accelerate our digital strategy, aiming to deliver exceptional customer experiences. We offer flexible working arrangements, a respectful, inclusive and flexible workplace empowering ideas and energy.