Reliability Engineer Role
About the Position
As a Technical Duty Officer, you will be responsible for ensuring the reliability of all products and services within Xero.
Key Responsibilities:
* Develop and implement process frameworks and observability strategies to ensure rapid problem diagnosis, response, and service reliability.
* Collaborate with product teams to thoroughly analyze failures and integrate insights to improve service reliability, scalability, and operational efficiency.
* Provide expert leadership during critical outages, coordinating multiple teams to ensure streamlined decision-making and quick resolution.
* Promote a customer-focused approach by addressing and mitigating global customer environment issues, and fostering a culture of continuous learning and technical excellence within the team.
* Own the incident management process, driving enduring reliability across all products and services within Xero.
Requirements:
* Previous experience as a Site Reliability Engineer in an Operations or Engineering environment.
* Networking knowledge and ability to troubleshoot TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP issues.
* Coding experience preferably in Python building tools, scripting, or automation.
* Strong communication skills including the ability to translate technical issues/concepts into agreed actions.
What We Offer:
Xero offers generous paid leave, dedicated paid leave to care for your physical and mental wellbeing, health insurance, life insurance, and income protection.
Why Choose Us:
You'll do fulfilling work at Xero.