About the role
As a Site Reliability Engineer, your core focus is to ensure that our systems are available, scalable and secure. You will operate with a high degree of autonomy as a deeply skilled technical expert. Our priority is to provide an environment with high availability, integrity and security. Over the past year, we have achieved greater than 99.998% platform uptime, including planned maintenance. However, our environment has also grown more complex and our customer numbers have more than doubled. To support this scale, we aim to have a reliable foundation with one‑step blue/green deployments, best‑in‑class tooling and a strong DevOps culture that encourages automation and continuous monitoring at every level.
Due to the nature of our work and regulatory requirements, applicants must hold Australian or New Zealand citizenship.
What you’ll do
AWS Architecture:
Design and implement scalable, secure multi‑account AWS architectures tailored for highly sensitive data environments.
Cloud Networking:
Apply deep expertise in VPCs, DNS, PrivateLink, complex routing, and load balancing to ensure highly resilient connectivity.
IAM & Security:
Define, implement, and enforce strict security boundaries across compute, storage, and networking layers.
Identity Platforms:
Drive the operational management of identity providers (Okta, Auth0), overseeing SSO, RBAC, federation, and secure access patterns.
Containers & Orchestration:
Leverage strong hands‑on experience with Docker and Kubernetes in production environments to scale our workloads efficiently.
CI/CD Systems:
Build and maintain robust deployment pipelines utilising GitHub and Buildkite, with an uncompromising focus on reliability and automation.
Infrastructure as Code:
Develop reusable, modular Terraform patterns to ensure consistent, repeatable infrastructure provisioning.
Monitoring & Incident Management:
Design actionable alerting, manage secure "break‑glass" access procedures, and continuously improve signal‑to‑noise quality for incident responders.
Observability:
Operate, tune, and evolve a modern observability stack (Loki, Grafana, Tempo, Mimir) to provide deep insights into system behaviour.
Database Infrastructure:
Manage backups, replication, high availability, and performance tuning for crucial data stores.
Performance & Toil Reduction:
Continuously patch and optimise systems and eliminate bottlenecks to improve overall system reliability and permanently reduce operational toil.
What you’ll bring
Extensive experience with Amazon Web Services.
Experience with infrastructure as code and related scripting, preferably Terraform.
Experience building Docker images and maintaining containers and hosts, with extensive production Kubernetes experience.
Experience with Linux administration.
Deep expertise in cloud networking, observability systems, and Identity/Access Management (IAM).
A proven track record of reducing toil through advanced automation and creating highly resilient, fault‑tolerant systems.
Technologies you’ll work with
Infrastructure:
AWS services, Linux, Kubernetes.
Databases:
MongoDB (Atlas), Postgres.
Release Engineering:
Terraform, Docker, Git, Buildkite, GitHub.
Observability & Ops:
Loki, Grafana, Tempo, Mimir, AWS CloudWatch, AWS GuardDuty, Opsgenie.
Identity:
Okta, Auth0.
Application Context:
Node.js (hapi), React, React Native.
What’s in it for you?
A mix of in‑office and remote working (3 days in the office).
Learning and career development opportunities.
18 weeks’ paid primary carer’s leave.
12 weeks’ paid secondary carer’s leave.
Annual team‑based volunteer day.
Birthday leave.
Power Up Day (additional day of leave).
Weekly team social events, snacks, craft beer and wine, ping pong and video games.
Taco Tuesdays.
Mental health and wellness initiatives.
Novated leasing.
Tyro is committed to a diverse, inclusive workplace where everyone thrives. We welcome applicants of all backgrounds and are an equal opportunity employer. If you need accommodations or adjustments at any stage of the recruitment process, simply inform our Talent team during your conversation with them.