System Reliability Engineer (SRE)

Perth

Talent International

Posted: 28 April

Offer description

Job Details

Location: Sydney (4 days in office, 1 day WFH)
Reports to: Technical Operations Director, APAC
Department: Global Technical Operations
Salary: $165,000 + super + annual bonus

The Opportunity

A leading music organisation is growing its Global Technical Operations hub in Sydney and is looking for a Service Reliability Engineer (SRE) to join the team.

This is more than a traditional ops role: it's an opportunity to bring a software engineering mindset to reliability, automation, and scalability in a global, high-impact environment.

What You'll Do

You'll join a collaborative, hands-on team responsible for the stability, performance, and scalability of global platforms. Working closely with development, infrastructure, and security teams, you'll help build a resilient environment that keeps music flowing, from studio tools to streaming systems.

- Design and maintain high-availability, high-performance systems for global applications.
- Automate everything, from infrastructure provisioning to deployment and scaling, using tools like Terraform, Ansible, and Python.
- Build robust monitoring and observability frameworks with AWS CloudWatch, Dynatrace, Prometheus, Grafana, or Splunk.
- Optimize CI/CD pipelines to improve reliability and deployment speed.
- Participate in on-call rotations, troubleshoot incidents, and lead post-incident reviews.
- Champion SRE principles: embed SLOs, SLIs, and error budgets into everyday engineering.
- Collaborate across Dev, Infra, and Security teams to create a culture of continuous improvement and reliability.

About You

You're a technically strong and level-headed engineer who loves automation, thrives in complex environments, and knows how to balance pragmatism with perfection.

- Background in systems administration (Linux/Windows) in a large-scale environment.
- Proficient in at least one programming language (Python, Go, or Java).
- Hands-on experience with AWS (GCP or Azure a bonus).
- Deep understanding of networking, containers (Docker/Kubernetes), and Infrastructure as Code (Terraform, Ansible).
- Experience with monitoring and observability tools such as Dynatrace, Prometheus, Grafana, or Datadog.
- Calm, collaborative communicator with strong analytical and problem-solving skills.

Bonus Points For

- Experience with ServiceNow or ITIL processes.
- Knowledge of chaos engineering, resilience testing, or advanced capacity planning.
- Previous experience managing distributed, global systems in production.

Global collaboration and career growth opportunities.

Interested? Apply now or contact Sophia Parrelli at Talent International for a confidential chat.
