We're hiring a Senior Site Reliability Engineer! Full‐time | Permanent
Preferred locations: Sydney, Brisbane, or Melbourne. We're also open to Perth for the right candidate!
Hybrid working: 2 days in office, 3 days WFH.
The Role
We're looking for a Senior Site Reliability Engineer to help drive the reliability of our live SaaS solutions across the region. You will report to the SRE Engineering Manager and work closely with product, engineering, and development teams.
Responsibilities
* Design, build, and maintain infrastructure and platform services, including Kubernetes and observability tooling.
* Implement infrastructure as code, configuration management, and automated testing to ensure reliable, repeatable environments.
* Contribute to code and configuration reviews to improve scalability, maintainability, and reuse.
* Apply an AI‐first mindset, using development tooling such as Cursor and Kiro.
* Build and tune AI agents to accelerate delivery and automate repetitive tasks.
* Monitor production systems, troubleshoot issues, and improve logging, monitoring, alerting, and runbooks.
* Participate in on‐call rotations, incident response, and post‐incident reviews to improve long‐term reliability.
* Partner with product, engineering, and development teams to translate requirements into practical infrastructure solutions.
* Identify risks related to operability, security, performance, and cost, and recommend appropriate trade‐offs.
* Contribute to operational quality through runbooks, security practices, performance tuning, and process improvements.
* Proactively identify issues, raise concerns, and stay current with emerging SRE practices and technologies.
Qualifications
* 5+ years of experience in SRE, DevOps, or a similar role.
* Solid hands‐on experience with AWS and Terraform.
* Practical experience running workloads on Docker and Kubernetes.
* Proficient in at least one development or scripting language (Python, Go, Bash).
* Knowledge of APM, logging, and metrics systems (New Relic, Prometheus, or ELK).
* Understanding of system & network security fundamentals.
* Experience participating in incident management.
* Strong problem‐solving skills and ability to work well autonomously.
Preferred Skills
* Knowledge of databases such as MySQL or PostgreSQL.
* Experience with Azure.
* Understanding of the aged or disability care sector.
* Previous experience with AlayaCare or Procura products.
* Interest in maximizing the usage of AI agents.
Benefits
* Competitive salary + company stock (RSUs).
* 5 wellness days per year.
* $1,000 per year flexible benefits package.
* 22 weeks company‐paid parental leave.
* 2 days company‐paid volunteer leave.
* Team lunches, events & wellness activities.
* Open, inclusive, and collaborative culture.
* Opportunities to make a real impact in the care space.
Need adjustments to participate in the recruitment process? Reach out to our HR team at hr-anz@alayacare.com.