This is a hands‐on Site Reliability Engineer position focused on improving reliability, raising automation maturity, and shaping the Linux, monitoring, authentication and container platforms at AC3's Hybrid Cloud practice.
Hybrid 50/50 home/office.
Responsibilities
* Own and improve Linux based services, automation tooling, monitoring infrastructure, authentication integrations and container platforms.
* Build smarter automation using tools such as GitHub Actions, Ansible, Puppet, Jenkins and related orchestration technologies.
* Help unify observability across the platform, with monitoring that is scalable, resilient, service‐aware and aligned to SLIs and business outcomes.
* Support and evolve Docker, Kubernetes and Rancher‐based platforms, while contributing to AC3's codebase and engineering IP through GitHub.
* Manage the Linux platform lifecycle end to end: provisioning, configuration, validation, patching, break/fix, capacity and continual improvement.
* Work closely with other platform teams, field escalations and join a shared on‐call roster with strong team support.
Success Metrics
* Maintain exceptional availability standards for the platform.
* Reduce avoidable incidents by engineering out risk and repeat issues.
* Increase automation coverage, improve documentation and leave the platform in better shape than found.
* Work well across teams and build credibility through thoughtful communication, follow‐through and sound judgement.
Essential Qualifications
* Australian citizen, eligibility for Commonwealth Security Clearance.
* Strong Linux systems administration capability, ideally with a command line and automation focus.
* Practical scripting ability in Python, Bash or similar.
* Experience with automation, orchestration and/or CI/CD tooling.
* Exposure to monitoring or observability platforms and a genuine interest in reliability engineering.
* A broad understanding of infrastructure across cloud, virtualisation, networking and platform environments.
Nice to Have
* Experience with OIDC or SAML integrations.
* Exposure to Docker, Kubernetes, Rancher, Terraform or OpenTofu.
* Experience with Red Hat Satellite, Oracle Spacewalk, database platforms or MSP environments.
* Vendor, cloud or ITIL certifications.
Platforms Engineer – Site Reliability • Sydney, NSW, AU
#J-18808-Ljbffr