System Reliability Specialist
We are seeking a highly skilled System Reliability Specialist to ensure the reliability, scalability, and performance of our software systems.
This role bridges the gap between software development and IT operations, focusing on automation, monitoring, and incident response to maintain high system uptime and user satisfaction.
* Monitor system performance and availability using tools like Prometheus, Grafana, and ELK stack.
* Build and maintain scalable infrastructure using tools such as Terraform, Ansible, and Kubernetes.
* Automate operational tasks and deployment pipelines (CI/CD).
Key qualifications include proficiency in programming languages such as Python, Go, Java, or Ruby, along with a strong understanding of Linux systems and networking fundamentals.
Requirements:
* Bachelor's degree in Computer Science, Engineering, or related field.
* 3+ years of experience in System Reliability Engineering, DevOps, or Software Engineering.
Preferred Qualifications:
* Experience with distributed systems and microservices architecture.
* Certifications in cloud technologies.
Location:
Sydney