Reliability Engineer Opportunities
We are seeking an experienced Senior Site Reliability Engineer to fill a critical role in our team.
Key Responsibilities:
* Lead efforts to enhance system reliability, including incident response, traffic planning, and Service Level Objectives (SLOs).
* Maintain and improve monitoring tools for logs, metrics, and traces.
* Manage and optimize the Kubernetes production environment.
* Utilize monitoring and experience to boost system performance and reliability.
* Collaborate with engineers to support Continuous Integration/Continuous Deployment (CI/CD) processes.
* Clearly communicate technical concepts to various stakeholders.
* Contribute to infrastructure projects and related tasks.
* Participate in on-call rotations and contribute to improving incident response playbooks and practices.
* Help define and enforce best practices for service deployment, scalability, and fault tolerance.
* Support the integration of observability and reliability into the development lifecycle.
Required Skills and Qualifications:
* Tertiary education and/or relevant industry certifications.
* Proven experience in DevOps, Site Reliability Engineering, platform operations, or a similar discipline.
* Strong working knowledge of Kubernetes in production environments.
* Hands-on experience with cloud platforms (AWS) and infrastructure-as-code (IaC).
* Proficiency with CI/CD tools such as GitHub Actions or GitLab.
* Exposure to scripting or coding in Python, Rust, Go, or similar languages is a plus.
* Familiarity with observability platforms such as Grafana, Prometheus, or similar.
* Hands-on experience with observability tools (logs, metrics, and traces) and incident management.
* Understanding of CNCF ecosystem projects such as Linkerd and Prometheus.
* Proactive, improvement-focused mindset with a passion for building reliable systems.
Benefits:
* Competitive base salary and bonus structure.
* Working from Home allowance.
* Learning and Development allowance.
* Wellness allowance.
Job Ref: 3946537
Apply Today: