Job Description:
We are seeking a highly skilled Site Reliability / GitOps Engineer to join our Information Systems team. This role offers an exciting opportunity for an automation-first technologist with a passion for Linux to build a career and drive success with Ubuntu and open source products.
Key Responsibilities:
* Develop the infrastructure-as-code (IaC) practice within the IS team by increasing automation and improving IaC processes
* Automate software operations for re-usability and consistency across private and public clouds, considering distributed systems complexities
* Develop new features and improve the resilience and scalability of the existing cloud and container portfolio
* Maintain operational responsibility for all core services, networks, and infrastructure
* Troubleshoot, perform capacity planning, and investigate performance issues; set up and maintain observability tools such as Prometheus, Grafana, and Elasticsearch
Requirements:
* Deep experience defining operations in code using version control, peer review, and CI/CD to roll out changes
* Strong modern engineering background, including peer review, unit testing, SCM, CI/CD, and Agile
* Python software development experience, particularly with large projects
* Familiarity with Linux networking, routing, firewalls, and various forms of Linux storage
* Hands-on experience administering enterprise Linux servers
* Extensive knowledge of cloud computing concepts and technologies
* Bachelor's degree or higher in computer science or a related field
* Effective communication skills in English