At Sony Interactive Entertainment, we strive to create an inclusive environment that empowers employees and embraces diversity.
About the Role
Our organization is seeking a highly skilled Service Reliability Engineer to join our Future Technology Group in Adelaide, Australia. As a key member of this team, you will play a significant role in delivering a great cloud gaming experience to our customers.
The successful candidate will be self-directed and able to participate in decision-making at different levels. You will have opinions on the state of our service and provide critical feedback during different phases of the operational lifecycle.
Key Responsibilities
* Lead technical discussions, especially around ongoing improvements in Reliability and Scalability.
* Contribute to High Level Designs (HLDs) for new products and platforms.
* Mentor junior SRE staff and enable them for success.
* Lead incident response and post-mortem activities within your assigned service team.
* Work with other Engineers in a cross-functional team to prioritize reliability improvements and reduce technical debt and toil.
* Contribute to code to improve reliability.
* Implement automation to reduce ongoing toil.
Requirements
* A minimum of 5+ years working experience in Software Development and/or Linux Systems Administration role.
* Strong interpersonal, written and verbal communication skills.
* Availability to be scheduled in on-call rotation.
Essential Skills
* Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
* Development experience in one or more of the following programming languages: Python, Bash, Go, Java, C++, or Rust.
* In addition, experience with at least 3 of the following topics: Distributed data storage at scale, Data Aggregation technologies, Scaling and running traditional RDBMS with High Availability, Monitoring & Alerting, Incident Management toolsets, Kubernetes and/or AWS deployment and management, Software Distribution, Configuration Management, S/W Performance analysis and load testing.