Job Opportunity
We are seeking a Senior Service Reliability Engineer to play a key role in delivering a great cloud gaming experience. As a member of the team, you will be responsible for influencing design and operational decisions towards the overall stability of the gaming service.
About the Role
The successful candidate will lead technical discussions around ongoing improvements in Reliability and Scalability, create High Level Designs (HLDs) for new products and platforms, mentor junior SRE staff, and contribute to code to improve reliability.
Key Responsibilities
* Lead team technical discussions around reliability and scalability enhancements
* Develop HLDs for new products and platforms
* Mentor junior engineers
* Contribute to codebase improvement
Requirements
* 5+ years of experience in Software Development or Linux Systems Administration
* Strong interpersonal, written and verbal communication skills
* Availability for on-call rotation
Skills & Knowledge
* Proficient in Linux Production Systems Engineering with experience managing large-scale Web Services infrastructure
* Development experience in Python, Bash, Go, Java, C++, or Rust
* Familiarity with distributed data storage at scale, NoSQL at scale, Data Aggregation technologies, Scaling traditional RDBMS, Monitoring & Alerting, Incident Management, Kubernetes, AWS, Software Distribution, Configuration Management, and Performance analysis
What We Offer
We offer a dynamic work environment where you can grow your skills and expertise. You will have opportunities to work on challenging projects and collaborate with experienced professionals.