About the Position
We're looking for a technical professional to join our Tech Ops SRE team focused on Commercial Cloud.
The ideal candidate will have expertise in Linux engineering and administration for thousands of bare metal servers and virtual machines.
Key Responsibilities:
* Have deep knowledge of operational aspects including Availability, Latency, Throughput, Monitoring, Issue Response (analysis, remediation, deployment) and Capacity Planning with respect to Latency and Throughput.
* Work collaboratively with other engineers distributed across the globe.
* Participate in on-call rotation with other team members.
* Troubleshoot server hardware issues.
* Use passion for technology to ensure platform operates 24x7.
* Continuously learn and champion new technologies and best practices with others, raising the technical IQ of the team.
Requirements:
* Bachelor's degree or equivalent experience in Computer Science.
* A minimum of three years of experience working in large-scale production environments.
* Experience writing scripts and programs for automation, tools, frameworks, dashboards, and alarms.
* Proficiency in one or more languages such as Java, Python, Go.
* Knowledge of storage technologies including SAN, NAS, NFS, Object Storage, Free NAS, i SCSI.
* Infrastructure experience with Linux, Windows, VMware, Docker, Kubernetes, etc.
* Technical documentation skills.
* Configuration management experience with Puppet, Chef, Ansible.
* Understanding of application design principles and operational trade-offs.
* Analytical skills coupled with a strong sense of urgency, ownership, and drive.
* Ability to work effectively in diverse teams with other SREs and Engineers.
* Strong communication and presentation skills.
About Us
We are a global leader in cybersecurity dedicated to protecting people, processes, and technologies. We're committed to stopping breaches and redefining modern security with AI-native platforms.