Partnering with a high-growth AI infrastructure company building their operations team from the ground up. Two levels available. Ground floor opportunity supporting some of the biggest names in enterprise AI.
As a Data Center Operations Analyst you will:
* Monitor live GPU clusters, compute, network, and physical data center systems across a follow-the-sun operation
* Triage incidents and route to the right resource, working closely with engineering and data center teams
* Manage and clean up alert noise, improve observability tooling, and contribute to workflow automation
* At the senior level: own improvements independently, define reliability metrics, write automation in Python or Bash, and mentor junior analysts
What they are looking for:
* NOC, data center, or infrastructure operations background
* Comfortable in a 24/7 production environment
* Hands-on experience with monitoring or observability tooling
* Senior level: proven ability to own process improvements and reduce operational toil independently
#J-18808-Ljbffr