Description
The DCEO Cluster Manager position encompasses leadership responsibility for multiple data center facilities and their corresponding infrastructure. The successful candidate will direct a substantial organizational structure, providing oversight to Area Managers and Facility Managers and individual contributors while ensuring operational excellence across multiple data center locations.
Key job responsibilities
* Hiring, managing, and developing the Data Centre Engineering Operations management team including facility managers, area managers, chief engineers, and facility technicians.
* Establish performance benchmarks, conduct analyses, and prepare reports on all aspects of the critical facility operations and maintenance.
* Lead activities to advance/promote and improve safety and security culture within the team.
* Safety, security, and availability of incident response, incident management, incident resolution, and root cause analysis.
* Responsible for the on-site management of 24x7 shift technicians, senior shift technicians, sub-contractors and vendors, ensuring that all work performed is in accordance with established practices and procedures.
* Build and maintain sustainable organizational structure.
* Work with business development, real estate, engineering and construction team to forecast staffing and maintain.
* Mentor and develop employees.
* Operation and maintenance of mechanical, electrical, and controls systems for Amazon data centers include preventive maintenance, corrective maintenance, and change management.
* Manage all facility related repairs in a timely manner.
* Daily/weekly/monthly/Data Center Engineering Operations meetings and reporting.
* Run weekly availability meetings and report outcomes.
* Liaise with a global team on global initiatives as well as process and procedures, implement those initiatives locally in Melbourne.
* Vendor management of colocation data center services providers to meet or exceed contracted performance SLAs.
* Cost management including OPEX and CAPEX related to data centers you oversee.
* Continuous improvement of operational processes, procedures, methods, and tools.
* Lead and manage energy efficiency initiatives in your Cluster.
* Support capacity planning and management and actively involved in ongoing construction activities.
A day in the life
Your day will be a blend of strategic oversight and hands‐on problem‐solving. You'll navigate complex facility challenges, coordinate with cross‐functional teams, and ensure our data centers operate with precision and reliability. From monitoring critical systems to managing project timelines, every moment will be an opportunity to make a significant impact.
About the team
The MEL Cluster is growing with over 150 employees (currently) and a capacity growth trajectory that is exciting. The MEL Cluster is a ML hub for the APJC region and
Basic Qualifications
* Bachelor's degree in engineering or a related technical field
* Experience in industrial or commercial engineering or project management in Mission Critical facilities including but not limited to data centers, power generation, or oil/gas facilities
* 5+ years of work in a management position with 5 or more direct reports experience
* 5+ years of operations and on-call support for data center facilities, mission critical plants, or production facilities, or 4+ years of electrical or mechanical experience
* Knowledge of the electrical and mechanical systems involved in critical data center operations including systems such as feeders, transformers, generators, switchgear, UPS systems, ATS units, PDU units, chillers, pumps, air handling units, and CRAC units
* Experience hiring, developing, and managing high‐performing technical teams
* Ability to respond to any facility related emergencies in a timely manner, with sound recovery plans
* Experience in vendor management associated with data center operations, maintenance or improvements
* Experience in leading with a safety culture as the foundation of any and all work activities. Leading safety programs and managing safety activities
Preferred Qualifications
* Experience using Lean Process Improvement, Six Sigma methodologies or related performance metrics/process improvements to increase efficiency within processes, forecasting, planning, optimization, and logistics
* 2+ years of managing or implementing system health, performance monitoring, performing capacity planning, or process improvement and automation experience
* Experience in supporting procedures for production or mission critical environments to include ticketing, monitoring/metrics, and troubleshooting technical issues
* Experience in creating and managing budgets
IDE statement:
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
#J-18808-Ljbffr