Job Description
We are seeking a highly skilled and experienced High-Performance Computing professional to join our team. The ideal candidate will have hands-on experience managing Linux environments, strong software development skills, and expertise in high-performance computing.
* Design and Implementation:
The successful candidate will be responsible for designing, implementing, maintaining, and supporting high-performance compute and storage systems. This includes monitoring system and storage performance, as well as the performance of network components.
* Performance and Fault Monitoring:
The ideal candidate will implement and support performance monitoring and fault monitoring systems to ensure the smooth operation of our high-performance computing environment.
* Tooling Development:
The successful candidate will develop tooling to compile, package, install, and upgrade software and operating system components at scale.
* Collaboration and Communication:
The ideal candidate will collaborate with team members and across teams to write code and build testing infrastructure spanning multiple programming languages.