High-Performance Computing Expert
This role is designed for a skilled professional with expertise in high-performance computing systems. As an Infrastructure Specialist, you will be part of a dynamic team responsible for delivering cutting-edge projects and providing expert training on advanced HPC systems.
Key Responsibilities:
* Provide technical guidance on software management, workflow optimization, and HPC system usage to ensure efficient operations.
* Develop and deliver training sessions to enhance the capabilities of your team members.
* Maintain and optimize HPC systems for batch processing and data-intensive tasks.
* Implement and upgrade infrastructure components following industry best practices.
* Collaborate with vendors and partners to resolve system faults or failures.
* Manage job scheduling software to allocate resources fairly.
* Contribute to strategic planning initiatives and forward-looking projects.
* Perform Linux system administration to maintain secure operations.
Requirements:
* A minimum of 5 years' experience managing and supporting Linux-based cluster environments.
* Knowledge of parallel file systems, mass storage, and hierarchical storage management.
* Proficiency in scripting and programming languages like bash, Perl, Python, and R.
* Strong organizational skills and ability to manage multiple priorities effectively.
* Excellent communication skills, both written and verbal.
Desirable Skills:
* Familiarity with containerization and virtualization technologies.
* Experience with CI/CD tools.
* Knowledge of Jupyter notebooks and related platforms.
* Experience with Windows-based HPC Clusters or tools.
* Proficiency in cloud services and orchestration platforms.
* Familiarity with Windows Subsystem for Linux and related technologies.