Senior Linux System Administrator – Support Engineer (High Performance Computing Focus)
This role has been designed as "Onsite" with an expectation that you will primarily work from an HPE partner / customer office.
Who We Are
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world. Our culture thrives on finding new and better ways to accelerate what's next. We value diverse backgrounds and provide flexibility to manage work and personal needs. We make bold moves, together, and are a force for good.
Job Description
We are seeking an experienced Senior Linux System Administrator / System Support Engineer with expertise supporting High Performance Computing (HPC) environments to join our HPC support team. Design, implement, maintain, and optimize Linux‑based infrastructure—ensuring high availability, security, and performance for mission‑critical systems and services, including complex HPC platforms. Provide advanced technical support and troubleshoot challenging issues across hardware and software, acting as a trusted advisor to internal teams and external customers. On‑site presence is mandatory to deliver exceptional customer support and maintain system performance. A current mandatory TSPV Government Security Clearance is required.
Key Responsibilities
* Deploy, configure, maintain, and troubleshoot Linux servers (Red Hat, CentOS, Ubuntu, or others) across physical, virtual, and cloud environments.
* Support, maintain, and optimize HPC systems, including installation, servicing, and advanced technical troubleshooting of hardware/software and parallel file systems (e.g., Lustre, GPFS).
* Monitor system performance, availability, and security using industry‑standard tools and practices; ensure compliance with organizational policies and external regulations.
* Plan and execute upgrades, patches, enhancements, and migrations to ensure systems are current, secure, and optimized.
* Automate system administration tasks using scripting languages (Bash, Python, Perl, etc.) and configuration management tools (Ansible, Puppet, Chef, Terraform).
* Implement and maintain backup/recovery strategies, disaster recovery plans, and system documentation.
* Collaborate with development, network, and security teams to support application deployments and troubleshoot issues, particularly in multi‑technology HPC environments.
* Provide technical consulting, mentoring, and guidance to junior team members and contribute to internal knowledge sharing.
* Ensure compliance with strict security protocols in sensitive environments (e.g., government, research); TSPV clearance will be required.
* Participate in on‑call rotation and respond to system incidents and outages.
* Assist with technical proposals, solution design, and enterprise‑level architecture for new projects and customer engagements.
Qualifications
* Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent work experience.
* At least 5 years of hands‑on experience managing Linux systems in production environments, including HPC systems.
* Expertise in Linux/Unix operating systems, parallel file systems (Lustre, GPFS), and networking technologies.
* Proficiency in scripting/programming languages (Bash, Python, Perl, C++).
* Experience with automation/configuration management tools (Ansible, Puppet, Chef, Terraform).
* Strong understanding of networking concepts (TCP/IP, DNS, DHCP, firewalls, VPNs).
* Familiarity with monitoring/logging tools (Nagios, Grafana, ELK Stack).
* Experience with containerization technologies (Docker, Kubernetes).
* Excellent problem‑solving, analytical, and communication skills; able to diagnose complex technical problems to root cause.
* Demonstrated ability to work independently in multi‑technology environments and collaborate across teams.
* Relevant certifications (RHCE, LFCS, AWS Certified SysOps Administrator, etc.) are a plus.
* TSPV Government Security clearance (mandatory).
Benefits
* Competitive salary and performance‑based bonuses
* Comprehensive health, dental, and vision insurance
* Retirement plan options
* Paid time off and holidays
* Professional development opportunities
* Flexible work arrangements
HPE is an Equal Employment Opportunity / Veterans / Disabled / LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are based on qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Hewlett Packard Enterprise is EEO Protected Veteran / Individual with Disabilities. HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.
#J-18808-Ljbffr