Senior High Performance Computing System Engineer Jobs in Golden, CO at National Renewable Energy Laboratory
Title: Senior High Performance Computing System Engineer
Company: National Renewable Energy Laboratory
Location: Golden, CO
Salary: $150K – $200K*
Category: Energy & Utilities
The National Renewable Energy Laboratory (NREL), located at the foothills of the Rocky Mountains in Golden, Colorado, is the nation’s primary laboratory for research, development, and deployment of renewable energy and energy efficiency technologies. The NREL Computational Science Center (CSC) has an immediate opening for a Senior HPC Systems Engineer. This senior position is responsible for implementing and operating HPC systems and infrastructure in support of NREL’s mission.
Job duties andresponsibilitiesof the position:
Vision and Planning:
Help develop and articulate vision, and strategic planning for the Computational Science Center’s (CSC) High Performance Computing System Operations Team.
HPC System architecture, design, implementation and integration to serve the diverse computing needs of the HPC User Facility.
Develop technical specifications for HPC system procurements based on input from current and potential HPC system users.
Assess new computing technologies and the feasibility of system enhancements to determine potential value.
Identify emerging Computing and Storage technologies to be assimilated, integrated, and introduced within HPC Systems.
Implement technology tracking testbeds and help adapt workload to new architectures.
Demonstrate leadership for the “day-to-day” and ongoing project work and operations of the HPC environment and Energy Systems Integration Facility(ESIF) data center operations.
Maintain the integrity and continual operation of the computing, parallel and archival storage systems, and data center networks.
Maintain cyber security and privacy of the information and computing systems, communication lines, and equipment.
Soft Skills: (Communication/Team/Leadership):
Strong interpersonal and communication skills and ability to work as a team player is required.
Must be a proactive and solution-oriented problem solver.
Ability to bridge gaps between domain language (engineering, science) and computing solution language.
Proven ability to embrace and drive change.
Clear dedication to excellence and advancing beyond the current state.
Strong personal motivation.
Good networking skills, able to develop and maintain relationships with wider Scientific Computing community.
Bachelor’s Degree in Computer Science, Engineering or related and 9 or more years of experience. Or, Master’s Degree in Computer Science, Engineering or related and 7 or more years of experience. Or, PhD in Computer Science, Engineering or related and 4 or more years of experience.
Additional Required Qualifications
A solid foundation in scientific computing, experience with High Performance Computing or Cloud clustered environments and knowledge of the applications that run on them.
Experience with high performance computers and associated high performance networks and storage systems.
Experience with developing and implementing cyber security controls.
Architecture, engineering, installation and maintenance of clustered environments, including automated installation methods.
Management of Linux operating systems in a large-scale system environment.
Distributed file systems (particularly Lustre).
Scripting languages Python, PERL, Shell(s), Expect.
HPC workload management software (Slurm).
Monitoring tools (Zenoss, Nagios); confi…