Join the NIH High Performance Computing Team
|
The NIH HPC Staff advances research at NIH by planning, managing, and supporting the NIH High Performance Computing (HPC) resource, including
a high-performance cluster, petabye-scale storage, and
leading-edge biomedical applications.
The team of system administrators and scientists supports the computational needs of over 600 labs and 2500 NIH intramural researchers in multiple research areas, including genomics, computational chemistry, structural biology, and machine/deep learning.
The positions are located on the NIH campus in Bethesda, Md.
We are currently recruiting for the following positions. If interested, send your resume to hpc-jobs@hpc.nih.gov
(Updated Sep 2022)
|
Storage Administrator
We are currently looking for an enthusiastic individual who will work with our staff of system administrators and scientists to design, deploy
and administer the storage systems that support the Linux-based NIH HPC cluster.
The incumbent will be a critical part of the team responsible for managing 40+ PB of storage for Biowulf.
This individual will:
- Configure, monitor, and upgrade large scale, high performance storage systems
- Propose, evaluate, and recommend new storage technologies such as: parallel filesystems, object storage systems, and cloud based storage
for use in and with the HPC environment
- Respond to user requests regarding storage allocations and serve as a subject matter expert on storage for users and other HPC staff members
- Consult and collaborate with scientist and system administrator coworkers to solve storage issues encountered by users
- Perform general Linux administration and user support tasks
Required skill set:
- 5+ years Linux administration experience
- Experience designing, implementing, and administering high performance or enterprise storage
- A demonstrated track record of successfully providing end user facing support in a fast-paced production environment
- Excellent oral and written communication skills
- Ability to work on-site as needed
Additional desired skills are below. Candidates with a subset of these skills are encouraged to apply.
- A working knowledge of one or more parallel filesystems (e.g. GPFS/Spectrum Scale, Vast, Lustre, BeeGFS, etc.)
- Professional experience in environments with greater than 1 PB of storage
- Strong knowledge of block and network level storage protocols (e.g. SCSI, SAS, Fibre Channel, FCoE, SRP)
- Experience administering NFS and/or SMB/CIFS
- Networking expertise (e.g. Ethernet, Fibre Channel, TCP/IP, and/or InfiniBand)
- Experience with enterprise backup software (e.g. NetBackup, CommVault, etc.)
- Knowledge of Linux filesystem auditing tools
This is a contractor position.
System Administrator
We are currently looking for enthusiastic individuals who will work with our staff of system administrators and scientists to design, deploy and administer the NIH HPC cluster, petabyte-scale data storage systems, and associated specialized systems.
This individual will:
- Configure, build, and install upgrades to the Linux operating system and related utilities.
- Participate in optimizing resource management and scheduling software and policies.
- Provide technical support, troubleshoot problems and develop appropriate computational strategies for researchers using HPC resources.
- Consult and collaborate with scientist coworkers to determine best system configurations for applications.
Required skill set:
- RedHat or CentOS Linux system administrator, certified or equivalent experience.
- Ability to program in at least one of the following languages: Python, C/C++, or Perl.
- Ability to work both independently and as part of the team; flexibility in dealing with assignments and in working on
several projects simultaneously; ability to communicate well with people of diverse backgrounds and computer knowledge.
- Demonstrated ability to configure, deploy and manage a major system area such as batch system, network, data storage, backup system, database system, or distributed computing.
- Excellent oral and written communication skills.
Additional desired skills are below. Candidates with a subset of these skills are encouraged to apply.
- Experience working in a HPC environment.
- Experience with batch systems such as SLURM or PBS.
- Experience with virtual machine technology and/or cloud computing.
- Configuration management using Ansible or similar tools.
- Working knowledge of MySQL.
- Network management experience especially Infiniband.
- Experience managing parallel and cluster file systems such as NFS, GPFS, or Lustre.
- Developing secure web applications.
- Experience presenting and/or teaching.
- Knowledge of Linux security auditing tools.
- Familiarity with vulnerability management tools such as Tenable.
- Implementing technical security configuration baselines such as CIS benchmarks.
A bachelor's degree and 2+ years work experience or equivalent knowledge and expertise in Linux systems administration is required. This is a contractor position.
Support Scientist
We are currently looking for motivated biomedical scientists who will work with our existing staff of scientists and system administrators to:
- Provide guidance, support, training and advice to NIH intramural researchers.
- Help install, maintain and support a wide variety of biomedical applications.
- Communicate with NIH scientists to answer questions, troubleshoot problems and develop appropriate computational strategies.
- Collaborate with system administrator coworkers to determine best system configurations for applications.
- Keep up-to-date with current and emerging computational biology technologies and tools.
Candidates must have:
- Familiarity with Linux/Unix, shell scripting
- Excellent oral and written communication skills.
- Experience presenting and/or teaching.
- Demonstrated ability to work independently and as part of a team.
- Self-motivation with good organizational, troubleshooting and problem-solving skills.
- Ability to work on several projects simultaneously.
Desired skills are below. Candidates with a subset of these skills are encouraged to apply.
- Hands-on experience with Cryo-EM analysis or Machine/Deep Learning analysis
- Experience using clusters and batch systems.
- Familiarity with a scripting or programming language (e.g. Perl, Python, C)
- Experience compiling and building scientific applications.
- Understanding of bottlenecks associated with large-scale data analysis.
- Experience with biological databases.
- Demonstrated experience learning and utilizing new technologies or software.
Candidate must have a Masters or PhD in the biological sciences. This is a contractor position.