High Performance Computing Linux Systems Engineer (#S4670PO)

University of Vermont   Burlington, VT   Full-time     Information Services / Technology (IT)
Posted on November 13, 2023

The University of Vermont (UVM) is looking for an experienced Linux systems engineer to help build state-of-the-art high-performance computing (HPC) solutions for our researchers.  The Vermont Advanced Computing Core (VACC) has three advanced clusters, with thousands of compute cores and large-scale GPU clusters.  We also host a large sharded in-memory database.

 

We are looking for someone who can join our team and help design, build, configure, and run the VACC cluster hardware and software.  Our team works collaboratively to deliver excellent Research Computing services to our researchers.  This position will help researchers effectively use the resources available, and recommend changes to improve research services.  The candidate will always have new and exciting challenges to work on with the support of peer administrators.

 

For the right candidate, fully remote (US-based) work is allowed for this position.

 

UVM is a comprehensive research university comprising eight schools and colleges, and offers its employees competitive salaries, outstanding benefits including tuition remission, and a superior academic workplace.

 

Required Qualifications:

  • Bachelor’s degree in a STEM-related field such as Computer Science, plus four years of systems experience in a large-scale server environment. Four additional years of systems administration/engineering work experience can substitute for the degree requirement.
  • Thorough knowledge of Linux operating systems, network architecture, and Linux shell scripting.
  • Demonstrated experience deploying effective research computing systems.
  • Expertise with at least one systems programming language (C, Python, Perl, Rust, etc.) and software version control.
  • Experience building and packaging open-source software from source.
  • Configuration management experience creating repeatable processes and desired dynamic system states.
  • Excellent troubleshooting skills, including system and network debugging experience.
  • Strong customer service ethic, effective communication skills, and collaborative teamwork with peer sysadmins.

 

Any of the following qualifications would strengthen your candidacy:

  • Experience with HPC technologies such as SchedMD Slurm, parallel computing, MPI programming, Spack, Singularity/Apptainer, and CUDA.
  • Experience with MongoDB in a sharded architecture.
  • Low-latency networking experience (e.g., InfiniBand).
  • Experience with Open OnDemand, Jupyter Notebook, and supporting technologies.
  • Clustered filesystem expertise (e.g., GPFS/Storage Scale).
  • Experience compiling, installing, and running open-source software.
  • Experience working at an academic institution.


 

We may hire at either a senior-level or a mid-level position. For further information, or to apply with an electronic application, resume, and cover letter, visit our website: https://go.uvm.edu/hpcadmin

 

The University of Vermont is an Equal Opportunity/Affirmative Action Employer. Applications from women, veterans, individuals with disabilities and people from diverse racial, ethnic, and cultural backgrounds are encouraged.