HPC Systems Administrator/Programmer IV, IT-Research Cyberinfrastructure
Job no: 494281
Work type: Staff
Categories: Information Technology, Full Time
Pay Grade: 30S
Context of the Job:
The IT Research Cyberinfrastructure (IT-RCI) group is responsible for the operation, maintenance, support, and growth of research computing at the University of Delaware (UD). IT-RCI supports the University’s High Performance Computing (HPC) and other systems with direct research impact. Researchers and department IT staff rely heavily upon IT-RCI for services and technical support, and it is one of several groups within the larger organization of UD Information Technologies.
The University’s HPC systems are a resource giving UD researchers access to leading edge computational and data storage systems. The HPC Systems Programmer IV facilitates the purchase and ongoing maintenance of the systems maintained by central IT, and will provide training and application support for UD researchers, staff and students. The incumbent designs, builds, writes programs, installs, tests, debugs, and integrates systems and documents his/her work to maintain and enhance the HPC systems to meet the needs of University researchers. The incumbent will play a critical role in expanding IT-RCI resources, including new cloud computing efforts and the introduction of an XSEDE level 2 resource provided by UD. The HPC Systems Programmer IV is considered essential personnel and provides off-hour, on-call coverage to support research computing resources.
The Systems Programmer IV (HPC) reports to the Director, Research Cyberinfrastructure
- Installs, configures, and supports a Linux HPC cluster to include rack management, networking, storage subsystems, provisioning system, authentication, workgroup management, scheduling, storage and research software installation/management.
- Develops and implements a cloud HPC framework within the University.
- Facilitates the procurement of HPC computers and related, including bid solicitation and vendor negotiations.
- Includes repair of, or execution of maintenance agreements for the HPC systems. Interfaces with hardware/software vendors to coordinate and schedule maintenance and system updates.
- Recommends and develops modifications and enhancements to existing hardware and software, new implementations, and installation standards to increase system utilization and improve monitoring.
- Provides on-call support for critical systems.
- Participates in or leads work designing specifications and implementing new systems software packages. Writes software programs for customizations to address user needs, adds functionality, modifies systems, and maintains interfaces with other systems.
- Develops and maintains documentation, procedures and policies supporting use of central HPC systems and infrastructure.
- Participates in workshops and training in support of HPC systems and infrastructure.
- Provides direct technical support to users of HPC systems. Consults with faculty and staff on computing related matters and assists in the preparation of computational budgets for proposals.
- Works collaboratively with other computing staff and campus researchers to implement new HPC technologies. Assists, mentors, and trains junior systems programmers.
- Maintains a high degree of familiarity with software and hardware associated with Research Cyberinfrastructure. Cross trains with and assists colleagues with their work as needed.
- Participates in strategic and long-range planning for campus network and system services.
- Performs miscellaneous job-related duties as assigned.
- Bachelor's degree and four years of related HPC experience and/or equivalent combination of education, certification, and experience.
- Proficiency with Linux/Unix system administration, system and network security, development tools and methodology, and system management.
- In-depth knowledge of one or more operating systems, common networking equipment, and major network protocols, and multiple technologies.
- Ability to plan, design, develop, implement, test, and modify complex systems.
- Analytical, evaluative, and objective critical thinking skills.
- Effective verbal and written communication, interpersonal, and organizational skills.
- Ability to work independently as well as part of a team.
- Knowledge of current technological developments/trends in area of expertise.
- Ability to maintain close interactions with a diverse group of researchers in a dynamic environment.
- Experience using cloud technologies with preference for cloud HPC and cloud automation.
- Ability to specify, write, and develop computer programs using a range of programming. languages; experience with UNIX shell, C, Python, Perl, and PHP is preferred.
- Experience integrating with Internet2 or XSEDE services is preferred.
- Experience in a University computing service environment is desirable.
- Experience using advanced scientific and engineering applications is desirable.
Back to search results Apply now Refer a friend