job summary: Randstad Federal is seeking a Sr HPC Administrator for an opportunity supporting NASA location: BELTSVILLE, Maryland job type: Contract salary: $85 - 95 per hour work hours: 8am to 4pm education: Bache
[
"
\n\n
\n\n
Duties and Responsibilities:
\n\n
Design and develop enhancements to the PBSPro batch scheduler based on customer-driven requirements.
\n\n
Work extensively with PBS vendor, Altair, on bug fixes and feature releases
\n\n
Write and shepherd scalable feature designs through the entire software development process, from requirements and use cases to release
\n\n
Apply best practices in software engineering, delivering projects on time, on budget, and with excellent quality
\n\n
Provide support to staff and end users to resolve batch scheduler issues
\n\n
Modify existing software to correct errors and/or improve performance
\n\n
Mentoring junior staff and cross training peers
\n\n
After hours/weekend support as required
\n\n
\n\n
Moderate Supercomputing System Administration that contributes to:
\n\n
Day-to-day operations of the Linux HPC clusters and storage systems
\n\n
Proactive monitoring, analyze, and correct system issues
\n\n
Development of scripts to automate repetitive tasks or tools to enhance support of the HPC systems
\n\n
System performance analysis and tuning
\n\n
Building, installing, and supporting user-requested software
\n\n
Supporting evaluation and assessment of new HPC technology
\n\n
Resolving user report issues and manage support tickets requests in Remedy
\n\n
\n\n
\n\n
\n\n
Requirements:
\n\n
Bachelor's degree in computer science or related field
\n\n
strong computer science background with in-depth systems-level knowledge in operating systems and networking
\n\n
Solid understanding of the software development process, including requirements, use cases, design, coding, documentation and testing of scalable, distributed applications in a Linux environment
\n\n
A minimum of 5 years' experience working with HPC systems and scheduling software (PBS, Slurm, or Moab/Torque)
\n\n
A minimum of 10 years of experience developing system software in heterogeneous, multi-platform HPC environments
\n\n
strong ability to analyze, debug and maintain the integrity of an existing code base
\n\n
Demonstrated equivalence of 5 years of Linux/UNIX user support experience and hands-on experience with administration of Linux systems
\n\n
Experience working with HPC applications and proficiency in at least C, C++, or Fortran
\n\n
Superior scripting skills and excellent attention to detail; proficiency in at least Python, Perl, or Bash
\n\n
strong ability to interact with customers to understand needs, elicit requirements, and get feedback on prototype solutions
\n\n
Excellent communication and people skills; excellent time management and organizational skills
\n\n
Experience with system configuration management tools e.g. , puppet, chef, ansible
\n\n
Experience with revision control software e.g. CVS, SVN, Git
\n\n
Track record of delivering commercial quality software on schedule with excellent quality through multiple release cycles
\n\n
Proficiency at technical writing
\n\n
\n\n
Preferred Skills (Requesting Manager Defines):
\n\n
Proficiency with analysis and problem-solving skills for debugging and optimization of applications
\n\n
Familiarity/proficiency with OpenMP and Message Passing Interface (MPI) programming
\n\n
Experience with Lustre, and InfiniBand
\n\n
Experience with cloud technologies (AWS, Azure, GCP), OpenStack or Kubernetes is a plus
\n
"
]