HPC Specialist

Updated: 4 months ago
Location: New York City, New York

The incumbent will support the teaching and research computing needs of NYU Center for Data Science (CDS) scholars and the NYU High Performance Computing (HPC) systems as a member of the HPC team. This requires working directly with CDS instructors, instructional designers, researchers, students and other scholars to help them use HPC resources efficiently in research projects and instruction. The incumbent will work with faculty and researchers to ensure priority access to department-purchased hardware, especially during critical time periods. Responsibilities include determining the computing needs of various courses; advising on software and other architectural components to be deployed in support of instruction and research; selecting the architectural stack for instruction and determining how to procure and provision it; negotiating with vendors; installing, testing, maintaining, upgrading and administering compilers and applications on HPC systems; generating system usage and utilization data; developing and teaching tutorials on HPC topics; and maintaining documentation for a rapidly evolving HPC environment. Additionally, the selected candidate will fine-tune system configuration, which involves managing queues, schedulers and other resources in accordance with HPC policies while working closely with the Systems Administration team on hardware and storage. The candidate will also ensure that students and instructors in courses receive appropriate ongoing support for provisioning and issue resolution, serving as the point of contact for other parties involved in designing and providing the required services.

Required Education:
Bachelor's Degree
Preferred Education:
Master's Degree and Ph.D. preferred
Required Experience:
3+ years of relevant experience, including related experience with the following: Jupyter technology; running scientific applications on large-scale computers; optimizing and/or developing applications on UNIX-based systems; containers; designing/developing system enhancements and software applications; and programming with modern languages, application software, protocols, tools and utilities. Experience may include installation, maintenance and support, or an equivalent combination.
Preferred Experience:
5+ years of relevant experience. Experience working in an academic environment supporting faculty and researchers. A record of success in building and supporting large computing systems for HPC, Big Data Analytics and Artificial Intelligence research and teaching. Experience building and maintaining web sites.
Required Skills, Knowledge and Abilities:
Proficiency with multi-vendor hardware/software configuration. May require any of the following: problem identification/resolution, performance management/tuning, and design configuration/planning. Knowledge of related large-scale computing systems and/or product installs and maintenance. Ability to provide technical leadership and management of complex, large-scale computing systems projects. Ability to clearly communicate technical concepts to a non-technical audience. Excellent organizational and communication skills.
Preferred Skills, Knowledge and Abilities:
Queuing systems, schedulers and workload managers, configuration management, IPython notebooks and JupyterHub, Lustre, Hadoop. Experience with popular Artificial Intelligence frameworks (TensorFlow, PyTorch) as well as MATLAB, R, shell scripts, Perl and/or Python. WordPress.

EOE/AA/Minorities/Females/Vet/Disabled/Sexual Orientation/Gender Identity
