High Performance Computing Planning Engineer

Posted May 05, 2023
Apply: PGTEK


Do you have a high performance computing (HPC) background and want to have a global impact? As an HPC Delivery Engineer, you will deliver HPC, Hadoop, and OpenStack implementation services for a leading Global Services group. You will also provide technical implementation planning to facilitate integration of the purchased solution into a customer's environment.
When not at a customer site, you will work from a home office, contributing to best practices, IP development, and other HPC, Hadoop, and OpenStack initiatives. The position requires up to 75% travel to customer sites nationally.
The prospective candidate should possess experience with HPC, big data, and cloud service delivery according to field guides, Solution Integration Documents, and/or custom statements of work. Primary experience should be within the HPC domain, specifically with head and compute node clusters, InfiniBand networks, 10 GbE networks, Red Hat Enterprise Linux, power and cooling, GPU clusters, and related HPC technologies, tools, and software products. Experience should also include integration and testing of varied applications; performance testing and tuning of HPC systems, including the ability to use various hardware and software solutions to maximize application-specific performance; benchmarking cluster performance using HPL; and integration and configuration of job schedulers.

Responsibilities:
  • Lead the integration of hardware and software for HPC, big data, and cloud solutions.
  • Supervise junior technicians at customer locations to ensure quality, efficiency, and compliance with deployment procedures and installation guides.
  • Interface directly with customer site contacts, Dell project managers, and other key stakeholders.
  • Identify potential risk/resolution approaches for targeted HPC, big data, and cloud deployments.
  • Provide pre-sales scoping assistance and SOW reviews as needed.
  • Provide solution integration planning as needed.
  • Contribute to improving delivery quality and optimization through best practices and IP development.
  • Identify corrective actions to drive resolutions for systemic issues.
  • Stay up-to-date with current technology trends.
  • Escalate break/fix troubleshooting and issue resolution.

Skills Required:
  • Linux distributions including CentOS, Scientific Linux, and SUSE Linux Enterprise Server (SLES)
  • Experience with schedulers including SGE, PBS Pro, Moab, and Torque/Maui
  • Networking expertise including Cisco, PowerConnect, Force10, and Arista
  • InfiniBand expertise including Mellanox and QLogic (Intel)
  • Experience with cluster management software including Bright Cluster Manager (BCM), Rocks, and xCAT
  • Familiarity with Big Data and Cloud platforms including Hadoop and OpenStack
  • Storage experience including PowerVault, DDN, GPFS, and Lustre

Education Requirements:
  • Undergraduate degree and 6+ years, or graduate degree and 4+ years, of relevant experience, including Linux expertise such as Red Hat Enterprise Linux (RHEL)

Certification Requirements:
  • Certifications such as RHCE/RHCSA, CCNA, CCNP, and Network+

ABOUT PGTEK

PGTEK is a true consulting organization dedicated to helping clients achieve their business and technology objectives, drawing on our decades of experience and business relationships. PGTEK invests in the educational advancement of our staff by providing the resources needed to complete professional and business certifications. Our company is our people, and we treat them like family.

EOE, including disability/veterans