High-Performance Computing Principal Engineer

Elanco
Remote friendly (Indianapolis, IN)
United States
IT

Role Summary

The High-Performance Computing (HPC) Principal Engineer is the technical authority and strategic leader for Elanco's HPC ecosystem. You will shape the architectural vision, design, and long-term roadmap for the computational platforms that power critical research and development efforts. The role involves mentoring a team, influencing stakeholders, and building the next generation of scientific computing at Elanco.

Responsibilities

  • Architectural Leadership and Strategy: Shape the design, architecture, and strategic evolution of Elanco’s HPC, storage, and networking infrastructure to meet future research demands.
  • Technology Roadmapping and Innovation: Evaluate emerging technologies, conduct proof-of-concept projects, and build business cases for new investments to keep Elanco at the cutting edge of scientific computing.
  • Mentorship and Technical Guidance: Act as a senior mentor and technical escalation point for other engineers and support staff, fostering technical excellence and knowledge sharing within the team.
  • HPC System Management: Design, deploy, configure, and maintain Elanco’s HPC clusters and associated storage and networking infrastructure.
  • Advanced Performance Optimization: Proactively monitor system performance, troubleshoot bottlenecks, and tune the environment to ensure optimal efficiency and resource utilization.
  • User Support and Enablement: Act as the primary technical contact for our research and scientific user base, providing support, training, and guidance on how to best leverage HPC resources.
  • Automation and Tooling: Develop and maintain scripts and automation tools to streamline system administration, job scheduling, and monitoring tasks.
  • Job Scheduler Management: Manage and configure job scheduling systems to ensure fair and efficient allocation of computational resources.
  • Security and Compliance: Implement and maintain security best practices to protect sensitive data and ensure the integrity of the HPC environment.
  • Capacity Planning: Collaborate with stakeholders to forecast future computing needs and contribute to the strategic planning and evolution of Elanco’s HPC capabilities.

Qualifications

  • Required: Educational Background: A Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
  • Required: Strategic Thinking and Business Acumen: Ability to align technical strategy with business goals, develop multi-year roadmaps, and justify major technology investments.
  • Required: System Administration: Deep expertise in Linux/Unix system administration in a large-scale environment.
  • Required: HPC Technologies: Broad experience with HPC cluster management, including job schedulers and parallel file systems.
  • Required: Scripting Proficiency: Exceptional scripting skills for automation, particularly in Python and Bash.
  • Required: Networking Knowledge: Solid understanding of high-speed networking fabrics like InfiniBand or Omni-Path.
  • Required: Cloud and Hardware Acumen: Familiarity with Public Cloud services, specifically Microsoft Azure and Google Cloud Platform (GCP), as well as server, storage, and networking hardware components common in HPC environments.
  • Required: DevSecOps: Proven experience with relevant DevSecOps concepts and tooling, including CI/CD, Git, Docker/Kubernetes, and Infrastructure-as-Code (HashiCorp Terraform).
  • Required: Problem-Solving: Excellent analytical and troubleshooting skills, with the ability to diagnose and resolve complex technical issues efficiently.
  • Required: Communication Skills: Strong interpersonal and communication skills, with a customer-centric approach to supporting a diverse scientific user community.
  • Required: Leadership and Mentoring: Proven experience leading complex technical projects and mentoring junior and senior engineers.

Additional Requirements

  • Travel: 0-10%
  • Location: Indianapolis, IN Global HQ (Hybrid Work Environment)