Role Summary
As the High-Performance Computing (HPC) Principal Engineer, you will be the technical authority and strategic leader for Elanco's HPC ecosystem. You will shape the architectural vision, design, and long-term roadmap for the computational platforms that power critical research and development efforts. The role involves mentoring a team, influencing stakeholders, and building the next generation of scientific computing at Elanco.
Responsibilities
- Architectural Leadership and Strategy: Shape the design, architecture, and strategic evolution of Elanco’s HPC, storage, and networking infrastructure to meet future research demands.
- Technology Roadmapping and Innovation: Evaluate emerging technologies, conduct proof-of-concept projects, and build business cases for new investments to keep Elanco at the cutting edge of scientific computing.
- Mentorship and Technical Guidance: Act as a senior mentor and technical escalation point for other engineers and support staff, fostering technical excellence and knowledge sharing within the team.
- HPC System Management: Design, deploy, configure, and maintain Elanco’s HPC clusters and associated storage and networking infrastructure.
- Advanced Performance Optimization: Proactively monitor system performance, troubleshoot bottlenecks, and tune the environment to ensure optimal efficiency and resource utilization.
- User Support and Enablement: Act as the primary technical contact for our research and scientific user base, providing support, training, and guidance on how to best leverage HPC resources.
- Automation and Tooling: Develop and maintain scripts and automation tools to streamline system administration, job scheduling, and monitoring tasks.
- Job Scheduler Management: Manage and configure job scheduling systems to ensure fair and efficient allocation of computational resources.
- Security and Compliance: Implement and maintain security best practices to protect sensitive data and ensure the integrity of the HPC environment.
- Capacity Planning: Collaborate with stakeholders to forecast future computing needs and contribute to the strategic planning and evolution of Elanco’s HPC capabilities.
Qualifications
- Required: Educational Background: A Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
- Required: Strategic Thinking and Business Acumen: Ability to align technical strategy with business goals, develop multi-year roadmaps, and justify major technology investments.
- Required: System Administration: Deep expertise in Linux/Unix system administration in a large-scale environment.
- Required: HPC Technologies: Broad experience with HPC cluster management, including job schedulers and parallel file systems.
- Required: Scripting Proficiency: Exceptional scripting skills for automation, particularly in Python and Bash.
- Required: Networking Knowledge: Solid understanding of high-speed networking fabrics like InfiniBand or Omni-Path.
- Required: Cloud and Hardware Acumen: Familiarity with public cloud services, specifically Microsoft Azure and Google Cloud Platform (GCP), as well as the server, storage, and networking hardware components common in HPC environments.
- Required: DevSecOps: Proven experience with relevant DevSecOps concepts and tooling, including CI/CD, Git, Docker/Kubernetes, and Infrastructure-as-Code (HashiCorp Terraform).
- Required: Problem-Solving: Excellent analytical and troubleshooting skills, with the ability to diagnose and resolve complex technical issues efficiently.
- Required: Communication Skills: Strong interpersonal and communication skills, with a customer-centric approach to supporting a diverse scientific user community.
- Required: Leadership and Mentoring: Proven experience leading complex technical projects and mentoring junior and senior engineers.
Education
- Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
Additional Requirements
- Travel: 0-10%
- Location: Indianapolis, IN Global HQ (Hybrid Work Environment)