Regeneron logo

Staff Engineer, Data Science (PMPD)

Regeneron
Remote friendly (Tarrytown, NY)
United States
Operations

Role Summary

Staff Engineer, Data Science to join the Data Science team in the Data Enablement and Analytics (DEA) group within PMPD, pairing bioprocess engineering expertise with AI/ML capabilities to accelerate biologics development and manufacturing. Design, implement, and operationalize models for upstream and/or downstream processes, turning data into prescriptive guidance and deploying production-grade models to enhance process understanding, optimization, and automation.

Responsibilities

  • Develop, validate, and maintain mechanistic, hybrid, and data-driven models for cell culture and/or purification processes.
  • Translate complex bioprocess questions into quantitative modeling strategies for scale-up, tech transfer, and continuous improvement.
  • Advance PMPD’s broader data-science and digital-maturity initiatives.
  • Collaborate with process engineers, citizen data scientists, IT, and manufacturing to coordinate modeling efforts enterprise-wide.
  • Build and deploy AI/ML-powered digital solutions on cloud-based analytics platforms.
  • Mentor citizen data scientists and champion best practices in model development, method selection, and code quality.
  • Explore and prototype GenAI approaches to enhance knowledge management and decision support.

Qualifications

  • Ph.D. in Chemical/Biochemical Engineering, Biotechnology, Applied Mathematics, or related field with 4+ years of industrial experience OR Master’s with 7+ years.
  • Deep mechanistic understanding of upstream and/or downstream bioprocess unit operations, scale-up/down, and critical quality attributes.
  • Demonstrated success modeling bioprocesses via first-principles, hybrid, or data-driven (ML) methods (preferred).
  • Strong foundation in AI/ML algorithms (regression, classification, Bayesian methods, deep learning, time-series, probabilistic modeling) and multivariate statistics for process modeling, real-time monitoring, and control (plus).
  • Expert programming proficiency in Python and SQL; experience with JMP, SIMCA, MATLAB (helpful).
  • Proven ability to communicate technical concepts to multidisciplinary stakeholders.

Skills

  • Cloud analytics platforms (e.g., Dataiku, Databricks).
  • Quality-by-Design (QbD) principles and Design-of-Experiments (DoE) for design space and robust control strategies.
  • PAT and chemometric modeling (e.g., Raman spectroscopy) for bioprocess monitoring and control.
  • Operations research techniques (e.g., combinatorial optimization, linear programming, MILP).
  • GenAI stacks (LLMs, vector databases, retrieval-augmented generation) and multimodal techniques.
  • Publication record in bioprocess modeling or AI for biomanufacturing (preferred).

Education

  • Ph.D. in Chemical/Biochemical Engineering, Biotechnology, Applied Mathematics, or related field (required for 4+ years) or Master’s degree with 7+ years of experience.
Apply now
Share this job