Amgen logo

Senior Data Scientist

Amgen
Remote friendly (Cambridge, MA)
United States
Operations

Role Summary

Senior Data Scientist responsible for developing, maintaining, and deploying first principles and/or machine learning models for biopharmaceutical processes, including cell culture, biologics formulation development, and fill/finish processes. Lead model design, select underlying technologies and infrastructure, and ensure successful application of models in Process Development. Work with cross-functional teams and communicate technical foundations of models for engineers and scientists.

Responsibilities

  • Design, develop, and deploy first principles, machine learning, and hybrid models to optimize biopharmaceutical processes.
  • Integrate modeling software with real-time equipment data to enable digital twins for improved process performance and understanding.
  • Build and maintain robust data pipelines and agentic workflows to streamline generation of impact-level insights.
  • Collaborate with cross-functional teams to translate scientific challenges into data-driven solutions and ensure seamless model integration into process development workflows.
  • Champion software development best practices, including version control, testing, and continuous integration to ensure model reliability, scalability, and reproducibility.

Qualifications

  • Required: High school diploma / GED with 10 years of Data Sciences experience OR Associate’s degree with 8 years of Data Sciences experience OR Bachelor’s degree with 4 years of Data Sciences experience OR Master’s degree with 2 years of Data Sciences experience OR Doctorate degree.
  • Preferred: Ph.D. in Chemical Engineering, Mechanical Engineering, Applied Math, or related field; strong background in mechanistic modeling and first principles; experience applying AI/ML algorithms to engineering problems; track record of leading modeling and data science projects; 3+ years coding experience in Python; understanding of biopharmaceutical processes and unit operations; experience using Git; familiarity with DevOps and software best practices; experience with data engineering & visualization tools (e.g., Databricks, Spotfire) and cloud computing (AWS); experience leveraging AI tools for scientific workflows; independent, self-motivated, organized, and able to multi-task in time-sensitive environments; excellent written and verbal communication; strong analytical skills.

Education

  • Doctorate degree (preferred); Ph.D. in Chemical Engineering, Mechanical Engineering, Applied Math, or related field.

Skills

  • Python programming (3+ years)
  • Git version control
  • Data engineering & visualization tools (e.g., Databricks, Spotfire)
  • Cloud computing and data storage (AWS)
  • DevOps practices (CI/CD, test-driven development)
  • Machine learning, first-principles modeling, and hybrid modeling
  • Strong communication and collaboration across cross-functional teams