Sanofi logo

CLD Senior Data Scientist

Sanofi
Remote friendly (Framingham, MA)
United States
IT

Role Summary

We seek a highly motivated Senior Scientist to join Cell Line Development, focusing on innovation and implementation of automated clone selection processes, high throughput data analytics, and data management solutions. The role supports end-to-end development, management, and optimization of data pipelines for Cell Line Development workflows in line with Sanofi’s Digital Data Strategy. The senior scientist will operationalize CLD data to support development projects by applying code-based analytics and data flow expertise to connect the laboratory with information systems. Location: Framingham, MA.

Responsibilities

  • Build and maintain integrated CLD data infrastructure.
  • Establish and maintain connectivity of laboratory equipment with existing data management solutions.
  • Work directly with data generated from wet bench experiments to ensure accurate integration and analysis.
  • Develop and implement data management solutions for clone screens and automation systems to support cell line development activities.
  • Gather and organize large and complex CLD data assets, perform relevant analysis.
  • Work with CLD team to understand data requirements and translate them into technical needs.
  • Propose and implement relevant data models and workflows.
  • Actively contribute to Data governance community and Sanofi’s “Play to Win” Digital Data Strategy.

Qualifications

  • Required: Master’s degree in science, engineering, or information management with minimum 4+ years working with data models and database architectures; OR Bachelor in science, engineering, or information management and 8 years of relevant experience; OR PhD with minimum 2 years of relevant experience.
  • Required: Experience in the biopharmaceutical industry.
  • Required: Proven experience working with wet bench data and translating experimental outputs into structured data pipelines.
  • Required: Experience working with database models and query tuning.
  • Required: Working knowledge of SQL and Python (familiarity with other scripting languages is a plus).
  • Required: Self-motivated with attention to detail, excellent organization, time-management, and communication skills.
  • Preferred: Experience supporting laboratory-based workflows.
  • Preferred: Experience with high throughput laboratory automation equipment (e.g., Hamilton, Beacon, or Ambr).
  • Preferred: Experience with a data pipelining application (e.g., Biovia Pipeline Pilot).
  • Preferred: Experience working with biological registration systems (e.g., Genedata Biologics) or LIMS in general.
  • Preferred: Experience with scientific analysis and BI software packages (e.g., Tableau or PowerBI).
  • Preferred: Good understanding of cloud database technologies.

Education

  • Master’s degree in science, engineering, or information management; or Bachelor’s degree in the same fields; or PhD in a related discipline with corresponding experience as outlined above.