Sr Scientist, Data Science – R&D DSDH – Discovery Biologics (primary location: Cambridge, MA or Spring House, PA)
Key Responsibilities:
- Design, develop, and maintain Discovery Biologics data pipelines, integrating third-party solutions and external-partner data ingestion.
- Collaborate with data product owners and cross-functional partners to understand data requirements and deliver high-quality Biologics discovery solutions.
- Build integrations with Therapeutics Discovery systems, discovery repositories, and adjacent data sources.
- Optimize data workflows for ease of use, performance, scalability, and reliability.
- Monitor and resolve platform issues in a timely manner.
Qualifications:
Required:
- Advanced degree in Computational Biology, Bioinformatics, Data Science, Biomedical Engineering, Computer Science, or related field.
- Experience applying ML/AI in scientific domains (drug discovery/biology/chemistry/systems biology).
- Strong Python programming skills and experience with scientific/ML libraries (e.g., PyTorch, TensorFlow, scikit-learn, RDKit).
- Data engineering experience: data modeling, workflow orchestration, ETL/ELT pipelines, and cloud (AWS/GCP/Azure).
- Ability to work directly with experimental scientists to solve real R&D challenges.
Preferred:
- Life science experience (ideally antibody/protein engineering), strong problem-solving/analytical skills, familiarity integrating across HPC and/or MLOps; <10% travel.
Benefits (time off): Vacation (120 hrs/yr), Sick time (40 hrs/yr; CO 48; WA 56), Holidays incl. floating (13 days/yr), Work/Personal/Family time (up to 40 hrs/yr).