Role Summary
We are seeking a Computational Infrastructure Scientist to help build and extend the data systems that power large-scale biological and genetic research — the foundations of modern biology and precision medicine. This is a scientific engineering position, ideal for a PhD-level scientist who bridges biology, computation, and informatics. The successful candidate will have a deep understanding of the complexity and diversity of biological and genetic data, and a passion for designing robust, reusable infrastructure that makes this data usable, reproducible, and scalable across research and discovery pipelines.
Responsibilities
- Strategize and implement scientific data processing workflows that transform complex biological datasets into actionable insights.
- Design and develop innovative algorithms and ETL systems to address emerging challenges in biological and drug discovery data integration.
- Collaborate cross-functionally with domain scientists and engineers to translate biological questions into computational frameworks.
- Contribute to the long-term architecture and evolution of the data platform, ensuring scalability, transparency, and reproducibility.
- Develop cloud-based workflows and APIs that enable efficient access and analysis across diverse biological datasets.
- Document and share design decisions to promote reuse and institutional knowledge.
Qualifications
- Required: PhD in Computational Biology, Chemistry, Bioinformatics, or a related scientific field.
Additional Skills/Preferences
- Preferred: Strong programming experience in Python and strong familiarity with R.
- Preferred: Experience working in Linux environments.
- Preferred: Knowledge of biological databases, ontologies, and metadata systems.
- Preferred: Knowledge in PostgreSQL databases
- Preferred: Proficiency in Linux environments and Git (required).
- Preferred: Exposure to cloud platforms (e.g., AWS S3, EC2, or equivalent).
- Preferred: Experience working with workflow execution environments including NextFlow
- Preferred: Experience developing data-driven decision support applications including data and visual analytical tools
- Preferred: Exposure to Docker or containerized environments.
- Preferred: Strong communication skills and the ability to work independently on open-ended technical problems.
- Preferred: Understanding of web design and API is a plus.