Role Summary
Principal Scientist in Genomic Technologies within the Human Genetics and Genomics (HGG) group at GSK. The role focuses on applying computational and statistical methods to analyze genetics and genomics data to guide drug discovery and development, and to support portfolio decisions. The successful candidate will join an existing Statistical Genetics team and work on projects including discovery of causal gene-phenotype links, identification of disease-relevant genes and targets, and predictive modelling using multi-omic data.
Responsibilities
- Evaluate, improve, test, and develop production implementations of best-in-class methods for analysis of genetic data, in collaboration with scientists in Data and Predictive Sciences, Biostatistics, or/and AI/ML teams.
- Generate large-scale analysis outputs, and/or implement state-of-the-art tools for dynamically analysing, interpreting, and visualizing genetic and genomic data.
- Collaborate with partners in applied and translational teams across HGG, in experimental Target Discovery, and in GSK disease area Research Units, to influence portfolio and pipeline decisions.
- Stay informed about recent research in the field and consider its potential for application within GSK.
- Contribute to a culture of innovation, quality, and willingness to learn and improve.
Qualifications
- Required: Advanced degree (PhD or equivalent) in a relevant scientific discipline.
- Required: Experience working with large-scale biobank data, such as UK Biobank and All of Us.
- Required: Excellent programming skills in R or Python, and application of techniques in reproducible research, literate programming, FAIR data principles, or software development.
- Required: Experience of evaluating, improving, testing, and/or developing methods for analysis and interpretation of large-scale genetics data.
- Required: Strong communication and team-working skills.
- Preferred: Deep understanding of statistical genetics methodology.
- Preferred: Experience working in cloud-based Trusted Research Environments.
- Preferred: Experience with machine learning and/or advanced statistical methods.
- Preferred: Experience working in multidisciplinary teams on complex and impactful projects.
- Preferred: Experience with analysis of very large datasets using distributed or cloud computing technologies (e.g., SQL, PySpark, BigQuery, Docker, Nextflow).
Additional Requirements
- Location: This position may be based at a GSK R&D site in UK (Stevenage), USA (Cambridge Tech Square, MA; or Upper Providence, PA), or Germany (Heidelberg).