Want to see how your resume matches up to this job?
A free trial of our JobsAI will help! With over 2,000 biopharma executives loving it, we think you will too!
Try it now — JobsAI.
Role Summary
Manager, Clinical Informatics
Responsibilities
Developing and maintaining a toolkit with key functions, APIs, and summaries that help users understand, interpret, and interact with phenotype data. In addition to Python, knowledge of R, SQL and/or C++ is a definite plus.
Developing the tools and code that will transform electronic health records, surveys, laboratory assays, or digital device data into a harmonized tall and narrow format compatible with RGC analytical tools, applications and processes. You will probably be writing and updating code in Python and using associated data science libraries, such as pandas, Polars, NumPy, scikit, and others.
Reviewing the structure, content, and quality of phenotype data extracted from electronic health records, surveys, digital devices, or laboratory assays. Each of these datasets may include data on 100,000s of people and require coordination and input from multiple stakeholders with varied expertise.
Discussing the challenges and opportunities of using health data from electronic health records, surveys, digital devices, and laboratory assays to characterize human health and disease. Expertise in cutting-edge statistical methods for epidemiology is a definite plus.
Working with modern cloud environments and platforms. A knowledge of AWS and related toolkits will be useful for your day-to-day work. You will be using your computational skills to execute analysis and data processing at scale and to facilitate automation and repeatability of all key processes.
Presenting results and summaries of these datasets and data processing plans to a variety of technical audiences, ranging from experts in statistics, epidemiology, genetics, and computation to experts in biology, drug design, and medicine. You will need outstanding communication skills and an ability to summarize and present to a variety of technical audiences.
Working in a highly interactive environment with a team of colleagues. We highly value the ability to interact, learn, and teach so that you and other skilled individuals consistently achieve high levels of motivation, enthusiasm, and performance.
Qualifications
A demonstrated knowledge of Python and key data science libraries is a must. Knowledge of R, SQL and/or C/C++ is also highly valued. If you have contributed to code in GitHub or another public repository, let us know.
Understanding strategies for mapping structured and unstructured data to ontologies such as ICD-10, RxNORM and LOINC.
Knowledge and experience applying best practices for data quality control, summarization and visualization.
A passion for learning. We are a fast-moving team in a fast-moving company. You should expect to encounter challenging work and to learn many new skills.
Skills
Python and data science libraries; experience with R, SQL, and/or C/C++
Data transformation and harmonization of health data
EHR, survey, laboratory, and device data handling
Cloud platforms (AWS) and scalable data processing
Effective communication to diverse technical audiences
Education
PhD (or MS with additional years in lieu) in Computer Science, Health Informatics, Clinical Informatics, Biostatistics, or a related field
Additional Requirements
At least 3 years of relevant experience organizing large datasets in a research setting, whether in academia or industry