Role Summary
Leads data infrastructure and management strategy, with a strong focus on managing multi-omics and research data. Responsible for designing, implementing, and maintaining a robust data platform that supports Championโs research and development efforts, in compliance with FAIR (Findable, Accessible, Interoperable, and Reusable) principles.
Responsibilities
- Implement and maintain a comprehensive data platform architecture that supports the entire research data lifecycle
- Develop and support multiomic and biological data analysis tools and platforms that enable efficient research workflows
- Develop and enforce data governance policies and standards, ensuring FAIR principles are consistently applied across all research data
- Manage and optimize data storage, processing, and retrieval infrastructure using AWS cloud services
- Develop robust data management strategies that support reproducibility and data integrity in scientific research
- Collaborate with research teams, bioinformaticians, and IT departments to ensure seamless data flow and accessibility
- Implement advanced data security and privacy measures to protect sensitive research information
- Design and maintain metadata management systems for comprehensive data cataloging
- Understand customer user needs as pertains to data access, analysis, and utilization and enable efficient collaboration models
- Ensure compliance with relevant regulatory requirements and industry best practices
- Manage partner data-licensing agreements, including:
- Tracking and ensuring compliance with licensing conditions
- Monitoring data usage and adherence to contractual obligations
- Oversee data transfer processes with third-party partners, including:
- Developing secure data transfer protocols
- Implementing robust data exchange mechanisms
- Ensuring data privacy and regulatory compliance during external data transfers
- Managing data access controls and audit trails for third-party interactions
- Other duties may be assigned verbally at any time.
Qualifications
- Required: Extensive experience in data platform management, preferably in a biotech or pharmaceutical research environment
- Required: Deep understanding of and direct experience with multi-omics data types, including genomics, transcriptomics, proteomics, and metabolomics
- Required: Expertise in handling Next-Generation Sequencing (NGS) data
- Required: Advanced proficiency in AWS cloud services and infrastructure
- Required: Strong SQL skills and experience with database management systems
- Required: Comprehensive knowledge of FAIR data principles and scientific data management best practices
- Required: Strong programming skills (Python and/or R)
- Required: Demonstrated experience in managing data licensing agreements and third-party data transfers
- Required: Understanding of data protection regulations and compliance requirements
- Required: Excellent strategic planning and technical leadership skills
- Required: Strong communication abilities across technical and non-technical teams
- Required: Ability to translate complex technical concepts for diverse audiences
- Preferred: Experience with bioinformatics data platforms and analysis tools
- Preferred: Experience with data pipeline development and workflow management tools
- Preferred: Knowledge of contractual negotiations and intellectual property considerations
- Preferred: Familiarity with data transfer agreements and compliance frameworks
- Preferred: Understanding of machine learning and AI applications in biomedical research
Skills
- Strong strategic planning and technical leadership
- Excellent communication across technical and non-technical audiences
- Ability to translate complex technical concepts for diverse audiences
- Programming in Python and/or R
- Experience with AWS cloud infrastructure and data governance
- Knowledge of FAIR data principles and scientific data management best practices
Education
- PhD or masterโs degree in Bioinformatics, Computer Science, Data Science, or a related field
Additional Requirements
- Must be able to sit for long periods of time using a computer in a typical office environment