When & Where
- Remote position in the US or on-site at Armonk, NY or Warren, NJ offices.
Discover Your Role
- Work with a cross-functional team to optimize and implement the data strategy for digital transformation and AI/ML.
- Design and document end-to-end data architectures for analytic, operational, and research needs.
- Implement a modern data platform (e.g., Snowflake, Databricks).
- Improve data interoperability and standardization across systems and business units.
- Develop pipelines to monitor and improve internal and external (e.g., CRO partner) data quality.
- Partner with informatics and AI engineers to optimize data utility.
- Monitor and optimize performance of data architectures and platforms.
- Develop/implement critical metrics to measure impact of the data strategy.
- Stay current with advances and evaluate for adoption.
This Role Requires
- Advanced degree preferred (PhD +2 yrs or MS +4 yrs); minimum 5 years leading data engineering implementations in life sciences/healthcare.
- Expertise designing/maintaining clinical or biomedical data infrastructure/architecture.
- Proficiency with modern data platforms (e.g., Snowflake, Redshift, BigQuery, Databricks) and Python/SQL/R.
- Experience maintaining code repositories (e.g., Bitbucket) with version control.
- Cloud architecture (AWS/Azure/GCP) and DevOps practices (certifications a plus).
- Experience building/scaling pipelines for structured and unstructured data and integrating across the enterprise.
- Knowledge of HIPAA, GDPR, 21 CFR Part 11; CDISC, HL7, FHIR.
- Knowledge of ML pipelines and integration with clinical data platforms.
- Travel up to 20%.