Role Summary
Praxis is seeking a Senior Statistical Data Scientist to support the development, validation and automation of analytical pipelines and statistical models that support metadata-driven clinical data processing, reporting, and regulatory submissions. This hands-on, technical role is ideal for a data scientist or statistical programmer who enjoys coding, problem-solving, and working cross-functionally to bring rigor, reproducibility, and automation to clinical reporting workflows. The Senior Statistical Data Scientist will contribute directly to R/Python development while also setting technical standards, mentoring peers, and ensuring readiness for R-based regulatory submissions.
Responsibilities
- Support the design, development, and validation of R/Python code to automate generation of analytical datasets and TLFs within a metadata-driven pipeline.
- Translate SAPs and metadata specifications (YAML/CSV) into executable and reproducible code.
- Build and validate R packages and data science tools supporting both exploratory and confirmatory analyses, ensuring full traceability and audit readiness.
- Implement and validate statistical models (e.g., MMRM, ANCOVA, logistic regression) using R packages such as mmrm, emmeans.
- Collaborate with IT to integrate data science and statistical programming workflows within Databricks and CI/CD pipelines for continuous validation and reproducibility.
- Collaborate across programming, biostatistics, and data standards functions to ensure dataset definitions, derivations, and metadata align with controlled standards.
- Conduct peer code reviews, unit testing, and automated validation; ensuring deliverables meet submission-quality and reproducibility standards.
- Mentor and guide team members in best practices for programming, validation, and automation.
Qualifications
- Bachelor’s or Master’s degree in Statistics, Biostatistics, Data Science, or a related field with 5+ years of statistical programming experience in the pharmaceutical/biotech industry including hands-on experience with R and/or Python.
- Proven experience preparing or supporting R-based regulatory submissions (e.g., R package validation, R-based analysis delivery, or submission readiness).
- Strong understanding of CDISC ADaM and SDTM data structures, and their use in analytical workflows.
- Experience developing and validating reusable R/Python libraries and functions.
- Proficiency with Git, Bitbucket, and CI/CD automation pipelines.
- Working knowledge of GxP and Part 11 compliance.
- Preferred familiarity with YAML/JSON configuration and metadata-driven programming workflows.
- Prior experience migrating from SAS to R/Python environments.
- Knowledge of R validation frameworks (e.g., risk-based testing, reproducibility documentation).
- Experience with exploratory analytics or visualization in R or Python within a regulated framework strongly preferred.
- Excellent documentation and validation practices.
- Collaborative and proactive mindset; able to operate independently in a small, agile team.
Education
- Bachelor’s or Master’s degree in Statistics, Biostatistics, Data Science, or a related field.
Additional Requirements
- The physical and mental requirements of our roles include regular use of a computer, devices or other office equipment, clear communication, and occasional movement. You’ll need comfort with screen work, basic hand coordination, and focus.