Vertex Pharmaceuticals logo

Data Engineering, Principal Research Data Engineer

Vertex Pharmaceuticals
over 2022 years ago
Remote friendly (Boston, MA)
United States
$148,000 - $222,000 USD yearly
IT

Role Summary

The Principal Research Data Engineer will lead Vertex's Data Engineering team within the Data & Software Engineering (DSE) organization, enabling scientists with data. You will develop, curate, and maintain data assets to support analytics, modeling, and investigations, collaborating with data scientists, data engineers, and platform engineers to scale the Vertex Data Platform. The role drives scalable data solutions that accelerate research and enable data-driven decision making.

Responsibilities

  • Data Engineering – Integrate and curate data from research systems and artifacts to support analytics, modelling, machine learning, and investigation. Collaborate closely with research engagement teams to understand requirements, and translate them into data solutions.
  • Data Engineering management – Manage, maintain, and improve the Vertex Data Platform solutions that support and enable Research scientists.
  • Delivery management – Estimate, architect, and execute on delivery of critical data solutions across the Research domain, in partnership with the DSE leadership team.
  • Operations Management – Manage a team of data engineers to maintain compliant, timely, secure, and reliable data workloads for Research. Work alongside our DSE MLOps team to support complex workloads that leverage curated and model ready data.
  • Innovation champion – Advocate for process enhancements and opportunities to improve our capabilities with a focus on efficiency, scale, and data connectivity.

Qualifications

  • Minimum of 9 years of development experience using Snowflake, Databricks, Spark, Redshift, or equivalent data technologies.
  • Minimum of 9 years of experience in pharmaceutical research, with an emphasis on data engineering, data science, data integrity, and data governance.
  • Prior experience leading Data Engineering projects and teams.
  • 3+ years leveraging Databricks, Snowflake, AWS, or equivalent cloud data platforms.
  • Demonstrated experience with pipeline technologies like Astronomer / Airflow, MLFlow, etc.
  • Demonstrated ability to work independently and manage multiple projects that require collaboration across functional areas.
  • Skillful, collaborative team player able to develop rapport and credibility with stakeholders.
  • Demonstrated ability and willingness to teach, engage and support others as they learn new technologies and concepts.
  • Enthusiasm for and the ability to quickly learn new technologies and tackle difficult problems.
  • Strong presentation, verbal, and written communication skills.
  • Working knowledge of key workflow tools, including JIRA and Confluence.