Role Summary
Process Data Engineer III – MSAT, based in Framingham, MA. Hybrid work arrangement: 3 days per week in Framingham. Work directly with process engineers to manage the development and design of automated systems and provide advanced analytics and system modeling support across multiple functions of Cell Culture, Purification and Analytics.
Responsibilities
- Design, develop, and maintain robust ETL processes to integrate data from various sources
- Create and optimize data models to support business intelligence and analytics initiatives
- Collaborate with cross-functional teams to identify data requirements and deliver tailored solutions
- Implement data quality checks and ensure data integrity across all systems
- Develop and maintain documentation for data processes, models, and pipelines
- Continuously improve data infrastructure and processes to enhance performance and scalability
- Partner with internal stakeholders from multiple departments to identify opportunities for applying data engineering and process monitoring solutions for new manufacturing facilities
- Exploit opportunities to leverage manufacturing data to develop data engineering and ML models and real-time process monitoring approaches
- Translate data analytics outcomes to non-scientific audiences, champion data-driven decision making, and empower end-users to perform simple analytics
- Support various stakeholders to ensure timely delivery of data engineering, visualization, ML and AI capabilities, with a focus on data integrity, validation, and data governance
Qualifications
- Required: Bachelor's degree with 5+ years or a Master's Degree with 3+ years or a PhD with 1 year of experience in data sciences, computer sciences, chemical engineering, or a related discipline in pharmaceutical industry
- Required: Proficiency in SQL and experience with relational databases (e.g., MySQL, PostgreSQL)
- Required: Strong programming skills in Python or R
- Required: Experience with ETL tools and processes
Skills
- Knowledge of data historian systems (e.g., Aspen IP21 or AVEVA PI AF & PI Event Frames)
- Familiarity with Snowflake, AWS, Azure, GitHub, and no-code/low-code tools such as Dataiku
- Experience developing data visualization platforms (Power BI, R Shiny, Streamlit)
Education
- As specified in Basic Qualifications (see above) including degree requirements