Regeneron logo

Data Engineer, Analytical & Biological Mass Spectrometry (ABMS)

Regeneron
5 hours ago
Remote friendly (Tarrytown, NY)
United States
IT
Key Responsibilities:
- Design, develop, and maintain scalable, automated LC-MS analytical data pipelines from raw data acquisition through cloud storage, processing, structured archival, and visualization.
- Design, build, and optimize ETL/ELT workflows, data integrations, and APIs for interoperability across heterogeneous systems (e.g., LC-MS instruments, LIMS, SDMS, data processing software, enterprise data stores).
- Partner with IT and DEA teams to design, deploy, and manage data infrastructure (data lakes, data warehouses) for robust ingestion, processing, and storage.
- Drive platform reliability and performance via proactive monitoring, observability, and continuous improvement.
- Establish standards for data quality, code quality, compliance, accessibility, and platform reliability; support adoption through documentation, training, and best-practice guidance.
- Stay current with emerging technologies and evaluate innovative approaches in data engineering, scientific informatics, and operational analytics.

Required Qualifications:
- Bachelorโ€™s or Masterโ€™s degree in Computer Science, Data Engineering, Software Engineering, Data Science, Bioinformatics, Computational Biology, Computer Engineering, Information Systems, or a related quantitative discipline.
- 0โ€“5 years of hands-on experience in data engineering or scientific data infrastructure.
- Proficiency in Python and SQL; production-quality data pipeline code with version control, testing, documentation, and code review.
- Experience developing APIs, ETL/ELT pipelines, or data access layers.
- Experience with relational database design and building structured data stores from semi-structured/unstructured scientific data.
- Experience with cloud platforms (AWS preferred).
- Understanding of data validation, logging, and error-handling in production pipelines.
- Strong communication skills translating scientific requirements into well-documented, maintainable technical solutions.

Preferred Qualifications:
- Biopharmaceutical/biotech/life sciences experience, especially in analytical laboratory environments.
- Familiarity with mass spectrometry raw/processed data formats.
- Familiarity with LC-MS software ecosystems (e.g., Skyline, LabKey Panorama, Protein Metrics Byosphere, Genedata Expressionist, Waters UNIFI/Empower).
- Familiarity with LIMS/SDMS/ELN platforms (e.g., Benchling, NuGenesis, IDBS) including API integration or workflow configuration.
- Familiarity with orchestration tools (e.g., Nextflow), shell scripting (e.g., Bash), JSON/configuration formats.
- Experience with Docker; building data connectors/API integrations for Power BI, Spotfire, or Tableau.

Benefits (if explicitly stated):
- U.S. benefits may include health and wellness programs (medical, dental, vision, life, disability), fitness centers, 401(k) match, equity awards, annual bonuses, paid time off, and paid leaves (e.g., military and parental leave).

Application Instructions:
- Apply now to take your first step towards living the Regeneron Way.