Position: Scientific Software Developer, Data Foundry
Responsibilities
- Design, build, and maintain data processing pipelines for chemical/biological, high-throughput, and automation-generated datasets, ensuring FAIR compliance and machine-actionability.
- Develop RESTful APIs and microservices for unified access to LIMS, ELNs, instruments, data warehouses (Postgres, Redshift, Snowflake), and analytical databases.
- Support continuous improvement of LIMS and adjacent systems to keep pace with evolving scientific workflows and with security and scalability needs.
- Work with bench scientists to prototype custom applications, dashboards, and workflow tools; validate via iterative feedback.
- Hand off mature prototypes to Tech@Lilly for enterprise scaling, following agreed transition criteria, documentation standards, and SLAs.
- Build lab automation integrations (equipment, scheduling systems, instrument data streams) with metadata and execution traceability.
- Develop robotic workflow control, instrument driver interfaces, and real-time data capture; create modular automation components configurable without code.
- Support agentic lab interfacing between automation platforms and AI-driven experimental planning.
- Build/operate cloud-native components (AWS/Azure/GCP) for containerized workflows, infrastructure-as-code, CI/CD, and orchestration (Prefect/Airflow/Nextflow).
- Apply DevSecOps (security scanning, code review, automated testing); participate in agile development.
Basic Requirements
- BS/MS in CS, Bioinformatics, Cheminformatics, Computational Biology, Chemistry, Biology, Biomedical Engineering, or related STEM.
- BS with 3+ years, or MS with 1+ years, of scientific software development experience involving experimental data and scientific workflows.
- Proficiency in Python and one additional language (Java, C#, Go, TypeScript); SQL skills appropriate to the role.
- Authorized to work in the United States full-time; visa sponsorship is not available.
Preferred Qualifications
- Building REST APIs, data pipelines, and/or microservices.
- AWS/Azure/GCP, Docker/Kubernetes, Git; LIMS/ELN (e.g., Benchling) and instrument integration.
- Lab automation/digital platform integration (OPC-UA, serial/USB, scheduling platforms).
- Data warehousing (Postgres/Redshift/BigQuery/Snowflake) and scientific standards/ontologies.
- Cheminformatics (RDKit/Schrödinger/MOE) or bioinformatics (Biopython/Bioconductor).
- SciPy/NumPy and scientific computing for modeling/optimization; orchestration (Prefect/Airflow/Nextflow/WDL) and CI/CD.
- Experience with compiled languages (C/C++ or others) for performance-critical workflows.
Benefits (if eligible)
- Company bonus; comprehensive benefits including 401(k), pension, vacation, medical/dental/vision, flexible benefits, life insurance, time off/leave, and well-being benefits.
Locations
- San Diego, CA; San Francisco, CA; Boston, MA; Louisville, CO; Indianapolis, IN.