Role Summary
Senior Scientist to lead the design and implementation of AI orchestration systems within the AI/ML Foundation team. This role combines expertise in agentic AI systems, data engineering, and software architecture to build the infrastructure that enables seamless coordination of AI capabilities across Computational Sciences and Global Research. You will be responsible for creating intelligent orchestration layers that connect diverse AI models, data pipelines, and computational resources to accelerate drug discovery across all therapeutic modalities.
Responsibilities
- Design and implement AI orchestration frameworks that integrate agentic systems, data pipelines, and model serving infrastructure to enable coordinated multi-model workflows.
- Build scalable orchestration layers connecting predictive models, generative models, and foundation models with experimental data sources for end-to-end workflow automation.
- Develop data engineering solutions including data ingestion pipelines, transformation workflows, feature stores, and model serving infrastructure supporting discovery across modalities.
- Create intelligent orchestration systems that coordinate agentic AI components for autonomous task decomposition, tool selection, and execution across scientific domains.
- Establish monitoring, observability, and governance frameworks ensuring reliability, reproducibility, and transparent decision-making across AI systems.
- Partner with computational scientists, data engineers, and research teams to ensure orchestration infrastructure meets the needs of diverse discovery workflows.
- Stay current with advances in agentic AI, workflow orchestration, and distributed systems; evaluate and integrate emerging technologies.
Qualifications
- Required: PhD in Computer Science, Data Engineering, Computational Science, or related field with 2+ years relevant experience, OR MS with 6+ years relevant experience.
- Required: Strong software engineering skills with proficiency in Python and experience with distributed systems and cloud infrastructure (AWS, GCP).
- Required: Experience with workflow orchestration frameworks (Airflow, Prefect, Dagster, or similar) and data pipeline development.
- Required: Familiarity with agentic AI frameworks (LangChain, AutoGen, or similar) and LLM integration patterns.
- Required: Experience with containerization (Docker, Kubernetes) and microservices architecture.
- Required: Strong problem-solving skills and ability to work across teams in a fast-paced R&D environment.
Preferred
- Preferred: Experience building AI/ML platforms or infrastructure in pharmaceutical or life sciences settings.
- Preferred: Familiarity with scientific computing workflows and computational chemistry/biology tools.
- Preferred: Experience with model serving frameworks (TorchServe, Triton, BentoML) and feature stores.
- Preferred: Knowledge of monitoring and observability tools (Prometheus, Grafana, MLflow).
- Preferred: Experience with event-driven architectures.
Education
- PhD in Computer Science, Data Engineering, Computational Science, or related field with 2+ years relevant experience, OR MS with 6+ years relevant experience.
Additional Requirements
- The position will be based in Cambridge, MA