SC24 BoF


Collaborative Pathways for Designing an Integrated Infrastructure for Research Excellence

SC24 BoF

Tuesday - Nov 19, 2024
5:15pm-6:45pm ET
Room B212

In modern science, integrating scientific workflows, especially those involving AI and spanning multiple facilities, is paramount. AI workflows, leveraging machine learning and data analytics, are crucial for processing and interpreting vast amounts of data, driving unprecedented breakthroughs. Multi-facility workflows have expanded the horizons of scientific research by enabling the seamless integration of computational resources, data, and experiments across different locations, fostering collaboration and resource sharing.

The inaugural BoF session at SC22 focused on integrating HPC and AI workflows. Building on this, the second BoF at SC23 gathered the Workflows Community to explore challenges, opportunities, and future pathways for continuum and multi-facility workflows. Scientists provided insights into domain-specific challenges, discussing coordination among computing and experimental facilities, metadata tracking, co-design of experiment facilities, task standardization, edge-to-cloud data exchange, adaptive compression, and defining common workflow concepts. These sessions highlighted the importance of seamless integration across cloud-based data pipelines, multi-facility instruments, and HPC infrastructures. The enthusiasm and feedback underscored the need for deeper exploration of continuum and multi-facility workflows, critical for the resilience and real-time demands of modern scientific workflows.

In this BoF session, we will deepen the discourse initiated during the previous BoF sessions and elevate these discussions by incorporating a wider array of perspectives, including those of scientists who develop and utilize these workflow systems within their specific domains. By examining the challenges and opportunities within multi-facility workflows, the session will explore themes such as near real-time processing and integration of data from multiple sources and further discuss the topics of coordination among different computing and experimental facilities, and co-design of experiment facilities. This initiative is motivated by the need to address the continuum of workflows that demand uninterrupted computing access and multi-facility workflows that enhance resilience and efficiency across distributed environments.

Agenda