9am - 9:10am CST | Opening Remarks |
9:10am - 10am CST | Invited Talk 1: Memorization vs Reasoning in MoEs and Estimating Memory Consumption in Distributed Training
Rio Yokota Institute of Science Tokyo |
10am - 10:30am CST | Morning Break |
10:30am - 11:10am CST | Invited Talk 2: TBD
Matthew Dearing Argonne National Laboratory |
11:10am - 11:30am CST | Enabling Unstructured Sparse Fine-Tuning and Inference for Foundation Models on Wafer-Scale Engine
Haoyu Zheng, Yifan Zeng, Linghao Song, Murali Emani, and Wenqian Dong Session chair: Seyong Lee |
11:30am - 11:50am CST | WAGES: Workload-Aware GPU Sharing System for Energy-Efficient Serverless LLM Serving
Tianyu Wang, Gourav Rattihalli, Aditya Dhakal, Xulong Tang, and Dejan Milojicic Session chair: Seyong Lee |
11:50am - 12:10pm CST | OmniFed: A Modular Framework for Configurable Federated Learning from Edge to HPC
Sahil Tyagi, Andrei Cozma, Olivera Kotevska, and Feiyi Wang Session chair: Pedro Valero-Lara |
12:10pm - 12:30pm CST | Enhancing ChatPORT with CUDA-to-SYCL Kernel Translation Capability
Zheming Jin, Swaroop Pophale, and Keita Teranishi Session chair: Pedro Valero-Lara |
12:30pm CST | Closing Remarks |
August 8 (11:59pm, AOE), 2025
August 15 (11:59pm, AOE), 2025
September 5, 2025
September 26, 2025