How Do ML Jobs Fail in Datacenters? Analysis of a Long-Term Dataset from an HPC Cluster
ICPE HotCloudPerf 2023
Ph.D. student, Vrije Universiteit Amsterdam
Tech Lead MagnaData project
Research Focus
Scheduling complex data-driven workflows in datacenters