How Do ML Jobs Fail in Datacenters? Analysis of a Long-Term Dataset from an HPC Cluster
The 6th Workshop on Hot Topics in Cloud Computing Performance (HotCloudPerf 2023)
Ph.D. student, Vrije Universiteit Amsterdam
Tech Lead MagnaData project
Research Focus
Scheduling complex data-driven workflows in datacenters