An open-source, framework and hardware agnostic, extensible and customizable, distributed platform design for evaluating and profiling ML models across datasets/frameworks/systems.
Automatic μBenchmark Generation to Compute “Lower-bound” Latency and Inform Optimizations of Deep Learning Models on GPUs.
A Scalable Project Submission System for Parallel Programming Courses.
A Scalable Lab Submission System for Parallel Programming Courses.