As somebody who works along with Applied Scientist helping them with tasks relat...

dayeye2006 · 2024-01-23T03:13:19 1705979599

I think no optimization is possible withoutprofiling. I think getting yourself familiar with the tools to understand the performance of a model might be the 1st step, e.g., https://pytorch.org/tutorials/recipes/recipes/profiler_recip...

tanelpoder · 2024-01-23T03:25:23 1705980323

Yes - understand first, then fix. And you’ll understand by measuring/profiling things.

I’d also recommend the detailed pytorch optimization case studies by Paul Bridger:

https://paulbridger.com/

grepLeigh · 2024-01-23T04:07:59 1705982879

Brendan Gregg's work on system performance and profiling is a good place to start. A lot of ML perf boils down to Linux perf or what the heck is happening in an HPC scheduling system like SLURM. https://www.brendangregg.com/linuxperf.html