Hacker News new | past | comments | ask | show | jobs | submit | acetabulum's comments login

If you use Horovod Elastic, I think you can avoid this problem working across a cluster of Spot instances.

https://horovod.readthedocs.io/en/stable/elastic_include.htm...


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: