Hacker News
danielhanchen
on Dec 2, 2023
on: Show HN: 80% faster, 50% less memory, 0% loss of a...
Great question - I'm actually not sure. I'll probably write up some minimum requirements. I know the QLoRA paper (https://arxiv.org/pdf/2305.14314.pdf) has an approximate calculation of the weights' size, but not of the gradient updates.