Hacker News new | past | comments | ask | show | jobs | submit login

Are the weights open source?




For anyone curious about file sizes for the PyTorch models:

* default/sam_vit_h_4b8939.pth: 2.4GB

* sam_vit_l_0b3195.pth: 1.2GB

* sam_vit_b_01ec64.pth: 358MB


Yes


Where? From the article:

>Currently, the code (without the weights) is available on GitHub


The weights are linked from the readme; avoiding large binary resources in GitHub repositories is fairly normal.



The weights are not "source".


That is likely true in the most pedantic sense, but in practice, if I create an algorithm that works by using a series of matrix transformations against a set of carefully chosen (read: "trained") matrices and I open-source only the matrix manipulation code but not the specially chosen matrices, I think there's a fair argument to say that I haven't open-sourced the entire algorithm.

In the phrase "the model is open-source in every sense of the word", that, IMO, must include the weights.


I'd be curious to know if you think open source companies should share all of their database records too.


I think of two broad categories of database records: transactional data (data created while running the system) and domain data (data created during development and shipped to production as part of the release process).

The former type of data I wouldn't expect to ever be open-sourced. The latter type might or might not be, depending on the intent of open-sourcing the related system.

If I created a human language translation system that used a SQL database to store the dictionary (domain) data and claimed the system was open-source without shipping the domain data, I think people would rightly say that the system was not fully opened.


The actual source code is not important. The source code can be printed on a single A4 page, the valuable final product is the weights you get after running the code for fifty million dollars of compute time.


If they start claiming they're "open weights" or "open final product" I'll be up in arms.


They are claiming that. They are also delivering on that claim: https://github.com/facebookresearch/segment-anything#model-c...


The weights seem to be under the same license, just distributed separately because it doesn’t make sense for the giant binary artifacts of training to be part of the source repository.


If I autogenerated a huge amount of C based on the weights, that added/multipled variables the same way the existing code+weights does, then would it be “source”?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: