Meta does stellar work in AI. I’m quite certain the recent DMCAs were a case of the left hand not knowing what the right hand is doing. This model is open source in every sense of the word, and they’ve now released two impactful models in the span of a month.
> left hand not knowing what the right hand is doing.
No it was trying to stop abuse. Unlike OpenAI, facebook can't get away with releasing a thing that is more than capable of making up libellous, racist or any other type of illegal or PR decimating thing.
The point of the really restrictive release was to at least try to limit that kind of issue.
Fortunatly for meta, most people who use the weights aren't the end user with a large twitter following, so the risk is (now) low.
I agree, Meta's work in AI has been really impressive, and it's encouraging how they are open-sourcing so much.
It's funny, back in the 90s I disliked Microsoft and thought they were inherently culturally opposed to open source. But big corporations really just follow their business incentives when it comes to open source.
It's not like Meta "loves to be open". But they would hate a world where all the powerful AI development happened on one of the big clouds. Imagine if every researchers used some closed-source API from Google or Azure or AWS. Meta would try to hire some AI people, and they'd be like, ugh if we take the job we have to use this weird Facebook-specific thing. (<cough>Flow</cough>)
So, supporting PyTorch and providing open source models is just a good business strategy for Meta. I'm glad to see it.
In particular, open source software prevents people from getting locked into any of the AWS, Linux, iOS, or Android ecosystems. All of those are popular software development ecosystems that Microsoft does not own or influence very much.
Sure. Any proof at all for that? I mean it's the obvious answer, but generally in business people who don't pay for things tend not to pay for things. In fact, in the enterprise having a big budget is the way you keep your corporate power. Free stuff isn't a way to keep your budget high.
Again, I would like to agree with your thinking. I just don't think the real world backs it up.
> I agree, Meta's work in AI has been really impressive, and it's encouraging how they are open-sourcing so much.
My usual reaction to anything Facebook releases is "yawn, they released yet another model that is practically useless because it's released under a non-commercial license", but I'm pleasantly surprised that this one seems to be actually liberally licensed! Hopefully this continues into the future.
That is likely true in the most pedantic sense, but in practice, if I create an algorithm that works by using a series of matrix transformations against a set of carefully chosen (read: "trained") matrices and I open-source only the matrix manipulation code but not the specially chosen matrices, I think there's a fair argument to say that I haven't open-sourced the entire algorithm.
In the phrase "the model is open-source in every sense of the word", that, IMO, must include the weights.
I think of two broad categories of database records: transactional data (data created while running the system) and domain data (data created during development and shipped to production as part of the release process).
The former type of data I wouldn't expect to ever be open-sourced. The latter type might or might not be, depending on the intent of open-sourcing the related system.
If I created a human language translation system that used a SQL database to store the dictionary (domain) data and claimed the system was open-source without shipping the domain data, I think people would rightly say that the system was not fully opened.
The actual source code is not important. The source code can be printed on a single A4 page, the valuable final product is the weights you get after running the code for fifty million dollars of compute time.
The weights seem to be under the same license, just distributed separately because it doesn’t make sense for the giant binary artifacts of training to be part of the source repository.
If I autogenerated a huge amount of C based on the weights, that added/multipled variables the same way the existing code+weights does, then would it be “source”?
This tweet chain is a pretty great overview: https://twitter.com/drjimfan/status/1643647849824161792?s=61...