
More and more companies that were once devoted to being 'open' are becoming increasingly closed. I appreciate that Stability AI releases these research papers.



It's hard to build a business on "open". I'm not sure what Stability AI's long term direction will be, but I hope they do figure out a way to become profitable while creating these free models.


Maybe not everything should be about business.


Agreed but this isn't the same as an open source library; it costs A LOT of money to constantly train these models. That money has to come from somewhere, unfortunately.


Yeah. The amount of compute required is pretty high. I wonder, is there enough distributed compute available to bootstrap a truly open model through a system like seti@home or folding@home?


The compute exists, but we'd need some conceptual breakthroughs to make DNN training over high-latency internet links make sense.


Distributing the training data also opens up attack vectors. Poisoning or biasing the dataset sent to each worker needs to be guarded against... but I don't think that's actually possible in a distributed model (in principle?). If the compute is happening off-server, then trust is required (and trust is not {efficiently} enforceable?).


Trust is kind of a solved problem in distributed computing. The various "@Home" projects and Bitcoin handle it by requiring multiple validations of each block of work for just this reason.


How do you verify the work of training without redoing the exact same training work? (That's the neat part: you don't.)

Bitcoin solves trust because each new block depends on the previous blocks. With training there is no such verification: prompt/answer pairs don't depend at all on other prompt/answer pairs (if they did, we wouldn't need to do the training in the first place).

You can rely on replicating the work and ignoring gross variations (as you suggest), but that adds a lot of compute overhead and is still susceptible to bad actors (though much more resistant).

There is no solid solution, afaik, for distributed training of an AI (Open Assistant is, I think, working on open training data?). If there is, I'll sign up.
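A toy sketch of that multiple-validation idea (purely hypothetical; the function names, rounding scheme, and quorum threshold are invented for this comment, not any real @Home protocol):

```python
import hashlib
from collections import Counter

def fingerprint(gradients, decimals=3):
    """Hash a coarsely-rounded gradient vector so tiny floating-point
    differences between honest workers still collapse to the same key."""
    rounded = tuple(round(g, decimals) for g in gradients)
    return hashlib.sha256(repr(rounded).encode()).hexdigest()

def validate_block(results, quorum=3):
    """Accept a block of work only if `quorum` independent workers
    returned matching gradients; anything else is discarded as suspect."""
    votes = Counter(fingerprint(g) for g in results)
    winner, count = votes.most_common(1)[0]
    if count < quorum:
        return None  # no agreement -- redo the block
    # return any result matching the winning fingerprint
    return next(g for g in results if fingerprint(g) == winner)

# Three honest workers agree; one bad actor submits a poisoned gradient.
honest = [0.12, -0.53, 0.98]
results = [honest, honest, [9.9, 9.9, 9.9], honest]
accepted = validate_block(results)
```

The overhead is exactly the problem the parent points out: every block here costs 4x the compute of doing the work once.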


There has been some interesting work when it comes to distributed training. For example DiLoCo (https://arxiv.org/abs/2311.08105). I also know that Bittensor and nousresearch collaborated on some kind of competitive distributed model frankensteining-training thingy that seems to be going well. https://bittensor.org/bittensor-and-nous-research/

Of course it gets harder as models get larger, but distributed training doesn't seem totally infeasible. For MoE transformer models, for example, separate slices of the model could be trained asynchronously and then combined with some retraining. You could have minimal regular communication about, say, the mean and variance for each layer, plus a new loss term dependent on these statistics to keep the "expertise" of each contributor distinct.
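A toy sketch of that statistics-exchange idea (purely illustrative; the functions, penalty shape, and numbers are invented for this comment, not taken from DiLoCo or Bittensor):

```python
import math
import random

random.seed(0)

def stats(xs):
    """Mean and variance of an expert's activations -- the only thing
    contributors would periodically exchange in this sketch."""
    m = sum(xs) / len(xs)
    v = sum((x - m) ** 2 for x in xs) / len(xs)
    return m, v

def distinctness_penalty(my_stats, peer_stats, weight=0.1):
    """Loss term for the idea above: penalise an expert whose activation
    statistics drift too close to a peer's, so each asynchronously
    trained slice keeps its own 'expertise'."""
    (m1, v1), (m2, v2) = my_stats, peer_stats
    closeness = math.exp(-((m1 - m2) ** 2 + (v1 - v2) ** 2))
    return weight * closeness

# Two experts trained on disjoint shards; only their summary statistics
# ever cross the network, never weights or raw data.
expert_a = [random.gauss(0.0, 1.0) for _ in range(100)]
expert_b = [random.gauss(2.0, 0.5) for _ in range(100)]
penalty = distinctness_penalty(stats(expert_a), stats(expert_b))
```

The penalty is maximal (`weight`) when two experts have identical statistics and decays toward zero as they diverge, which is the intended pressure.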


Forward-Forward looked promising, but then Hinton got the AI-doomer heebie-jeebies and bailed. Perhaps someone will pick up the concept and run with it - I'd love to myself, but I don't have the skillz to build stuff at that depth, yet.


I agree, but Y-Combinator literally only exists to squeeze the most bizness out of young smart people. That's why you're not seeing so much agreement.


>> but Y-Combinator literally only exists to squeeze the most bizness out of young smart people.

YC started out with the intent to give young smart people a shot at starting a business. IMHO it has shifted significantly over the years to more what you say. We see ads now seeking a "founding engineer" for YC startups, but it used to be the founders were engineers.


Squeezed all the alpha out of the idealists; now it's the business guys' turn.


If you agree, do you mind paying a few hundred thousand for my neural net training expenses?


The choice facing many companies that insist on remaining "open" is:

Do you want to 1. be right

or

2. stay in business

This is one of the reasons why OpenAI pivoted to being closed. Not because of greedy value extractors; because it was the only way to survive.


Training these big models is very very expensive. If they don't make money, and they run out of their own money, there will be no more SDXL.


>> Training these big models is very very expensive.

Which is why they are not the future. A big model that can generate a picture of anything in response to any input makes for a great website. It generates lots of press. But it is not a reasonable tool for content generation. If you want to produce content in a specific area or genre, the best results come from a model trained or fine-tuned in that area. So the big generalized AI, if you use it at all, would only be the framework on which you build your specialized tool. Building that specialized tool, such as something dedicated to images of a particular politician, does not require huge amounts of computation. That sort of thing can be, and is being, done by individuals.

I am waiting for a tool trained on publicly-accessible mugshots. It wouldn't be a very big project but could yield a tool to generate very believable mugshots of politicians.


I think it's unreasonable to expect a model for every possible use case. You would need billions of models, if not trillions.

Big generalist models are the future.


That was basically why openai was founded.

Too bad they decided to get greedy :-(


Most individuals like being able to acquire more goods and services. A lot follows from there


You're right, a lot follows from there. But I'm so tired of being a consumer. I just want to be me for a change. I'm so, so tired.


Depending on your background and circumstances, there are ways to opt out of the race to a greater/lesser degree. Moving to a cheaper city in your country, or a cheaper country altogether, is one of them. Finding a less stressful way of making less money is another.

I don't know you but I hope things work out :)


Thank you, appreciate it.

It's just hard being reminded that there's no escape hatch - we've welded them all shut for eternity. Being reduced to choices within a system, when the choice horizon never extends to the system itself and won't within my lifetime, makes me feel trapped.


well, know that you're not alone in that feeling.


Great, but aren't they simultaneously losing money and getting sued?


Maybe. Paychecks help with not being hungry, though.

I’d be happy if my government or EU or whatever offered cash grants for open research and open weights in AI space.

The problem is, everyone wants to be a billionaire over there and it’s getting crowded.


Maybe, but in image generation it's also hard to be closed.

The big providers are all so terrified they'll produce a deepfake image of Obama getting arrested or something, the models are so locked down they only seem capable of producing stock photos.


>> The big providers are all so terrified they'll produce a deepfake image of Obama getting arrested or something

I think the content they are worried about is far darker than an attempt to embarrass a former president.


AI-created child sexual abuse images ‘threaten to overwhelm internet’

https://www.theguardian.com/technology/2023/oct/25/ai-create...


The internet wouldn't have become as big as it is if it weren't for its open source model.

The internet has been taken over by capitalists, who have ruined it, in my opinion.


Are they still the commercial affiliate of the CompVis group at Ludwig Maximilian University?


But they used to let you download the model weights to run on your own machine... yet Stable Diffusion 3 is just in 'limited preview' with no public download links.

One has to wonder, why the delay?


Both SD 1.4 and SDXL were in limited preview for a few months before public release. This has been their normal course of business for about two years now (since founding). They do this to improve the weights via a beta test with less judgemental users before the official release.


How is a closed beta anything out of the ordinary? They know they would only get tons of shit flung at them if they publicly released something beta-quality, even if it were clearly labeled as such. SD users can be a VERY entitled bunch.


I've noticed a strange attitude of entitlement that seems to scale with how open a company is - Mistral and Stability AI are on very sensitive ground with the open source community despite being the most open.


If you try to court a community then it will expect more of you. Same as if you were to claim to be an environmentalist company then you would receive more scrutiny from environmentalists confirming your claims.


That's… not really relevant to Stability AI at all. SAI isn't "claiming" anything. They are show, not tell (well, mostly). They give a technology away for free the likes of which everybody else keeps very tightly locked behind SaaS. Then people bitch about said free technology.


I wonder why they didn’t call their company SharewareAI.


Moreover, people would start training on the beta model, splitting the ecosystem if it doesn't die entirely. There's nothing good in that timeline.


Happened already with the SDXL 0.9 weights leak. People started training off of that and it quickly became wasted effort.


Uff, that's a good point.


That's nothing new with Stability. Even 1.5 was "released early" by RunwayML because they felt Stability was taking too long to release the weights instead of just providing them in DreamStudio.

Stability will release them in the coming weeks.


Because first impressions matter. See the current perception of Gemini and its "woke parameters".



