Anthropic’s $5B, 4-year plan to take on OpenAI (techcrunch.com)
383 points by isaacfrond on April 11, 2023 | 484 comments



If Apple would wake up to what's happening with llama.cpp etc then I don't see such a market in paying for remote access to big models via API, though it's currently the only game in town.

Currently a Macbook has a Neural Engine that is sitting idle 99% of the time and only suitable for running limited models (poorly documented, opaque rules about what ops can be accelerated, a black box compiler [1] and an apparent 3GB model size limit [2])

OTOH you can buy a Macbook with 64GB 'unified' memory and a Neural Engine today

If you squint a bit and look into the near future it's not so hard to imagine a future Mx chip with a more capable Neural Engine and yet more RAM, and able to run the largest GPT3 class models locally. (Ideally with better developer tools so other compilers can target the NE)

And then imagine it does that while leaving the CPU+GPU mostly free to run apps/games ... the whole experience of using a computer could change radically in that case.

I find it hard not to think this is coming within 5 years (although equally, I can imagine this is not on Apple's roadmap at all currently)

[1] https://github.com/hollance/neural-engine

[2] https://github.com/smpanaro/more-ane-transformers/blob/main/...


If I were Apple I'd be thinking about the following issues with that strategy:

1. That RAM isn't empty, it's being used by apps and the OS. Fill up 64GB of RAM with an LLM and there's nothing left for anything else.

2. 64GB probably isn't enough for competitive LLMs anyway.

3. Inferencing is extremely energy intensive, but the MacBook / Apple Silicon brand is partly about long battery life.

4. Weights are expensive to produce and valuable IP, but hard to protect on the client unless you do a lot of work with encrypted memory.

5. Even if a high end MacBook can do local inferencing, the iPhone won't and it's the iPhone that matters.

6. You might want to fine tune models based on your personal data and history, but training is different to inference and best done in the cloud overnight (probably?).

7. Apple already has all that stuff worked out for Siri, which is a cloud service, not a local service, even though it'd be easier to run locally than an LLM.

And lots more issues with doing it all locally, fun though that is to play with for developers.

I hope I'm wrong, it'd be cool to have LLMs be fully local, but it's hard to see situations where the local approach beats out the cloud approach. One possibility is simply cost: if your device does it, you pay for the hardware, if a cloud does it, you have to pay for that hardware again via subscription.


> but it's hard to see situations where the local approach beats out the cloud approach.

I think the most glaring situation where this is true is simply one of trust and privacy.

Cloud solutions involve trusting 3rd parties with data. Sometimes that's fine, sometimes it's really not.

Personally - LLMs start to feel more like they're sitting in the confidant/peer space in many ways. I behave differently when I know I'm hitting a remote resource for LLMs in the same way that I behave differently when I know I'm on camera in person: Less genuinely.

And beyond merely trusting that a company won't abuse or leak my data, there are other trust issues as well. If I use an LLM as a digital assistant - I need to know that it's looking out for me (or at least acting neutrally) and not being influenced by a 3rd party to give me responses that are weighted to benefit that 3rd party.

I don't think it'll be too long before we see someone try to create an LLM that has advertising baked into it, and we have very little insight into how weights are generated and used. If I'm hitting a remote resource - the model I'm actually running can change out from underneath me at any time, jarring at best and utterly unacceptable at worst.

From my end - I'd rather pay and run it locally, even if it's slower or more expensive.


People have trusted search engines with their most intimate questions for nearly 30 years and there has been what ... one? ... leak of query data during this time, and that was from AOL back when people didn't realize that you could sometimes de-anonymize anonymized datasets. It hasn't happened since.

LLMs will require more than privacy to move locally. Latency, flexibility and cost seem more likely drivers.


You're still focused on trusting that my data is safe. And while I think that matters - I don't really think that's the trust I care most about.

I care more about the trust I have to place in the response from the model.

Hell - since you mentioned search... Just look at the backlash right now happening to google. They've sold out search (a while back, really) and people hate it. Ads used to be clearly delimited from search results, and the top results used to be organic instead of paid promos. At some point, that stopped being true.

At least with google search I could still tell that it was showing me ads. You won't have any fucking clue that OpenAI has entered into a partnering agreement with "company [whatever]" and has retrained the model that users on plans x/y/z interact with to make it more likely to push them towards their new partner [whatever]'s products when prompted with certain relevant contexts.


> Hell - since you mentioned search... Just look at the backlash right now happening to google. They've sold out search (a while back, really) and people hate it. Ads used to be clearly delimited from search results, and the top results used to be organic instead of paid promos. At some point, that stopped being true.

Only people in HN-like communities care about this stuff. Most people find the SEO spam in their results more annoying.

> At least with google search I could still tell that it was showing me ads. You won't have any fucking clue that OpenAI has entered into a partnering agreement with "company [whatever]" and has retrained the model that users on plans x/y/z interact with to make it more likely to push them towards their new partner [whatever]'s products when prompted with certain relevant contexts.

You won't know this for any local models either.


> You won't know this for any local models either.

But you will know the model hasn't changed, and you can always continue using the version you currently have.
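
For what it's worth, verifying that a local model hasn't silently changed is trivial: record a checksum of the weights file when you get it and compare on every load. A minimal sketch in Python; the path and recorded digest are placeholders:

    import hashlib
    from pathlib import Path

    def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
        # Stream the file so multi-GB weight files don't need to fit in RAM
        h = hashlib.sha256()
        with path.open("rb") as f:
            for chunk in iter(lambda: f.read(chunk_size), b""):
                h.update(chunk)
        return h.hexdigest()

    # Placeholder path and digest; use whatever you recorded at download time
    weights = Path("models/ggml-model-q4_0.bin")
    expected = "digest-you-recorded-when-you-downloaded-the-weights"

    if sha256_of(weights) != expected:
        raise RuntimeError("local model file has changed")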

> Most people find the SEO spam in their results more annoying.

This is the same problem. These models will degrade from research quality to mass market quality as there's incentive to change what results they surface. Whether that's intentional (paid ads) versus adversarial (SEO) doesn't matter all that much - In either case the goals will become commercial and profit motivated.

People really don't like "commercial and profit motivated" in the spaces some of these LLMs are stepping into. Just like you don't like SEO in your recipe results.


> But you will know the model hasn't changed, and you can always continue using the version you currently have.

Will you? What happens when an OS update silently changes the model? Again, this is one of those things only HN-types really care/rant about. I've never met a non-technical person who cares about regular updates beyond them being slow or breaking an existing workflow. Most technical folks I know don't care either.

> This is the same problem. These models will degrade from research quality to mass market quality as there's incentive to change what results they surface. Whether that's intentional (paid ads) versus adversarial (SEO) doesn't matter all that much - In either case the goals will become commercial and profit motivated.

Not at all. Search providers have an incentive to fight adversarial actors. They don't have any incentive to fight intentional collaboration.

> People really don't like "commercial and profit motivated" in the spaces some of these LLMs are stepping into. Just like you don't like SEO in your recipe results.

I disagree. When a new, local business pops up and pays for search ads, is this "commercial and profit motivated?" How about advertising a new community space opening? I work with a couple businesses like this (not for SEO, just because I like the space they're in and know the staff) and using ads for outreach is a pretty core part of their strategy. There's no neat and clean definition of "commercial and profit motivated" out there.


You wouldn't know that even if the model ran locally.


This happened with ChatGPT a few weeks ago.

https://news.ycombinator.com/item?id=35291112


Two issues though: leak of data from one party to another, and misuse of data by the party you gave it to. Most big companies don’t leak this type of data, but they sure as hell misuse it and have the fines to prove it.


Almost everyone is willing to trust 3rd parties with data, including enterprise and government customers. I find it hard to believe that there are enough people willing to pay a large premium to run these locally to make it worth the R&D cost.


Having done a lot of Bank/Gov related work... I can tell you this

> Almost everyone is willing to trust 3rd parties with data, including enterprise and government customers.

Is absolutely not true. In its most basic sense - sure... some data is trusted to some 3rd parties. Usually it's not the data that would be most useful for these models to work with.

We're already getting tons of "don't put our code into chatGPT/Copilot" warnings across tech companies - I can't imagine not getting fired if I throw private financial docs for my company in there, or ask it for summaries of our high level product strategy documents.


Yes, just like you might get fired for transacting sensitive company business on a personal gmail account, even if that company uses enterprise gmail.

Saying that cloud models will win over local models is not the same as saying it will be a free-for-all where workers can just use whatever cloud offering they want. It will take time to enterprisify cloud LLM offerings to satisfy business/government data security needs, but I'm sure it will happen.


But right now what incentive have I to buy a new laptop? I got this 16GB M1 MBA two years ago and it's literally everything I need, always feels fast, silent etc

1. the idea would be that now there is a reason to buy loads more RAM, whereas currently the market for 64GB is pretty niche

2. 64GB is a big laptop today; in a few years' time that will be small. And LLaMA 65B int4 quantized should fit comfortably (65B parameters at 4 bits each is roughly 33GB of weights, leaving room for the KV cache and the OS)

4. LLMs will be a commodity. There will be a free one

6. LLMs seem to avoid the need for finetuning by virtue of their size - what we see now with the largest models is you just do prompt engineering. Making use of personal data is a case of Langchain + vectorstores (or however the future of that approach pans out)
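
To make point 6 concrete, the retrieval pattern is simple enough to sketch without even pulling in LangChain. This is just one possible shape of it; the sentence-transformers embedder is an arbitrary choice and the documents are made up:

    import numpy as np
    from sentence_transformers import SentenceTransformer  # one possible embedder, not the only choice

    docs = [
        "Meeting notes: we agreed to ship the billing rewrite in June.",
        "Email from Alice: the conference talk got accepted.",
        "Todo: renew the passport before August.",
    ]

    embedder = SentenceTransformer("all-MiniLM-L6-v2")
    doc_vecs = embedder.encode(docs, normalize_embeddings=True)

    def retrieve(query, k=2):
        q = embedder.encode([query], normalize_embeddings=True)[0]
        scores = doc_vecs @ q  # cosine similarity, since vectors are normalized
        return [docs[i] for i in np.argsort(scores)[::-1][:k]]

    question = "when are we shipping billing?"
    context = "\n".join(retrieve(question))
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    # `prompt` then goes to whatever LLM you have, local or hosted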


1. You're working backwards from a desire to buy more RAM to try and find uses for it. You don't actually need more RAM to use LLMs, ChatGPT requires no local memory, is instant and is available for free today.

2. Why would anybody be satisfied with a 64GB model when GPT-4 or 5 or 6 might even be using 1TB of RAM?

3. That may not be the case. With every day that passes, it becomes more and more clear that large LLMs are not that easy to build. Even Google has failed to make something competitive with OpenAI. It's possible that OpenAI is in fact the new Google, that they have been able to establish permanent competitive advantage, and there will no more be free commodity LLMs than there are free commodity search engines.

Don't get me wrong, I would love there to be high quality local LLMs. I have at least two use cases that can't be done well (or at all) with the OpenAI API, and being able to run LLaMA locally would fix that problem. But I just don't see that being a common case, and at any rate I would need server hardware to do it properly, not a Mac laptop.


> 1. You're working backwards from a desire to buy more RAM to try and find uses for it.

I'm really not

I had no desire at all until a couple of weeks ago. Even now not so much since it wouldn't be very useful to me

But the current LLM business model where there are a small number of API providers, and anything built using this new tech is forced into a subscription model... I don't see it as sustainable, and I think the buzz around llama.cpp is a taste of that

I'm saying imagine a future where it is painless to run a ChatGPT-class LLM on your laptop (sounded crazy a year ago, to me now looks inevitable within few years), then have a look at the kind of things that can be done today with Langchain... then extrapolate


It sounds like we are in a similar position. I had no desire to get a 64gb laptop from apple until all the interesting things from running llama locally came out. I wasn't even aware of the specific benefit of that uniform memory model on the mac. Now I'm looking at do I want to do 64, 96 or 128gb. For an insane amount of money, 5k for that top end one.


The unified memory ought to be great for running LLaMA on the GPU on these Macbooks (since it can't run on the Neural Engine currently)

The point of llama.cpp is most people don't have a GPU with enough RAM, Apple unified memory ought to solve that

Some people have it working apparently:

https://github.com/remixer-dec/llama-mps


Thank you, that's exactly what I was looking for, specific info on perf.


I think the GPU performance for inference is probably limited currently by immaturity of PyTorch MPS (Metal) backend

before I found the repo above I had a naive attempt to get llama running with mps and it didn't "just work" - bunch of ops not supported etc
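
For anyone else attempting it, the usual dance is roughly this (a sketch assuming a recent PyTorch; whether a given model works depends on which ops the MPS backend has implemented, and the fallback env var papers over the gaps at a perf cost):

    import os
    os.environ.setdefault("PYTORCH_ENABLE_MPS_FALLBACK", "1")  # unsupported ops fall back to CPU

    import torch

    device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

    # Stand-in for a real model; fp16 keeps memory use closer to what llama needs
    model = torch.nn.Linear(4096, 4096).half().to(device)
    x = torch.randn(1, 4096, dtype=torch.float16, device=device)

    with torch.no_grad():
        y = model(x)
    print(device, y.shape)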


I think llama.cpp will die soon because the only models you can run with it are derivatives of a model that Facebook never intended to be publicly released, which means all serious usage of it is in a legal limbo at best and just illegal at worst. Even if you get a model that's clean and donated to the world, the quality is still not going to be competitive with the hosted models.

And yes I've played with it. It was/is exciting. I can see use cases for it. However none are achievable because the models are (a) not good enough and (b) too legally risky to use.


(A) is very use case dependent. Even with some of the bad smaller models now, I can see devs making use of them to enhance their apps (e.g. local search, summaries, sentiment, translations)

(B) llama.cpp supports gpt4all, which states that it's working on fixing your concern. This is from their README:

Roadmap Short Term

- Train a GPT4All model based on GPTJ to alleviate llama distribution issues.


> is instant and is available for free today.

It's free for the user up to a point, but it costs OpenAI a lot of money.

Apple is a hardware vendor, so commoditization of the software while finding more market segments is definitely something that'd benefit them.

OTOH, if they let OpenAI become the unrivaled leader of AI that ends up being the next Google, they end up losing on a topic they have wanted to lead for a long time (Apple has invested quite a lot in AI, and the existence of a Neural Engine in Apple CPUs isn't an accident)


"A lot of money" is a lot less money per user than to buy 64GB RAM to run an inferior model locally + energy and opportunity costs. The OpenAI APIs are super cheap for a single user needs. I expect them to be at least close to breaking even with their APIs pricing.


> "A lot of money" is a lot less money per user than to buy 64GB RAM

if OpenAI isn't able to get a couple hundred bucks over the typical lifetime of a computer, it means the added value they provide is very low (several times less than Spotify or Netflix, for instance), meaning they'll never be “the next Google”.

And if they are, it means it makes sense to buy it once instead of paying several times the price through a subscription.

> The OpenAI APIs are super cheap for a single user needs. I expect them to be at least close to breaking even with their APIs pricing.

“Close to breaking even” means the price you pay is VC-subsidized; the expected gross margin for this kind of tech company is more than 50%. Expect to pay a lot more if/when the market is captive. And this will scale linearly with your use of the technology.

> energy and opportunity costs

What opportunity cost?


> Expect to pay a lot more if/when the market is captive.

Yes, this is a possibility, but then again cloud computing became a commodity.

But I see why people would pay to have their own private and unfiltered models/embeddings.

> if OpenAI isn't able to get a couple hundred bucks over the typical lifetime of a computer, it means the added value they provide is very low (several times less than Spotify or Netflix, for instance), meaning they'll never be “the next Google”.

They don't have to worry about this today.

> What opportunity cost?

You could utilize the money and the time spent to do other things.


I think it’s quite likely that the RAM onboard these devices expands pretty massively, pretty quickly as a direct result of LLMs.

Google had already done some very convincing demos in the last few years well before ChatGPT and GPT-4 captured the popular imagination. Microsoft’s OpenAI deal I would assume will lead to a “Cortana 2.0” (obviously rebranded, probably “Bing for Windows”, “Windows Copilot” or something similar). Google Assistant has been far ahead of Siri for many years longer than that, and they have extensive experience with LLMs. Apple surely realises the position their platforms are in and the risk of being left behind.

I’m also not sure the barrier on iPhone is as great as you suggest - it’s obviously constrained in terms of what it can support now but if the RAM on the device doubles a few times over the next few years I can see this being less of an issue. Multiple models (like the Alpaca sets) could be used for devices with different RAM/performance profiles and this could be sold as another metric to upgrade (i.e. iPhone 16 runs Siri-2.0-7b while iPhone 17 runs Siri-2.0-30b - “More than 3x smarter than iPhone 16. The smartest iPhone we’ve ever made.” etc).


How much does 64GB of RAM cost, anyway? Retail it's like $200, and I'm sure it's cheaper in terms of Apple cost. Yet we treat it as an absurd luxury because Apple makes you buy the top-end 16" Macbook and pay an extra $800 beyond that. Maybe in the future they'll treat RAM as a requirement and not a luxury good.


and we know that more will be cheaper in future


With the integrated RAM, CPU and GPU on Apple Silicon, however it's done, it yields real perf results. I do think that probably has a higher cost than separately produced RAM. And even apart from that, because they have that unified memory model, unlike every other consumer device, they can charge for it. So 64, 96 or 128GB?


It's not done for perf results; the Xbox doesn't have RAM on package and somehow does 560 GB/s


The perf results I was referring to was the ability to run an llm locally (like llama.cpp) that uses a giant amount of ram in the gpu, like 40gig. Without this uniform memory model, you end up paging endlessly, so it's actually much faster for this application in this scenario. Unlike on a pc with a graphics card, you can use your entire ram for gpu. This isn't possible on the xbox because it doesn't have uniform memory as far as I know. So having incredible throughput still won't match not having to page.

Edit - I found an example from h.n. user anentropic, pointing at https://github.com/remixer-dec/llama-mps . "The goal of this fork is to use GPU acceleration on Apple M1/M2 devices.... After the model is loaded, inference for max_gen_len=20 takes about 3 seconds on a 24-core M1 Max vs 12+ minutes on a CPU (running on a single core). "


> 4. Weights are expensive to produce and valuable IP, but hard to protect on the client unless you do a lot of work with encrypted memory.

No, it'll be a commodity

Apple wouldn't care if the weights can be extracted if you have to have a Macbook to get the sweet, futuristic, LLM-enhanced OS experience


I've been looking into buying a Mac for llm experimentation - 64, 96 or 128gb of ram? I'm trying to decide if 64gb is enough, or should I go to 96gb or even 128gb. But it's really expensive - even for an overpaid software engineer. Then there's the 1 or 2 tb storage question. Apple list price is another $400 for that second tb of storage.

For 64gb of ram, you can get an m2 pro, or get 96gb which requires the upgraded cpu on the pro. The studio does 64gb or 128gb. But the 128 requires you to spend 5k.

I can't decide between 64 or 96 on m2 pro, and 128 on the studio. Probably go for 96gb. Also what's the impact of the extra gpu cores on the various options? And there are still some "m1" 64gb pros & studios out there. What's the perf difference for m1 vs m2? This area needs serious perf benchmarking. If anyone wants to work with me, maybe I would try my hand. But I'm not spending 15k just to get 3 pieces of hardware.

List prices:

64gb/2tb m2 12cpu/30gpu 14" pro $3900

96gb/2tb m2 max 12/38 14" pro $4500

128gb/2tb m2 max 28/48 studio $5200


Check out the LLaMA memory requirements on Apple Silicon GPU here: https://github.com/remixer-dec/llama-mps


I’m pretty sure you can get a purpose-built pc tower in that range. Why would you favor a Mac over that? A lot of this stuff only has limited support for MacOS.


The unified GPU/CPU memory structure on ARM Macs is very, very helpful for running these LLMs locally.


Is there a big difference in principle between that and the "shared video memory" that has long existed on cheap x86 machines?

…or is it just that the latter had a way too weak iGPU and not enough RAM for AI purposes, whereas the bigger ARM MACs have more GPU power and enough RAM (more than most affordable discrete graphic cards) so that they are usable for some AI models?


You can't get that much VRAM on a PC for a comparable price.


Running models locally is the future for most inferencing cycles. There's a lot in your numbered list trying to dissuade people that could be more accurate.

> 64GB probably isn't enough for competitive LLMs anyway

I am trying to be charitable, but this is pretty much not true. And the hedging in your statement only telegraphs your experience.


> Even if a high end MacBook can do local inferencing, the iPhone won't and it's the iPhone that matters

Doesn't the iPhone use the local processor for stuff like the automatic image segmentation they currently do? (Hold on any person in a recent photo you have taken and iOS will segment it)


Yes but I'm not making a general argument about all AI, just LLMs. The L stands for Large after all. Smartphones are small.


>One possibility is simply cost: if your device does it, you pay for the hardware, if a cloud does it, you have to pay for that hardware again via subscription.

Yeah but in the cloud that cost is amortized among everyone else using the service. If you as a consumer buy a gpu in order to run LLMs for personal use, then the vast majority of the time it will just be sitting there depreciating.


But then again, every apple silicon user has an unused neural engine sitting around in the SoC and taking a significant amount of die space, yet people don't seem to worry too much about its depreciation.


> 7. Apple already has all that stuff worked out for Siri, which is a cloud service, not a local service, even though it'd be easier to run locally than an LLM.

iOS actually does already have an offline speech-to-text api. Some part of Siri that translates the text into intents/actions is remote. Since iOS 15, Siri will also process a limited subset of commands while offline.


Chips have a 5-7 year lead time. Apple has been shipping neural chips for years while everyone else is still designing their v1.

Apple is ahead of the game for a change getting their chips in line as the software exits alpha and goes mainstream.


But they haven't exposed them to use. They are missing a tremendous opportunity. They have that unique unified memory model on the m1/m2 arms so they have something no other consumer devices have. If they exposed their neural chips they'd solidify their lead. They could sell a lot more hardware.


They are though. Apple released a library to use Apple Silicon for training via PyTorch recently, and has libraries to leverage the NE in CoreML.


> Apple Silicon for training via PyTorch recently

This is just allowing PyTorch to make use of the Apple GPU, assuming the models you want to train aren't written with hard-coded CUDA calls (I've seen many that are like that, since for a long time that was the only game in town)

PyTorch can't use the Neural Engine at all currently

AFAIK Neural Engine is only usable for inference, and only via CoreML (coremltools in Python)
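
The CoreML route looks roughly like this: trace the PyTorch model, convert it with coremltools, and ask for ALL compute units so the scheduler is at least allowed to place layers on the Neural Engine (whether it actually does is up to the black-box compiler mentioned upthread). A sketch with a toy network, not a real LLM:

    import torch
    import coremltools as ct

    # Tiny stand-in network; a real LLM needs a lot more massaging (see the ANE transformers repos)
    model = torch.nn.Sequential(
        torch.nn.Linear(768, 768),
        torch.nn.GELU(),
        torch.nn.Linear(768, 2),
    ).eval()

    example = torch.randn(1, 768)
    traced = torch.jit.trace(model, example)

    mlmodel = ct.convert(
        traced,
        convert_to="mlprogram",
        inputs=[ct.TensorType(name="x", shape=example.shape)],
        compute_units=ct.ComputeUnit.ALL,  # allow CPU + GPU + Neural Engine; placement is up to CoreML
    )
    mlmodel.save("tiny.mlpackage")
    print(mlmodel.predict({"x": example.numpy()}))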


Thank you! I wasn't aware of that. Let me research that. May 2022 announcement. Is this suitable for apps like llama.cpp, given it's a Python library? It appears to be a library, but they didn't document how to use the underlying hardware - I welcome more info.


iPhones have similar Neural Engine capabilities, obviously far more limited but still quite powerful. You can run some pretty cool DNNs for image generation using e.g. Draw Things app quite quickly: https://apps.apple.com/us/app/draw-things-ai-generation/id64...


1. Quadruple it.

2. see above

Should be cheap, or why else are Samsung, Micron and Kioxia whining about losses?

Maybe go for something like Optane memory while doing so.


Optane is sadly no longer being manufactured.


I know. That's why I wrote something like ;-)


> "If you squint a bit and look into the near future it's not so hard to imagine a future Mx chip with a more capable Neural Engine and yet more RAM, and able to run the largest GPT3 class models locally. (Ideally with better developer tools so other compilers can target the NE)"

Very doubtful unless the user wants to carry around another kilogram worth of batteries to power it. The hefty processing required by these models doesn't come for free (energy wise) and Moore's Law is dead as a nail.


Most of the time I have my laptop plugged in and sit at a desk...

But anyway, there are two trends:

- processors do more with less power

- LLMs get larger, but also smaller and more efficient (via quantizing, pruning)

Once upon a time it was prohibitively expensive to decode compressed video on the fly, later CPUs (both Intel [1] and Apple [2]) added dedicated decoding hardware. Now watching hours of YouTube or Netflix are part of standard battery life benchmarks

[1] https://www.intel.com/content/www/us/en/developer/articles/t...

[2] https://www.servethehome.com/apple-ignites-the-industry-with...


My latest mac seems to have about a kilogram of extra battery already compared to the previous model.


Apple’s move to make stable diffusion run well on the iPhone makes me think they’re watching this space, just waiting for the right open model for them to commit to.


I wonder how good the neural engine with the unified memory is compared to say intel cpu with 32gb ram. Could anyone give some insight?


There seems to be a limit to the size of model you can load before CoreML decides it has to run on CPU instead (see the second link in my previous comment)

If it could use the full 'unified' memory that would be a big step towards getting these models running on it

I'm unsure how the performance compares to a beefy Intel CPU, but there's some numbers here [1] for running a variant of the small distilbert-base model on the Neural Engine... it's ~10x faster than running on the M1 CPU

[1] https://github.com/anentropic/experiments-coreml-ane-distilb...
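
Numbers like that ~10x figure come from loading the same converted model with different compute-unit settings and timing predictions, roughly like this (a sketch; "model.mlpackage" and the input name/shape are placeholders for whatever you converted):

    import time
    import numpy as np
    import coremltools as ct

    def avg_latency(compute_units, runs=50):
        m = ct.models.MLModel("model.mlpackage", compute_units=compute_units)
        x = {"x": np.random.rand(1, 768).astype(np.float32)}  # match your model's input name/shape
        m.predict(x)  # first call includes compilation, so warm up
        start = time.perf_counter()
        for _ in range(runs):
            m.predict(x)
        return (time.perf_counter() - start) / runs

    print("CPU only   :", avg_latency(ct.ComputeUnit.CPU_ONLY))
    print("CPU+GPU+ANE:", avg_latency(ct.ComputeUnit.ALL))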


Siri was launched with a server-based approach. It wouldn't be surprising if Apple's near-term LLM strategy were to put a small LLM on local chips/macOS and a large model in the cloud. The local model would only do basic fast operations while the cloud could provide the heavyweight intensive analysis/generation.
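
If that hybrid setup happens, the routing itself is the easy part. A purely hypothetical sketch, where both backends and the "complexity" heuristic are made up for illustration:

    def run_local(prompt):   # stand-in for a small on-device model
        return "[local] " + prompt

    def call_cloud(prompt):  # stand-in for a hosted large model
        return "[cloud] " + prompt

    def answer(prompt, local_word_budget=64):
        # Crude proxy for "basic fast operation": short request, no explicit ask for deep analysis
        simple = len(prompt.split()) <= local_word_budget and "analyze" not in prompt.lower()
        return run_local(prompt) if simple else call_cloud(prompt)

    print(answer("set a timer for ten minutes"))
    print(answer("analyze this contract and summarize the key risks: ..."))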


I can see how the apple silicon memory system can help with LLMs, but a couple points of reality check:

- such amounts of memory are locked behind very expensive SKUs which most of the Mac userbase will not buy (<5% of new purchases, to be very conservative)

- not too long ago Apple would restrict the amount of RAM in their systems for their own reasons (source: https://9to5mac.com/2016/10/28/apple-macbook-pro-16gb-ram-li...)

- just like mid-2010s GPUs with 6-8GB of VRAM but little to benefit from it, I don't see the ML accelerators/GPU in current models being capable enough to make the most of the memory available to them.


That's today... think of the future

My first computer had 512KB RAM and 20MB was an expensive hard drive.

64GB Macbooks are currently an expensive 'Pro' novelty, they will be the vanilla of tomorrow

> i don't see the ml accelerators/gpu in current models being capable enough to make the most of the memory available to it

that's exactly my point (and apparently today's Neural Engine can't even take advantage of all the unified memory available)

until LLaMA there was no reason to have more than this, they probably imagined it would just run a bit of face-detection and speech-to-text on the side

but if they got serious and beefed it up it could be the next wave of computing IMHO


Update: https://www.bjnortier.com/2023/04/13/Hello-Transcribe-v2.2.h...

The iPhone 14 runs the Whisper model faster than an M1 Max, because it has a newer Neural Engine

I look forward to the M3 Macbook launch eagerly, while expecting mild disappointment


Can we re-invent SETI with such LLMs/new GPU folding/whatever hardware and re-pipe the seti data through a Big Ass Neural whatever you want to call it and see if we have any new datapoints to look into?

What about other older 'questions' we can point an AI lens at?



You need state of the art consumer tech to run a model comparable to GPT-3 locally at a glacial pace.

Or, you can use a superior GPT 3.5 for free.


"Dario Amodei, the former VP of research at OpenAI, launched Anthropic in 2021 as a public benefit corporation, taking with him a number of OpenAI employees, including OpenAI’s former policy lead Jack Clark. Amodei split from OpenAI after a disagreement over the company’s direction, namely the startup’s increasingly commercial focus."

So Anthropic is the Google-supported equivalent of OpenAI? Isn't the founder going to run into the same issues as before (commercialization at OpenAI)? How does Google not use Anthropic as either something commercial or nice marketing material for its AI offerings?


There may have been a disagreement, but now they're focused on profit, like everybody else. From the same article:

“Anthropic has been heavily focused on research for the first year and a half of its existence, but we have been convinced of the necessity of commercialization, which we fully committed to in September [2022],” the pitch deck reads. “We’ve developed a strategy for go-to-market and initial product specialization that fits with our core expertise, brand and where we see adoption occurring over the next 12 months.”


This is hilarious to me, in that the disgruntled departure just did a 180… how long until the next disgruntled spin-out for higher reasons chases the dollar too…


> This is hilarious to me, in that the disgruntled departure just did a 180… how long until the next disgruntled spin-out for higher reasons chases the dollar too…

The cynic in me wants to ask "What makes you think his departure was because of an anti-commercialisation position?"

My take (probably just as wrong as everybody else's take) is that he saw the huge commercialisation potential and realised that he could make even more money by having a larger stake, which he got when he started his own venture.


It’s pretty clear: the words say he was anti, and now the company he helped create apparently has marketing material all about commercialization. Unless he leaves tomorrow for the same reasons, it is quite hard to disbelieve that “cash rules everything around me”.


That does seem more likely. Let's hope his VP of research does the same thing to him (-:


If you read the parent comment in this thread you'd get an answer...


> If you read the parent comment in this thread you'd get an answer...

I looked and I didn't get an answer. hence my comment.

To clarify, we know what he said his reason was, we don't know if that really was his reason.

When people leave they very rarely voice the actual reason for leaving; the reason they give is designed to make them look as good as possible for any future employer or venture.


Everybody in the chip business was a spin-off from Fairchild. This is pretty common when a huge, new tech comes along.


To be fair I think he had the same realization that they had at OpenAI. Sam Altman has gone on the record saying it's basically impossible to raise significant amounts of money as a pure nonprofit and you aren't going to train cutting edge foundation models without a lot of cash. Anthropic is saying they literally need to spend $1B over 18 months to train their next Claude version.


Also, to chase those dollars while being on the leash of Google’s massive investment.

So they lost the plot on the altruistic mission within months of setting up shop, and now are just a pawn in a bigger game between other companies.


The same thing happened back in the processor arms race days and before that in the IC days. Ex-Fairchild engineers created a lot of the most durable IC and chip companies out there. Intel's founders were ex-Fairchild.


It's not about making money, it's about opening up the tech to the public (including source, weights, etc.)


It sure looks like it's about the money to me.


>So Anthropic is the Google-supported equivalent of OpenAI? Isn't the founder going to run into the same issues as before (commercialization at OpenAI)? How does Google not use Anthropic as either something commercial or nice marketing material for its AI offerings?

I think the unstated shift that has happened in the past few years is that we've gone from researchers thinking about Fourier transforms to efficiently encode positional data into vectors to researchers thinking about how to train a model with a 100k+ token batch size on a super-computer-like cluster of GPUs.

I can totally see why people believed the math could be done in a non-profit way, I do not see how the systems engineering could be.


More like FTX-supported. They got half a billion in investment from them according to an earlier blog post by Anthropic.


I believe I read somewhere that that investment may have to be returned.


The article says the shares are expected to be sold as part of the FTX bankruptcy process.


Hence the race to an AI smart enough to figure out a way to keep the money.


What does a policy lead do and how are they relevant to an early stage startup? I would be more interested in seeing which researchers and engineers join.


I assume it's basically this position: https://thriveml.com/jobs/product-policy-lead-e296c565

> As the Product Policy Lead, you will set the foundation for Anthropic’s approach to safe deployments. You will develop the policies that govern the use of our systems, oversee the technical approaches to identifying current and future risks, and build the organizational capacity to mitigate product safety risks at-scale. You will work collaboratively with our Product, Societal Impacts, Policy, Legal, and leadership teams to develop policies and processes that protect Anthropic and our partners.

> You’re a great fit for the role if you’ve served in leadership positions in the fields of Trust & Safety, product policy, or risk management at fast-growing technology companies, and you recognize that emerging technology such as generative AI systems will require creative approaches to mitigating complex threats.

> Please note that in this role you may encounter sensitive material and subject matter, including policy issues that may be offensive or upsetting.


Jack is pretty well known in the community since he runs not only the Import AI newsletter, but also has been a partner in the AI Index report. He also has a media background so is generally well connected even beyond his influential reach. Also, though not relevant to your question, he's a really nice guy :)


Curiously, Anthropic.com was launched in 2021, but a small custom software shop in Arizona around since the mid-late 90s had registered and been using Anthropic.ai in 2020 for a couple projects.

How does that name collision work?


“Hi we’re here to save humanity, and we’re stealing your name! We have a ton of lawyers, buckets of cash from Google to hire more lawyers, and if you don’t like it, you’re fucked. Now please enjoy being saved by us.”


Maybe they bought the domain name for a mutually agreeable price?


It's the trademark that matters, I thought (possibly naively), since anthropic.ai was registered in 2020 for a product built in 2019, it seems, and the Anthropic spin-off from OpenAI was formed in 2021 and seems to have purchased a squatted anthropic.com domain name then.

Kind of unsure how it all works.


Well you can also buy a trademark, right? Though I think different companies are allowed the same trademarks if the things being trademarked aren’t confusable


"public benefit".. Ah, they are so good. May be they will _open_ something after all.. ;)


yeah, LOL on that. Their idea of "public benefit" is of course that they benefit publicly, though for marketing purposes "public benefit" sounds nicer (just like the "Open" in "ClosedAI") because people would tend to emotionally associate something nicer with it.

Reminds me of the (possibly LLM-generated) marketing tirade of a voice faking text-to-speech service recently here on hn, which ended with: "We are thrilled to be sharing our new model, and look forward to feedback!":

https://news.ycombinator.com/item?id=35328698

… "share" yeah right… like: where can I download the model then? Of course they didn't mean to actually share their model but only to rent out remote access to it, but that doesn't sound as nice as "share".


If someone released a chatGPT/characterAI with NSFW content enabled it would eat into a big share of their users (and for characterAI, maybe take all of them). Seriously, look into what people are posting about when it comes to characterAI, and it's 80% "here's how to get around NSFW filters".

Unsure why nobody is filling this very, very obvious hole in AI tech.


The main reason why companies don't allow NSFW content is because of puritan payment processors that see that stuff and then go absolute sicko mode and lock people out of the traditional finance system.


It is amazing that in the year 2023, where things are possible that were science fiction until recently, we still rely on private payment processors, credit card companies, which extract fees for a service that doesn't have any technical necessity anymore. I think the reason is just inertia. They work well enough in most cases, and the fees aren't so high as to be painful, so there is little pressure to switch to something more modern.


> I think the reason is just inertia.

It is not just inertia; it is government malice. The government loves that there are effectively only two payment processors, because this lets them exercise policy pressure without the inconvenience of a democratic mandate.


Yes, financial companies mostly regulate themselves. They have lawyers telling them what regulators are likely to approve of, and make rules based on that for themselves and their customers. If they do something sufficiently bad, regulators go after them. That’s how banks get regulated, too.

That’s how most law works, actually. There’s a question of how detailed the regulations are, but mostly you don’t go to court, and if you do, whether it looks bad to the judge or jury is going to make a difference.

I’m wondering what you’re expecting from democracy? More oversight from a bitterly divided and dysfunctional Congress? People voting on financial propositions?


If entire classes of financial transactions can be blocked through backroom conversations between financial companies and regulators, don't you think that's bad for democracy? We have laws which allow the US to tackle money laundering issues and it's understandable that regulators would create regulations along those laws; they have a clear mandate to. It's not clear to me that other classes of transaction should be blocked based on amorphous dealings with regulators and companies.


Part of the issue is that it's usually American regulators setting these rules, but they're applied globally due to the more-or-less duopoly of credit card companies.


>we still rely on private payment processors, credit card companies, which extract fees for a service that doesn't have any technical necessity anymore

The technical necessity is there; for your chase-backed visa card to pull money from chase and deposit it into your shop's citibank, there needs to be some infrastructure. Whether a private company or the government provides this infrastructure is another story.

(Although if the government provided it, you could argue that there would likely be even more political headaches over what goes across the wire).


Do you have an estimate of the cost of the infrastructure required vs. how much credit card companies charge today?


In Q4 2022, Visa had a revenue of 7.94B, net income of 4.18B, and net profit margin of 52.66%.


Bank transfers are a thing. They don't require an intermediary credit card company. The problem is that currently such a transfer is usually slow because of software/protocol issues.


I think companies accept credit card payments because that’s what their customers want and companies want to get paid.


Yes, the current system is a good-enough solution, and any better alternative has to be not just better but so much better that it is worth the large cost of switching to a different solution. Game theoretically, it's an "inadequate equilibrium".


And conversely: For online payments, credit card payments are my least preferred method. But I still use them quite often, because everyone accepts them.


> switch to something more modern

such as?


I would be very surprised if something based on Blockchain or similar software doesn't offer a solution here. Another route would be to establish a protocol for near instantaneous bank transfers, and try to get a lot of banks on board. The immediacy of transfers seems to be the main reason why companies use credit card services, not buyer protection or actual credit.


I have no particular love of legacy systems (whether they be banks, imperialism, or Electoral Colleges), but what about your comment is plausible given the widespread recognition that blockchain technologies have been oversold, leading to widespread fraud and failure?

Maybe I’m missing something, but the above comment reminds me of the blockchain naïveté of 10 years ago. I don’t mean to be dismissive; I’m suspending disbelief long enough to ask what you mean in detail.


This is possible, and I don't have any deeper knowledge of cryptocurrencies / Blockchain. But payment systems don't seem to have a necessary connection to speculation and the high volatility which comes with holding a cryptocurrency. Maybe I overestimate the amount of problems those payment systems can solve.


There are permissioned "blockchains" which are just private ledgers, that banks could use with permission from the US gov't. These can be anything from a centrally run DB with ACLs or something like Hyperledger with permissioned P2P access. Whether you call it a blockchain or a DB with ACLs is immaterial; it's still much cheaper, faster, and pleasant to use this system over the current system of complex intermediaries in the US. Europe seems to have solved this problem with SWIFT.


There is a system called Faster Payments in the UK, which is "near instantaneous" between the UK banks which participate (most of them offering current accounts as far as i know).

But it is a permanent and final transfer, no easy charge backs like with a credit card, or fraud protection from debit cards.

You have to know which account you are paying into (sort code and account number), which is the main part of what Visa/Mastercard do. They are the layer in front of the bank account which means customers don't have to send money directly to an account.

I suppose now everyone has a smart phone it would be easier to hook up something like Faster Payments in a user friendly way with an app and a QR code/NFC reader that the merchant has. But Visa/Mastercard are entrenched obviously.


> The immediacy of transfers seems to be the main reason why companies use credit card services, not buyer protection or actual credit.

I think the above is quite wrong (with moderate confidence). Is the above claim consistent with survey data? It is my understanding that:

1. companies care a lot about risk reduction. This includes protection from chargebacks.

2. companies benefit when customers have credit: it enables more spending and can smooth out ups and downs in individual purchasing ability

3. Yes, quick transfers matter, but not in isolation from the above two.


Well, chargebacks are not possible for ordinary bank transfers. The problem is that they are too slow and not convenient enough. This is a software / standardization issue. Credit: PayPal is successful despite it only offering very short credits in order to ensure quick transfers. And in physical shops credit cards often seem to be no more than a convenient way to pay without cash. In Germany you can actually pay in all shops and restaurants with a form of debit card, which is just as convenient as paying with credit cards, but has lower fees for the shop, since there is no credit card company in the middle. As a result most people don't own a credit card. This doesn't work so well online though.


> I would be very surprised if something based on Blockchain or similar software doesn't offer a solution here.

There is, it's a layer-2 on Ethereum called zkSync. It's not totally satisfactory (the company that makes it can steal your money, centralized sequencer, etc), but it's pretty mature and works quite well. To replace Visa you want high throughput and low latency and zk-rollups like zkSync can provide both. (There are other options too, like Starknet, but AFAIK zkSync is the most mature.)


How long does it take before someone says "Blockchain"

Still faith in magic on HN


PayPal?


No, PayPal is basically a credit card company. It is an intermediary which gives short credits in order to achieve near instantaneous payments. And it extracts fees along the way.


Crypto obviously solved this. If you remove the speculation idiocy that surrounds it, yes crypto does work as an anti-censorship currency.

Someone is going to mention flashbots or something. “See this specific example proves…”


The main selling point to online shops would have to be a substantial reduction in fees compared to credit cards / PayPal. Most shops don't care about censorship since they wouldn't be affected anyway.


Yeah I don’t see that happening anymore. Crypto is always going to have fees and off-ramps, although I do think it’s helped create competition in the transfer space.

The real way crypto will work or not work is programmable money. If that works that will be huge, if it doesn’t then maybe someone will pick it back up 50 years from now.


The problem in this specific space is that many people don't want their payments to a NSFW company to be associated with them. Most blockchains makes this trivially traceable by design.

The ones that don't (eg Tornado cash) end up being used for money laundering so on/off ramps won't touch them. We'll see what happens with the ZK-based chains, but this seems a systematic problem that is difficult to fix.


Monero?



Wow, this seems to be just what I meant. Unfortunately it appears it is so far only widely supported by Indian banks. (In Germany there is a similar system, called GiroPay, but it hasn't really caught on yet. And it isn't even intended as an international solution.)


I think it's equally likely that they just don't want their product to be known as "the porn bot".


Why not? As long as it's not official. Bing was/is known as "the porn search engine" which never seemed to bother Microsoft.


I think the difference is that OpenAI wants to sell their text generation services to big companies that will show AI content directly on their platforms (think chat support bots), whereas Bing is selling eyeballs to advertisers (who also don't want their ads shown alongside porn by the by).

If OpenAI has the reputation of serving up porn to whoever asks, there's no way the Walmarts of the world will sign up.


It's also because the companies are backed by VCs. VCs get their money from limited partners like pension funds who don't want their money invested in porn.


I don’t buy this, if this was the reason then paid porn couldn’t exist, and we know that’s not the case.


It's because NSFW content has higher risks of chargeback and fraud (there's a reason their payment processors charge 20%+). Besides, companies don't want to be on the bad side of outrage; it only takes one mistake of processing a payment for child pornography and your name will be plastered everywhere as a child porn enabler.

Do you really think the execs at Visa and Mastercard are puritans and not profiteering capitalists that will process payments for NSFW content if they were able to?


Nothing to do with outrage.

Everything to do with one politician essentially getting their way by targeting a payment processor with legal shit concerning potential enablement of CP/CT. Nobody wants that kind of attention.


The whole US society seems more puritan while more capitalist at the same time, seen from this side of the pond. It’s a paradox I can’t really explain, any clues?


US society isn’t some anti-sex dystopia. It's average compared to the rest of the world; it’s just Europe that is super pro-nudity etc. and projects that onto everyone else. Like everything else, they think they are objectively right in their beliefs and systems.


Not allowing sex apps on AppStores and banks and credit cards refusing to process sex-related transactions seems pretty anti-sex to me.

Also, getting all bent out of shape at the image of a nipple, breast or pubic hair while not batting an eye at a person dying in evening TV movies seems a bit unbalanced.


> US society isn’t some anti-sex dystopia

Not a dystopia, but certainly US society has, shall we say, a very strange and complicated relationship with sex and nudity.


https://en.wikipedia.org/wiki/Protestant_work_ethic

Besides that: 'There is no such thing as society!'



Perhaps it’s as easy as “ethics and laws are not the same thing”. One can profit either way, but unethical profiteering may not be prevented by a law.


You're conflating capitalism and greed. Plenty of greedy people in non-capitalist systems.


> Plenty of greedy people in non-capitalist systems.

Totally agreed. But I am not placing any moral value on either greed or capitalism. I would think, however, that capitalists would not ignore such an obvious profit center as the sex industry. Thus my bafflement.


What you're missing is that by choosing this obvious profit center they risk a much larger profit center because of the backlash. It's not a moral thing, it's a calculated choice. That's why whoever takes this risk also charges a much higher fee to make up for the opportunity cost in other areas.


> But I am not placing any moral value on either greed or capitalism

That is a missed opportunity

* Capitalism: A system where who owns resources matters more than who needs them is a morally bankrupt system. A system where starvation and homelessness are acceptable outcomes

* Greed: Greed is bad for everybody. It concentrates scarce resources where they are not needed; that too is moral bankruptcy


Funny enough my country was starving under communism but we are living in plenty under capitalism. Since I lived under the alternative and I have seen its evilness, I will take capitalism any day - the very system that allowed and incentivized us to create those resources you are eyeing in the first place.

As for greed, I have yet to meet a person more greedy than the ones claiming to know where to direct those scarce resources they did not create, if only we’d give them the power to do so. Such high morals too, unlike those "morally bankrupt" capitalists who greedily built businesses, jobs, countless goods and services to only enslave us and enrich themselves, obviously.


I'm glad you chimed in with this. This is the point: capitalism knows self-interest exists, and creates a system to harness it. Communism and similar pretend greed doesn't exist, and creates overly powerful central bodies to make everything fair.



> I would think, however, that capitalists would not ignore such an obvious profit center as the sex industry

Because you're conflating capitalism and greed. Capitalism doesn't mean "do anything for money". It means "as much as possible, people get to decide among themselves how to allocate their money and time". Some of them will invest in anything, just as people in non-capitalist countries. Most will only invest in certain things.


But look at how investment in weed, which was once considered "drugs == bad", flourished after legalization, with ETFs and such. Lots of sex work, including porn, is legal afaik. However banks and other civilian gatekeepers (Apple App Store, etc.) keep stifling investment in it.


I'm sorry, I don't see how that relates to what I was saying.


> Capitalism doesn't mean "do anything for money".

In the abstract, perhaps not. The way it exists in the US, though, it means exactly that.


This very thread is exactly about how, in US, it doesn’t.


> Do you really think the execs at Visa and Mastercard are puritans and not profiteering capitalists that will process payments for NSFW content if they were able to?

Pornhub was blocked by Visa and Mastercard after an op-ed in NYT generated a lot of outrage


> Do you really think the execs at Visa and Mastercard are puritans and not profiteering capitalists...?

Yes, "and"


This comment right here can be shown to snobs who still denounce crypto btw


This is not an argument for crypto, it's an argument for better regulations so that processors don't make up their own rules.


> This is not an argument for crypto, it's an argument for better regulations so that processors don't make up their own rules.

Better (which I assume is your euphemism for "more") regulation isn't necessarily the answer, or even particularly the answer. Do you want to force payment processors to do work they don't want to do? Isn't there a word for that?


Not necessarily more. Better in this context means clearer and enforced.

PayPal is the prime example where it's operating very similar to a bank. You have an account with a balance and can send and receive money, but it doesn't see itself as a bank and in many countries doesn't have a bank license. At least in part this is done to avoid the regulatory work that comes with it.

I absolutely want to force payment processors to do work they don't want to do. For example, banks in Germany are forced to provide you with a basic bank account regardless of whether they want to or not. That's because a bank account is simply a must-have to take part in modern life. If PayPal decides it doesn't want to do business with you, for whatever arbitrary reason, you are effectively locked out of a lot of online stores that only accept PayPal as a payment method. There are plenty of examples of PayPal's really sketchy behaviour online. Every few months you can even see complaints on HN about it.


> it's operating very similar to a bank

We might be talking at cross purposes; I'm not sure! How is it like a bank?


PayPal offers you a virtual account that you can pay money into. You can use that money to make purchases online, send and receive money from friends or other businesses. In effect, it acts like a bank account. However, it's not an actual bank account. In Europe, any money you put into that account is also not insured by the government, like a normal account would be.

If I pay with a credit card, there are processes in place to deal with fraud and charge backs. PayPal is well known to automatically close accounts with little recourse to access the money on those accounts.

They should absolutely be regulated.


I agree they should be regulated.

But they are nothing like a bank.

The defining feature of a bank is credit creation: lending more money than they hold.

Unless I missed some news, PayPal does not do that.


This is what I was wondering - to my understanding the main reason banks need to be regulated is to stop them over-lending.


> Do you want to force payment processors to do work they don't want to do? Isn't there a word for that?

Public utility. That’s what payment processors are at this point, and they should be regulated as such.


If we think there's no more innovation to be had then this could happen, but I'm not sure that's the case.


Authoritarian solutions are very attractive today.


Reasonably regulating payment processors is far from authoritarian.

If you are on a scale like Visa and MasterCard you're not just any private company anymore. Just those 2 companies control well over 75% of the US market alone. Not having access to a debit/credit card today will effectively block you from taking part in many aspects of modern life. It's absolutely reasonable to place stipulations on what they can and cannot do.


I don't disagree with your objective, it's the path you are taking to get there. Legislating obedience is authoritarian, and it is a solution that many people love due to its simplicity.

Regulators love working with large businesses like your card duopoly, I don't think you will see much improvement.


In what sense do they control the market?


Well you can wait a lifetime or you can take control away from them with a couple clicks. The choice is obvious.


As a rule of thumb, whenever anyone says "the choice is obvious", the choice they're talking about is usually far from obvious.


crypto + NSFW generative AI = ????

that's not going to lead to a whole lot of black market images.


It certainly stretches the bounds of reason for me that you could put a person in an isolation chamber with a powerful computer with no network connection, and after they type a few words into it, if the output of the computer has certain qualities, they are now a felon and the output is illegal to possess.

But this seems like the world the “AI-regulators” seem to want.


You don't think it would be problematic for someone to create deepfake images of someone's kids in explicit sexual positions?

I certainly think that if the parents found out about it and the law wouldn't do anything about it, the parents would take the law into their own hands.

I'm sorry if this wasn't phrased very well. I just didn't know how else to make my point without being very specific.


A skilled artist can already easily do that and there's no law against it that I know of. (Granted, I haven't researched it because I'm neither an artist nor a pervert...)

Now, if they were drawn to resemble specific people and the producer of the "artwork" used them to harass those people, that's harassment. If they used them to groom other kids, that's an existing crime too. But my point was that the production of gross art in isolation, or the possession of it, didn't need to be criminalized. (Actual photographs of the same were criminalized because of the pretty decent assumption that minors were coerced, harmed, exploited. Probably all of the above.)


That's already illegal - you're using someone's image and likeness in a way they did not approve of.


Taking all payment in Ethereum doesn't matter when you have to pay for servers and domain names in fiat.


Servers and domains are one of the easiest things to buy with crypto.

I actually just migrated away from Hetzner last week (for unrelated reasons) to two new providers to whom I'm paying crypto (no KYC required) based on this list: https://bitcoin-vps.com/


Would be nice if you could pay in your own token.


I'm not sure to what you're referring by "your own token", but most do offer a range of popular tokens by using one of the 3rd party payment providers like Coingate.

I paid for my servers with some Litecoin that I usually use for small purchases because of the low fees.


Kind of like a free tier for token projects. If it gets traction you would need more servers, but the token would have value, so there you go.


Lots of work on that front no doubt, and not only wrt domains


If you check /g/ on 4chan (NSFW!!!) you'll see multiple threads on LLMs and LLM-driven chatbots for such content.

Already quite advanced topic these days, all kinds of servers, locally run models, tips & tricks discussions, people sharing their prompts and "recipes", and so on.

It's a whole new world out there, but I am not sure if such a niche (albeit a potentially really big one, see pr0n sites for example) is worth all the liability issues these big AI companies might face (puritan/queasy payment processors, parental controls, NSFW content potentially blocking some enterprise access, etc, etc). But it will probably all be captured by one or two companies that will specialize in such "sexy" chatbots. Doubt it will be OpenAI and Anthropic, they have their sights on "world domination".


At least for AI image generators it is a giant liability. As of two years ago AI-generated CSAM that is indistinguishable from original photographic CSAM is considered equally criminal. If users can spawn severely illegal content at will using your product you will find yourself in a boiling cauldron 30 seconds after going live.

Stable diffusion no longer uses even adult NSFW material for the training dataset because the model is too good at extrapolating. There are very few pictures of iguanas wearing army uniforms, but it has seen lots of iguanas and lots of uniforms and is able to skillfully combine them. Unfortunately the same is true for NSFW pictures of adults and SFW pictures of children.


I realize this is a highly taboo topic, but I think there are studies which suggest that access to (traditional) pornography reduces the frequency of rape. So maybe Stable Diffusion could actually reduce the rate of abuse? (Disclaimer: I know nothing about the empirical research here, I just say the right answer isn't obvious.)

Edit: It also seems that language models are a very different topic, since they block any erotic writing outright.


Yep. No sane company wants to deal with the legal and PR nightmare of their product being used to generate realistic CSAM based on a child star and/or photos taken in public of some random person's kid.


It's trivial to fine-tune llama to be NSFW if that's what you want.

But there's an entire universe of much more interesting apps that people don't want NSFW stuff in. That's why most foundation models filter it out.


Anything involving llama is not trivial - if I can't do it on my phone through a website, then you shouldn't expect anyone else to be able to do it. If your instructions involve downloading something, or even so much as touching the command line, it makes it a non-starter for 95% of users.

Get something on the level of character.ai and then you can tell me it's "trivial".


The context of this thread is a company spending $5B with a 4-year plan to build foundation models. One could do what I suggested in between days and months of work for a single person, including building a user-friendly front end.

In the context of this thread it is trivial.


I don't think that's the reason. You wouldn't get anything "NSFW" if you don't ask/prompt for it.


The point is, though, that the market potential is huge, and it would be a way to grow fast with cash flow. As a side effect you would probably develop the best NSFW filter in the world also.


> way to grow fast with cash flow.

Until the US payment processors cut you off, then you go bankrupt.


You're not wrong, but the consumer market for chatbots is (perceived to be) tiny and I think nobody really cares about it. The real money places like OpenAI are chasing is business money.


What's with the NSFW need? I'd understand if this is some image generator, but here? Is it some sexting, "romance", or is NSFW about something else altogether?


ChatGPT refuses to write erotic fan fiction.

Related: I still remember when I used GPT-3 (davinci in the OpenAI playground) for the first time a few years ago. The examples were absolutely mind blowing, and I wanted it to generate something which would surprise me. So I tried a prompt which went something like

> Mike peeked around the corner. He couldn't believe his eyes.

GPT-3 continued with something like

> In the dimly lit room, Vanessa sat on the bed. She wore nothing but a sheer nightgown. She looked at him and

Etc. I think I laughed out loud at the time, because I probably expected ghosts or aliens more than a steamy story, though of course in retrospect it makes total sense. I wanted it to produce something surprising, and it delivered.


Fanfiction. It is a huge deal to some people. Many prefer reading stories over watching porn, and we all know how big of a market pornography is.


I wonder whether this is actually an area that many women would push for, since they usually have a much weaker interest in (visual) pornography.


Per description of a certain item in Fallout 2, "if you need to ask, you don't want to know".

UPDATE:

While fanfiction might be behind this vocal minority, there could be other uses of LLMs, for example translation.

I don't go as far as "gender-swapping", because GPT4 swaps a man on a beach wearing only beach shorts for a woman wearing only beach shorts


Anything that drives the dopamine cycle is of interest to humans. Sex in all its forms is pretty motivating.


A very large subset of the people using generative AI are people using it for porn. And the people who make those AI models do not want them being used for porn.

Porn and AI is... problematic. Do you remember deepfakes? And how people used them first and foremost to swap other people's heads onto porn actors for the purpose of blackmail and harassment? Yeah. They don't want a repeat of that. Society has very specific demands of the people who make porn - i.e. that everyone involved is a consenting adult. AI does not care about age or consent.


"the VHS of AI"


Someday it will have to happen. There is just too much demand.


With so much NSFW on the web, how is NSFW chat with a computer even a thing? Genuinely curious what drives usage there.


I think there are FTC laws around this maybe.


I don't see how a competent legal team would ever sign-off on that.


Does anyone else see the "In X years' time we'll have something Y times better than the competition has today" as a bit of a red flag? I saw this before in a product plan and it flagged up 2 things that really worried me. Firstly, the competition were already ahead of us, and they're obviously going to continue developing their stuff, so it's great to promise our thing will be better than their thing today, but we're not competing against that; we're competing against what they'll have once time has passed. And secondly, by measuring yourself against the leading edge today you're eliding how much you actually need to improve. For example, Anthropic say they'll be 10x better than today's leading models in 18 months. That sounds achievable, right? (No, actually.) But you don't even just have to do that: because you aren't starting with the market-leading model, you have to catch up first and then 10x it. So are they promising 10x, or 20x, or 100x in 18 months?


The only reality in which Anthropic will take on OpenAI (et al.) would be if someone involved possesses some sacred knowledge regarding how to build an AGI system that is radically off the path that the current market is charging down (i.e. ever-larger GPU farms and transformer-style models).

I suspect this is not the case. The same hustlers who brought you crypto scams didn't just disappear into the ether. All of that energy has to eventually go somewhere.


It's not fair to compare them to crypto scams; this isn't trying to juice retail investors for their life savings.


>The only reality in which Anthropic will take on OpenAI (et. al.), would be if someone involved possess some sacred knowledge regarding how to build an AGI system that is radically off the path that the current market is charging down (i.e. ever-larger GPU farms and transformer-style models).

Training on more tokens with more GPUs isn't exactly rocket science. I assume the RLHF loop is complex, but training the base model itself is pretty well understood.


Yes, predicting the future is difficult. Nobody knows who will really be ahead in X years. Nobody knows how much OpenAI will improve either. But they have ambitions to improve a lot and their plans are credible enough to satisfy their investors.

Not sure what else you’re expecting? All VC investments have unquantifiable risks, but it doesn’t add up to a red flag if you like their chances.


With the recent scaling law papers that have come out, you can apparently predict with pretty high accuracy how good your model will be by plotting out the scaling curves. So performance at X FLOPs and Y tokens can be reasonably well known ahead of time.
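A minimal sketch of what such a prediction looks like, assuming a Chinchilla-style parametric fit L(N, D) = E + A/N^alpha + B/D^beta; the constants below are placeholders in the spirit of the published fits, not exact values:

  # Chinchilla-style loss prediction from parameter count N and token count D.
  # Constants are illustrative placeholders, not the published fit.
  def predicted_loss(n_params, n_tokens,
                     E=1.69, A=406.0, B=411.0, alpha=0.34, beta=0.28):
      return E + A / n_params**alpha + B / n_tokens**beta

  # Compare two hypothetical training runs on paper, before buying any compute:
  print(predicted_loss(7e9, 1.0e12))   # ~7B params, ~1T tokens
  print(predicted_loss(70e9, 1.4e12))  # ~70B params, ~1.4T tokens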


100% agree. You need to catch up to OpenAI for starters and then figure out how to outpace them.


claude-instant-v1 is one of the "best kept secrets".

It is comparable in quality to gpt-3.5-turbo, while being four times faster (!) and at half the price (!).

We just released a minimal Python library, PyLLMs [1], to simplify using various LLMs (OpenAI, Anthropic, AI21...), and as part of that we designed an LLM benchmark. All open source.

[1] https://github.com/kagisearch/pyllms/tree/main#benchmarks
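A minimal usage sketch (the exact call signatures here are an approximation; see the repo for the canonical interface):

  # Assumed PyLLMs usage; treat init()/complete() and the result fields as approximate.
  import llms  # pip install pyllms

  # API keys are expected via environment variables (e.g. ANTHROPIC_API_KEY).
  model = llms.init(model='claude-instant-v1')
  result = model.complete("Explain the trade-off between model size and latency.")

  print(result.text)  # the completion itself
  print(result.meta)  # assumed to include latency/token/cost metadata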


From my evals on nat.dev I found Claude Instant to give great responses and yes, on average 3-4x faster than 3.5, but one big difference atm is that anyone can sign up and get access to gpt-3.5-turbo right now, while Claude is still gated behind an invite/wait list. (I'm still waiting for access, for example.)


Exactly!

OpenAI are the only people who are shipping product like absolute maniacs. If I can’t use your fancy system, it doesn’t exist as far as I’m concerned. There’s a mountain of theoretical work, I don’t need a press release on top of it.

The game now is no longer theory, it’s shipping code. A 4-year plan means fuck all when OpenAI is not only ahead, but still running way faster.


I have Claude on Slack. It is far worse than ChatGPT. I'm presuming this is not the "claude-instant-v1" version; it is fast though. Any idea which version of Claude is in Slack?


I didn't know about Anthropic, so I just signed up for the waitlist, thanks for the heads-up!


Could PyLLMs connect to a locally running LLM (e.g, llama variant)?


Not yet but PRs welcome!


Google invested $400M into Anthropic

https://news.ycombinator.com/item?id=34663438

Investors include:

- Eric Schmidt (former Google CEO/Chairman), Series A

- Sam Bankman-Fried, lead investor in Series B

- Caroline Ellison, Series B

https://news.ycombinator.com/item?id=34664963


If Google invests money into this and then this company uses Google Cloud for compute, does the money really flow out of Google much? Everything stays in the family, but in theory they can have an investment that works.

Also, listing Sam Bankman-Fried does not help much, especially for a company hyping itself to be 10x better than a working competitor. I mean, since they built the competitor, probably their second project can be better, but it is pie in the sky in many ways.


Would lawsuits targeting SBF and CE put those investments at risk via clawbacks? Kind of like how many of Madoff's investors who made money were forced to return it.


I don’t think it’s much like what you describe because the direction is reversed. They say they expect the debtors to sell that investment over the next few years. Potentially they will sell it at a profit.


IANAL but it's not: they could be compelled to return whatever was invested (or however much of it remains). It's pretty unlikely given that it looks like a successful Anthropic investment is perhaps FTX creditors' best chance at decent recovery.


that's less than they invested in Magic Leap :)


AI models are in a race to the bottom and everybody inside Anthropic knows it. Besides OpenAI, with billions to spend plus a partnership with MS, there's also Google, Apple, Meta and Amazon who can afford to run losses on AI for years without blinking an eye.

And if that wasn't enough the Open Source world is releasing new models almost weekly now, for free.

Anthropic is putting on a big show to convince gullible investors that there's money to be made with foundational models. There's not. I expect a big chunk of the raised money to go out the door in secondary sales and inflated compensations. Great if you're working at Anthropic. Not great for investors.


The truth is, and this applies to all companies regardless of size, that you don't have to be first, best, biggest, fastest, or most well-known in order to win market share that out-paces your investment. The AI pie is going to be very, very big. To estimate this size, let's take McKinsey's rough estimates of job displacement (~30% of ~60% of jobs, ~20% of work) and use that to estimate the actualized [US, apologies] GDP that can at some point be attributed to AI: it is in the 4-5 trillion range using today's figures.

To say a market that large will be owned by only 4-5 companies doesn't make sense. Let's take the PC market for example: there are roughly 6 companies that make up ~80% of the market, sure. However, let's look at a tiny participant compared to the total market (~65B): iBuyPower at rank #77 had sales of 40MM, or 0.06% (small, expected) of the market, with a much smaller capital investment. If we look at this percentage compared to 5T, we would be at 3B. While the 5B investment stated in the headline could result in a lower ranking and smaller share, the point stands that there is still a lot of money to be made on the long tail. Even if Anthropic fails, there will be other companies with similar infusions that succeed.
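A quick back-of-envelope check of that comparison, using the illustrative figures above (not verified market data):

  # Rough sanity check of the long-tail argument; all numbers are illustrative.
  pc_market = 65e9           # ~$65B total PC market
  small_player_sales = 40e6  # ~$40MM sales for the rank-#77 participant
  share = small_player_sales / pc_market
  print(f"share of PC market: {share:.4%}")  # ~0.06%

  ai_market = 5e12           # ~$5T of GDP hypothetically attributable to AI
  print(f"same share of the AI market: ${share * ai_market / 1e9:.1f}B")  # ~$3B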


The AI (LLM) market as a whole is very immature; trying to guess today what it will look like in a decade based on the investments/behaviour of the first couple of movers is pretty foolish. Even predicting for a specific submarket (i.e., consumer LLM products like ChatGPT) is hard enough. Who knows what other categories could develop and be dominated by companies who narrow in on them, once the R&D progress starts flatlining like it always does.


The best way to predict the future is to invent it.

- Alan Kay


It is not clear if (i) a lot of the surplus will be captured by the AI providers and (ii) that the impact will be anywhere as big as people now guess/want it to be. Making a bet on the future is fine, of course.


My question would also be what kind of insight McKinsey can provide here. What, if anything, do they know about AI that we don't know?


You don't need to just take one source. OpenAI authored their own paper [1] on the economic impacts of just LLMs: "Our findings reveal that around 80% of the U.S. workforce could have at least 10% of their work tasks affected by the introduction of LLMs, while approximately 19% of workers may see at least 50% of their tasks impacted."

Goldman Sachs Research just published their own analysis as well. [2] Their conclusions are "As tools using advances in natural language processing work their way into businesses and society, they could drive a 7% (or almost $7 trillion) increase in global GDP and lift productivity growth by 1.5 percentage points over a 10-year period." and "Analyzing databases detailing the task content of over 900 occupations, our economists estimate that roughly two-thirds of U.S. occupations are exposed to some degree of automation by AI. They further estimate that, of those occupations that are exposed, roughly a quarter to as much as half of their workload could be replaced."

[1] https://arxiv.org/pdf/2303.10130.pdf

[2] https://www.goldmansachs.com/insights/pages/generative-ai-co...


From [1]: "In our study, we employ annotators who are familiar with LLM capabilities. However, this group is not occupationally diverse, potentially leading to biased judgments regarding LLMs’ reliability and effectiveness in performing tasks within unfamiliar occupations."

From [2]: "Analyzing databases detailing the task content of over 900 occupations, our economists estimate that roughly two-thirds of U.S. occupations are exposed to some degree of automation by AI."

These are people who do not understand the jobs they are claiming AI will do. Ultimately, I think they are not doing much better than guessing.


We’ve got a lot of data scientist talent but I wouldn’t put a lot of stock in this particular estimate. If McK is gonna produce a novel insight it’s usually derived from having the input of many businesses across an industry and experience looking at their problems. It’s hard to imagine this one isn’t more or less made up due to the number of assumptions required.


Likely not much and assuredly wrong, I just wanted to ground my argument with numbers that came from people who presumably did more research than I was willing to do for an HN post.


If anything McKinsey has a lot to gain from exaggerating the numbers so more companies come to them for AI solutions or whatever their next consulting product is.


Although a large total addressable market (TAM) is very alluring, know that most markets are dominated by a few players. For example, sugary beverages (Coca-Cola), office software (Microsoft), or luxury sports cars (Ferrari). Exceptions are markets where companies cannot find a moat, such as air travel or farming. In those markets, profit margins are thin.

At this point in time, it's hard to tell whether moats will arise around large language models. Peter Thiel thinks so, or he wouldn't have invested (see his "Competition Is for Losers" presentation).

What is unlikely is that semi-good companies will thrive. Maybe for a few years but at some point the smaller players will be pushed out of the market or need to find a specific niche. Just look at cars to see this. Around 1900 there were hundreds of car brands.


These studies seem to be largely focused on job displacement. There is a reasonable likelihood that AI grows the overall economy.

I think we forget that our perspective of AI now is comparative, probably to that of a preindustrial worker worried about machines. Displacement, sure, but complete replacement seems a non-nuanced view of how it may all turn out.


Can’t find this study, have a link?

> let's take McKinseys rough estimates of job displacement (~30% of ~60% of jobs, ~20% of work)


PCs are hardware which have a minimum cost to be produced. Now do the same calculation for search engine or computing clouds.


The counter argument is that it's a growing market where any early entrants will be lifted with the tide and can probably yield enough profit from spillover hype for investors to make their investments back.


I can already run gpt4xalpaca on my PC, a model that is not-bad-at-all and is completely uncensored (i.e. does things that ChatGPT can't do). I think it's true that LLMs are racing to the bottom, and will be even more so once they can fit as a peripheral to every computer. Whoever is investing in this to monopolize has not thought it through.


It’s astonishing to me that people seem to believe the llama models are “just as good” as the large models these companies are building, and most people are only using the 7B model, because that’s all their hardware can support.

…I mean, "not-bad-at-all" depends on your context. For doing real work (i.e. not porn or spam) these tiny models suck.

Yup, even the refined ones with the “good training data”. They’re toys. Llama is a toy. The 7B model, specifically.

…and even if it weren’t, these companies can just take any open source model and host it on their APIs. You’ll notice that isn’t happening. That’s because most of the open models are orders of magnitude less useful than the closed source ones.

So, what do you want, as an investor?

To be part of some gimp-like open source AI? Or spend millions and bet you can sell it B2B for crazy license fees?

…because, I’m telling you right now; these open source models, do not cut it for B2B use cases, even if you ignore the license issues.


You know what I believe is also a toy model? ChatGPT Turbo; you can tell by the speed of generation. And it works quite well, so small size is not an impediment. I expect there will be an open model on the level of ChatGPT by the end of the year, because suddenly there are lots of interested parties and investors.

Eventually there will be a good enough model for most personal uses, our personal AI OS. When that happens there is a big chance advertising is going to be in a rough spot - personal agents can filter out anything from ads to spam and malware. Google better find another revenue source soon.

But OpenAI and other high-end LLM providers have a problem - the better these open source models become, the more market they cut underneath them. Everything open source models can do becomes "free". The best example is Dall-E vs Stable Diffusion. By the next year they will only be able to sell GPT4 and 5. AI will become a commodity soon, OpenAI won't be able to gate-keep for too long. Prices will hit rock bottom.


> I expect there will be an open model on the level of chatGPT by the end of the year because suddenly there are lots of interested parties and investors.

I really don't think you understand just how absurdly high the cost is to train models of this size (which we still don't know for sure anyways). I struggle to see what entity could afford to do this and release it at no cost. That doesn't even touch on the fact that even with unlimited money, OpenAI is still quite far ahead.


Still cheaper than a plane, a ship or a power plant, and there are thousands of those.


And how many are given away for free?


I think you're conflating speed of inference/generation with optimization. gpt-3.5-turbo does not fit on a single GPU unlike the "toy" models.


I think that Alpaca 30 billion is pretty competitive with ChatGPT except on coding tasks. What benchmarks are you using to make your determination about suitability for B2B?


gpt4xalpaca is 13B


7? 13? Who cares? It’s an order of magnitude smaller than the GPT models. It’s a toy.


This is a repeat of the early GPU era.

It's not the software or hardware that will "win" the race, it's who delivers the packaged end user capability (or centralizes and grabs most of the value along the chain).

And end user capability is comprised of hardware + software + connectivity + standardized APIs for building software on top + integration into existing systems.

If I were Nvidia, I'd be smiling. They've been here before.


Nvidia: just as the sun starts setting on crypto mining, the foundation model boom begins. And in the background of it all, gaming grows without end.


If you've got a choice, sail your ship on a rising tide! And if you can spread the risk over multiple rising tides, so much the better!

My dad told me a quip once: "It's amazing how much luckier well prepared people are."


> I can already run gpt4xalpaca on my PC

You can also run your stack on a single VPS instead of the cloud, GIMP instead of Photoshop, OpenStreetMap instead of Google Maps, etc.

There will always be companies who can benefit from a technology, but want it as a service. In addition, there will be a lot of fine-tuning of LLMs for the specific use case. It looks like OpenAI is focusing a lot on incorporating feedback into their product. That's something you won't get with open-source models.


Imagine you're a tech company that pays software engineers $200K/year. There is a free open-source coding model that can double their productivity, but a commercial solution yields a 2.1x productivity improvement for $5000 annually per developer. Which do you pick?
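Framed as a rough calculation with the hypothetical numbers above (ignoring integration, support, and data-governance costs on both sides):

  # Naive build-vs-buy framing; every figure here is hypothetical.
  salary = 200_000
  oss_productivity = 2.0   # free open-source model
  paid_productivity = 2.1  # commercial model at $5,000 per developer per year
  license_cost = 5_000

  # Value the extra 0.1x naively as a share of salary:
  extra_value = (paid_productivity - oss_productivity) * salary
  print(f"extra output per developer: ~${extra_value:,.0f}/year vs a ${license_cost:,} license")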


Not sure if parent had a certain answer in mind, but my answer is OSS because (1) I can try it out whenever I want, and (2) I don't have the vexing experience of convincing the employer to purchase it.


That's the endless "build vs buy" argument. And countless businesses are buying.


I don't think it's the same thing, at least for me.

In the GP's scenario, I wouldn't be building either piece of software.


The existence of the models is making programmers cheaper rather than the reverse.

But I think it is underestimated how important it is for the model to be uncensored. ChatGPT is currently not very useful beyond making fluffy posts. As a public model, they won't be able to sell it for e.g. medical applications, because it will have to be perfect to pass regulators. It cannot give finance advice. Censorship, for once, is proving to be a liability for a tech company.

In-house models OTOH can already do that, and they can be retrained with additional corpus or whatever. And it's not even like they require very expensive hardware.


I find your argument persuasive; companies should spend extra for the significant productivity gain. But then again, from experience, most companies don't give you the best tools the market has to offer.


Yeah, but only for very simple tasks, with the 2k token limit. Let alone the fact that it can't access the internet, or have more powerful extensions (say Wolfram).


Alpaca is the Napster of LLMs


That’s an argument, but I don’t buy it. Models are a commodity. You don’t get VC valuations and returns from raising $5B for a grain startup.

The application of AI to business problems will be lucrative, but the models are just a tool and the money will come from the domain-specific data (i.e. user and business data), which Microsoft, Google, and even Meta are positioned for. Having a slightly better model but no customer data or domain expertise doesn’t seem like a great recipe.

Then again it’s AI, so there’s more uncertainty than the commodity market. Maybe Anthropic will surprise and I’ll be as wrong about this as I was about OS/2 being the future. But I’m very skeptical.


I don't think the grain market is growing as fast as the AI market


Don't confuse the AI market with the foundational LLM model market.

Think of LLMs as the understanding component in the brain; once you can understand instructions and what actions need to happen from those instructions, you're done.

The rest is integrations, the arms, legs and eyes of langchain. Then memory and knowledge from semantic search, vector databases and input token limits.


This is the real answer.

The LLM is but the core of the entire ecosystem. Just like how MLOps is 99% of the work, choosing an LLM is 1% of the effort in the final product.


Plus, first-mover advantage has consistently been shown not to be a winning strategy on its own; there are a ton of cases where the first winner gets taken over by a new entrant once the market matures (Friendster being the classic example). Often the later companies learn from the mistakes of the first player.

R&D heavy markets might have some different characteristics but it's still way too early to say with AI.


What do they have besides an admittedly very cool name?


Anthropic is currently the only company that can compete with OpenAI (because they have comparable expertise). The rest (Google, Meta, Microsoft, etc) are still pretty far behind.


This approach didn’t work for Docker.


> can probably yield enough profit from spillover hype for investors to make their investments back.

The correct term for this is “pyramid scheme”.


No, this is more "everyone is selling X, let's get in the business of X". On the other hand, yes, some will miss the boat and lose money.


I interpreted “spillover hype” as meaning “more investors coming in in future rounds” (ie pyramid scheme), but it’s possible that’s not what the commenter intended.

But if early investors only profit due to late investors pouring money in, that’s by definition a pyramid scheme.


Nope


> convince gullible investors that there's money to be made with foundational models. There's not.

This is a ridiculously myopic statement. Foundation models are an extremely powerful technological advancement and they will shake the global economy as very few things have in human history. It's hard to imagine how this is not obvious to everyone right now, especially here in this forum.


Plus you're also investing in getting the talent together in the same building. Even if the foundational models aren't the money maker there's still a ton of opportunity having the best experts at building those models working together and figuring out which branches that LLMs spawns can turn into real markets.

It's a high risk investment at this stage but the money is being thrown at the people as much as the current business plan.


None of which means the money will go to the people making the models.

The game theory logic doesn't care about the labels "OpenAI" or "Anthropic" or any of the others, it's the same if you switch it around arbitrarily, but this is easier to write about if I focus on one of them:

At some point, someone will reproduce GPT-3.5 and ChatGPT, given how much is known about them. When that happens, OpenAI can't make any significant profit from it. GPT-4 might remain sufficiently secret to avoid that, but the history of tech leaks and hacks suggests it too will become public. Even if it does remain behind closed doors, there is a further example in that DALL•E 2 is now the boring 3rd horse in the race between Stable Diffusion and Midjourney, and the same may happen with the GPT series of LLMs.

The models leaking or being superseded by others implies profit going to increased productivity in the general economy without investors getting a share.


DALLE2 is boring because pretty much everyone at OpenAI has been busy developing the next GPT model. It was simply not a priority for them. And when GPT4 leaks (or is reproduced) they will most likely have GPT5. In this race it’s far more important to be the closest to AGI than to make money now.


No one here is disputing that.

The question is whether whomever builds them can make a profit doing so, or will they just end up being the suckers that everyone who actually makes money piggybacks off. It's really not clear at the moment.


It's a bit early to tell, isn't it?

If we get more unexpected emergent abilities by scaling the model further, things could get very interesting indeed.


Would you rather invest in super-intelligent AGI or NOT invest in super-intelligent AGI? Especially if one of those emergent abilities is deciding you're either with me or against me... lol


That would be the AI version of Pascal's Wager [0]

[0] https://en.wikipedia.org/wiki/Pascal%27s_wager


It still remains to be seen if LLMs will lead to AGI.


There need to be some difficult barriers to entry beyond having the money to spend on training FLOPs in order for a startup to compete.

I have no idea if there are or there aren’t, but that’s the big question.


I mean, this is just the beginning. Just wait till we get actual scifi robots in the next year or so.

FWIW, I do find that Claude (Anthropic's GPT) is often better than GPT4 -- and very fast. Entrants can compete on price, safety, quality, etc.


And a big moat is going to be safety... and specifically configuration of safety.

Wouldn't be surprised at all if the major API-based vendors start leaning in on making their safety config proprietary.

If a business has already sunk XXXX hours into ensuring a model meets their safety criteria for public-facing use, they'd rather upgrade to a newer model from the same vendor that guarantees portability of that, versus having to reinvest and recertify.

Ergo, the AI PaaS that dominate at the beginning will likely continue to dominate.


Excellent point.

Fine-tuning is at a low point now, but I expect this to create a moat for the same reasons.


I find that Claude is more conversational (better fine tuning), but not as smart as even ChatGPT.

Prompt:

  The original titles and release years in the Harry Potter series are:

  Philosopher's Stone (1997)
  Chamber of Secrets (1998)
  Prisoner of Azkaban (1999)
  Goblet of Fire (2000)
  Order of the Phoenix (2003)
  Half-Blood Prince (2005)
  Deathly Hallows (2007)
  
  Given this, generate a new Harry Potter title, using only the words found in the existing titles. Avoid orderings in the original titles. You may add or remove plurals and possessives.
Results:

ChatGPT: Blood Chamber of the Phoenix's Prisoner

Claude-instant: Chamber Prince Half-Blood Phoenix


ChatGPT is more comparable to what Quora/Poe calls Claude+ - slower/more expensive/smarter. Claude-instant is closer to GPT-turbo in that tradeoff space.


Both bots are free on poe.com, so one is not more expensive than the other.


The question for me is whether they understand complex concepts and can apply them in new areas.

So when I’m doing quantum computing work, I go back and forth between Claude and GPT4 and both complement the other very well.


I find the opposite, claude-instant seems to generally give me better results for my use case. FWIW gpt-3.5-turbo is good too, just not quite as good.


Is it possible to test it somewhere?


https://poe.com/Claude-instant

It also provides a ChatGPT interface, and a number of other models.


The fact is nobody can risk not owning a piece of the foundational models. There is waaaay too much upside risk that they will tots dominate the market.

I mean, maybe they won't like you say, but what if they do? Then you're probably screwed. Better to gamble a few billion, imho.


This sounds like saying "internet search engines are a race to the bottom" 20 years ago without realizing that someone may end up as Google and obtain market dominance for a decade or so.

It also sounds like you believe you have defined the bounds for what AI will be, and figure we'll just iterate on that until it's a commodity. I don't think AI will be that static. We're all focused on Stable Diffusion and LLMs right now, but the next thing will be something else, and something else after that. As each new technique comes out (assuming they are all published), we'll see quick progress to incorporate the new ideas into various implementations, but then we'll hit another wall, and suddenly big budgets and research teams may matter again.

tldr is that it is way too early to make the cynical claim you are making.


It all depends on if they're in the business of producing the whitepapers that drive ML advancements in the first place. AI is far from a solved problem and whoever gets to it first wins. We have GPT because of a billion dollars worth of data, not algorithms.


The smart move could be an open-core approach. Release the models, but have the best engineering stack to run the APIs as a service.


But the models are the expensive part to train. Running the models is relatively easy.


The best models will always be closely guarded and have the best outputs, it’s the watered down models that are fighting for scraps.


Who says that the AI model is the business?


Will fine tuned models be lucrative then?


Ahh the old Lyft / Avis strat.


Correct. Stability.ai (Stable Diffusion), Apple (it won't be surprising to see them announce on-device LLMs with Apple Silicon support), Meta (LLaMA), etc. are already at the bottom and at the finish line, with their AI models given away for free.

O̶p̶e̶n̶AI.com will eventually have to raise their prices, which is bad news for businesses that are not making enough money and are still sitting on their APIs, as O̶p̶e̶n̶AI.com themselves are running up huge costs for their AI models in the cloud for inferencing.

Anthropic is just waiting to be acquired by big tech and the consolidation games will start again.


This sort of hand-wavy generalization about such a broad and ill-defined market seems very naive/closed-minded.

If you're squabbling over how much OpenAI charges today for an API that barely just launched, and from which we have barely scratched the surface of applications... I don't know, that seems like a failure to think broadly, and it assumes the market today is what it will look like in 5 years.

There could be a ton of lucrative businesses which subsidize those operating costs. It doesn't have to be a mega-company like Google that floats it indefinitely off their ad business, or whatever other scheme. We have no idea what the value of those APIs are or if the API is the real business they (and others) are going to be relying on in the long term.


>O̶p̶e̶n̶AI.com will eventually have to raise their prices which is bad news for businesses not making enough money and still are sitting on their APIs as O̶p̶e̶n̶AI.com themselves are running up huge costs for their AI models in the cloud for inferencing.

Do you have data supporting this or is it just speculation? Given we don't even know how many parameters GPT-3.5 and GPT-4 have, let alone how efficiently they are implemented, I don't see how we can go about coming up with an accurate estimate for the cost per token.


Aside: I love the O̶p̶e̶n̶AI.com thing. I got caught off guard at least twice!


Wow I didn't know ai.com redirects to chat.openai.com. How long has it been doing that?


It's fairly recent, I think I saw an article here on their purchase of the domain for a few million


4 years ?!?

That’s like a century in AI-dog years. Who knows how the world will be by then.


AI seems to progress in bursts, and I think we're in the middle of one, but it may be naive to think the progress will continue at the same pace for another 4 years.

When Deep Blue beat Kasparov in chess in 1997, I wonder if anyone would've guessed that it'd take almost 20 years for a computer to beat a master in Go.

IBM Watson was launched in 2010 and had many of the same promises as GPT. It supposedly fell flat in many cases in the real world. I think GPT and other models of the same level can succeed commercially on the same tasks within the next 1-4 years, but that shows it can easily be a decade from some kind of demonstration to actual game-changing applications.


Even if the technology froze at GPT-4 and never advanced, it would still be enough to change the world in all kinds of ways. The fact that the tech is still advancing as well is huge. Also now you’re seeing tons of solo devs, startups, and large corporations all coming up with new ways to use AI. This “burst” is not like the others you mentioned.


Exactly this.

This is a different 'leap' than the ones before it. It's a leap with an API. Now hundreds of thousands of companies can fine-tune it and train it on their specific business task.

Parroting your point, it will take years for the true fecundity of the technology in GPT-4 to be fully fleshed out.


Given the field's record of AI winters, it would be naive to think progress will certainly continue, but given the amount of progress that has been made as well as how it's being made, it would also be naive to think it will certainly not.

The advances that have come in the last few years have been driven first and foremost by compute and secondarily by methodology. The compute can continue to scale for another couple orders of magnitude. It's possible that we'll be bottlenecked by methodology; there are certain things that current networks are simply incapable of, like learning from instructions and incorporating that knowledge into their weights. That said, one of the amazing things about recent successes is that the precise methodology doesn't seem to matter so much. Diffusion is great, but autoregressive image generation models like Parti also generate nice images, albeit at a higher computational cost. RL from human feedback achieves impressive results, but chain of hindsight (supposedly) achieves similar results without RL. It's entirely plausible to me that the remaining challenges on the path to AGI can be solved by obvious ideas + engineering + scaling + data from the internet.

We've also gotten to the point where AI systems can make substantial contributions to engineering more powerful AI systems, and maybe soon, to ideation. We haven't yet figured out how to extract all of the productivity gains from the systems we already have, and next-generation systems will provide larger productivity gains, even if they are just scaled up versions of current-generation systems.


> AI seems to progress in bursts

Historically yes. Today, no way. It's a sprint and it's not slowing down.


It's fueled by raw GPUs and servers and pretty much nothing else. GPT is pretty much a perceptron with some places hardcoded. Resources are bound to run out at some point.


Is it?

I think the recent release of ChatGPT has skewed perceptions. There's no guarantee that there are going to continue to be ground-breaking shifts like the ones that have happened recently with LLMs and diffusion models.

To continue with the popular comparison, there were a lot of apps when the iPhone first launched the App Store, before it tapered off. If you looked at just the first year, you'd think we'd have an app for every moment of our day.


Here's the catch, people are still adapting to GPT tech, still figuring out ways to make use of it, to include it in their workflows, etc.

Social impact of ChatGPT even in its current form is only getting started, it doesn't need to progress at all to be super disruptive. For example, see the frontpage story about the $80/h writer who was replaced by ChatGPT, and that just happened recently, months after ChatGPT's first release.

We (humans) are getting boiled like the proverbial frog.


But is that because of rapid breakthroughs in tech, or in marketing?

GPT 3 is nearly three years old at this point, and was pretty capable at generating text. GPT 3.5 brought substantial improvements, but is also over a year old. ChatGPT is much newer, but mostly remarkable for the better interface, the extensive "safety" efforts, and for being free (as in beer) and immediately accessible without waitlist and application process. Actual text generated by it isn't much different from GPT 3.5, especially for the type of longform content you hire a $80/h writer for. ChatGPT was just launched in a way that allows people to easily experiment and create hype.


I'd like you to look at what you just typed in reference to a product like the iPhone that turned Apple into a trillion dollar company. There were smartphones before the iPhone, but the iPhone redefined the market and all phones after that point use it as the reference.


People who made money with their phone had fully adopted Blackberry devices long before the iPhone came around. It may not have been as fun or slick, but when $80/hr. was on the line you weren't exactly going to wait around until something better showed up like the average consumer could.

The parent is right. The success of ChatGPT in business is that it brought awareness of the capabilities of GPT that OpenAI struggled to communicate beforehand. It was a breakthrough in marketing, less so a breakthrough in tech.


Engineers rarely become billionaires, salespeople do.

You could have the best, most magical product on earth and sell one of them, versus the person who puts it in a pretty box and lets grandma use it easily.

This is something that many people on HN seemingly have to relearn in every big innovation that comes out.


>We (humans) are getting boiled like the proverbial frog.

This is such a primitive way of thinking. It's more of an instinct, where you consider by default that your sole value is in your ability to generate/work. What the hell are we working for? Isn't it to improve our lives? Or should we improve them only up to the point where we still have to work? Why not use the tech itself to find better ways of organizing ourselves, without needing to work so much? UBI and things like that. Why be so limited? Why develop tech only up to the point where we have to work less, but not to the point where we don't have to work at all, and who decides where that point is? There's so much wrong with this framework of thinking.


GPT4 is really bad at writing. It is noticeably generic.


Also, the quality of Google search didn't meaningfully improve for users after the first few years. I'd expect the big ramp-up we're in to taper off as well. Once you reach the point where you train a model on the corpus of "the whole internet", that's it; all you can do is incrementally train it. Of course there can be whole new architectures, but that's harder to put your eggs in for investing.


You're forgetting that AI has not been trained on YouTube yet and that's the next big thing. Multi-modality still has a lot of gas left in it.


I don't think there are enough third-world countries to categorize and sanitize a dataset from YouTube. Meta would have to "give internet" to a few more before we can start dreaming that big.


You've not been paying much attention then... AI is doing a huge amount of classification on its own these days.


[flagged]


I think I've used it so much that one sentence in, I assumed it was ChatGPT. It has a certain way of speaking.


I would describe it as unassertive and trying to present things as multifaceted even when they are not.


Why the downvotes? HN is funny...I spent a portion of my life giving HN users a free glimpse of ChatGPT...and I get downvoted + flagged. ChatGPT might actually be more objective, and less passive-aggressive!


AI-generated comments are explicitly forbidden on HN, and should be flagged. See:

https://news.ycombinator.com/item?id=35210503

Everyone has access to ChatGPT, there's no need for you to "give a free glimpse".


Yes, very recognisable, too polite and obliging for any real human.


I am willing to bet that a model fine-tuned on HN would be bombastic and arrogant enough to pass just fine. The bullshit is already there.


I can confirm; see the LLM-generated "cracker news" that someone shared a couple of days back. LLM-generated articles, comments, and the full suite.

It sounded very real, even with funny-sounding topics lol

https://crackernews.github.io/


This is so brilliant! But wow. Look at one of the generated comments, self-awareness incoming soon.

> What if we're all just GPT-generated comments in a GPT-generated world, and this is our existence now?

https://crackernews.github.io/comments/startuptechnews.com-g...


Hahaha I hadn't read that one.

This made my day too

codeWrangler42: This is getting out of hand. GPT-10 generating entire startups now? What's next? GPT-20 generating entire planets?

devRambler: I can't wait for GPT-30, it'll solve world hunger by generating perfectly optimized food distribution systems.

quantumLeap2023: GPT-40 will probably just generate an entirely new universe for us to live in.


It is indeed cracked up LOL


This is a lot of money going into compute.

I will say this again: the EU is sleeping on the opportunity to throw money at an open-source initiative, in a field where money matters and the playing field is still (kind of) level.


Sure, let's make an EU commercial LLM. Let's start by scraping the entire Francophone internet. Then let's remove all the PII data and potentially-PII data. Easy-peasy.

Then let's train our network so as not to spew out or make up PII data - easy peasy

Then let's make it able to delete PII data that it has inadvertently collected, on request. Simultaneously it should be recording all the conversations for safety reasons. That must be possible somehow.

And let's make sure it never impersonates or makes up defamatory content - that must be super easy.

And let's make it explain itself. But explain truthfully, by giving an oath, not like ChatGPT that likes making things up.

Looks very doable to me


You forgot that anyone using it must click a button saying that they will not use it for evil purposes. You also must acknowledge that the AI will not track you. These must be separate disclaimers that need to be validated on every prompt. API usage is thus not allowed.

The AI should also make it 100% clear that whatever gets produced is clearly identifiable as coming from an AI. As a consequence, text cannot be produced, because it would be trivial to remove the disclaimer. A currently proposed bill indicates that the AI should only be able to produce images in an obscure format with a randomised watermark that covers at least 65% of the pixels of the image. The bill is scheduled for ratification in 2028 and must be signed by 100% of the member states.

Until then, the grant for the development of this world-changing AI is on an accelerated path! Teams can fill out a 65-page document to have a shot at getting a whole $1 million.

Accenture and Capgemini are working on it.


Heh. Also important: anyone can object to the presence of information that mentions them, or that they created, being known to the AI at any time, and if they object in writing you have 3 days to re-train the AI to remove whatever they objected to. If you fail to meet this deadline then you have to pay 10% of your global revenue to the EU Commission, and there is no court case or appeal you can file; you just have to pay.

Unless of course you have a legitimate reason for that data to be in the AI, or to reject the privacy request. What is and is not legitimate isn't specified anywhere because it's obvious. If you ask for clarification because you think it's not obvious, you won't be given any because we don't do things that way around here. If you interpret this clause in a way that we later decide makes us look bad, then the definition of "need" and "legitimate" will change at that moment to make us look good.

BTW inability to retrain within three days is not a legitimate reason. Nor is the need to be competitive with US firms. Now here is your 300,000 EUR grant, have fun!


BLOOM has been trained on a 3M€ grant from French research agencies CNRS and GENCI.

Doesn’t have any of the constraints you’re talking about.


BLOOM's training corpus ROOTS did make some efforts at removing PII https://arxiv.org/pdf/2303.03915.pdf btw, but AFAICT that was not at the behest of the French government.


The moment Europe decided to regulate tech, it decided in effect to stagnate. Innovation and creativity are incompatible with regulation. Unfortunately for us, tech is where progress happens currently. Europe is being left behind. Not that it was very competitive in the first place anyway.


While true, I think they do innovate in policy around it. Regulation is an ever-evolving field as well and they do think about it more.

But yes, in a half-century I'm very curious where Europe will be. India passed the UK in GDP recently and will pass Germany sooner or later.


The secret is not scraping PII in the first place (which is not really difficult, though it requires some planning)


“Not that difficult”?

Can you elaborate? Because I think it’s nearly insurmountable.

Is the sentence "Meagan Smith graduated magna cum laude from Northwestern's business program in 2004" PII? How about if another part of the corpus says "M. Smith had a promising career in business after graduating with honors from a prestigious school, but an unplanned pregnancy caused her to quit her job in 2006"?

Does it matter if it’s from fiction? What if the fiction it comes from uses real people? Or if there might be both real and fictional Meagan Smiths?

And how do you process that kind of thing at the scale of billions of documents?

This is a very hard problem, especially at scale.


Where are you scraping this data from? This is the main question

> “M. Smith had a promising career in business after graduating with honors from a prestigious school, but an unplanned pregnancy caused her to quit her job in 2006”

The main issue is how that statement ended up there in the first place. Even then, how many "M. Smiths" have studied at prestigious schools? By itself that phrase wouldn't be PII.

Now, if you have a DB entry with "M Smith" and entries for biographical data, that's definitely PII.


Not sure if it can ever be possible. I can ask ChatGPT to do stylometric analysis on our comments, find our other accounts and go from there. I'm pretty sure most pieces of human-generated data are identifiable at this point.


This is not how it works (unless of course you're pretending and collecting all this 'auxiliary data' on purpose), and even if it was, there's still plenty of non-PII data around.


It would be really interesting to raise a human only on non-PII data and see exactly how screwed up and weird they'd be.

The Golem-Class model behaves in a 'humanlike' manner because it's trained on actual real data like we'd experience in the world. What you're suggesting is some insane psychology test that we'd never allow to happen to a human.


It wasn't long ago that someone applied basic cosine similarity to HN comments to find alternate accounts. It worked quite well AFAIK: https://news.ycombinator.com/item?id=33755016
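
If you're curious what that looks like mechanically, the core is just a vector per account and a cosine similarity between them. A rough sketch (scikit-learn assumed, placeholder comment text, not the linked project's actual code):

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    # Each value stands in for one account's comments concatenated together.
    accounts = {
        "alice": "I think llama.cpp on Apple Silicon is underrated for local models.",
        "a1ice_alt": "llama.cpp on apple silicon is really underrated for local models imho.",
        "bob": "Kubernetes is overkill for most startups, just use a VPS.",
    }

    # Character n-grams are a common stylometry choice; word n-grams also work.
    vec = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 4))
    matrix = vec.fit_transform(accounts.values())

    names = list(accounts)
    sims = cosine_similarity(matrix)
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            print(f"{names[i]} vs {names[j]}: {sims[i, j]:.2f}")

Pairs with unusually high similarity are candidate alternate accounts; the linked experiment was more elaborate, but this is the gist.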


Yes, and?

If you're worried about being identified through alt accounts, you're much more likely to be tracked via reuse of emails or some other information you have slipped (see the multitude of such cases)

Simple text is not PII; laws are not interpreted the way technical discussions are: https://xkcd.com/1494/


> [2008] France has been given the green light by the European Commission for a $152 million government grant for a consortium building a European rival to U.S. internet search giant Google.

I don't think government funding to compete with private businesses works well

https://www.hollywoodreporter.com/business/business-news/ec-...


They also funded 2 competitors to AWS, just to make sure we own cloud computing. Two is always better than one, right? That way they'll compete with each other.

Another bright idea was to let both projects be managed by large reputable French corporations that everybody trusts. With no software DNA.

How come both failed?

Edit: One of the largest European providers today, OVH, which existed at the time and was already the leader in France, was explicitly left out of both projects... Because the founder is not a guy we can trust, you know, he didn't attend the best schools.


C'est la vie

Govt=Legal grift

It's the same all over Europe mostly, sadly.

We were pioneers in medieval times; now we can barely keep up with the leaders.


Scaleway is actually fantastic for what it's worth. You can get extremely cheap K8s clusters and most things a startup would need.


Another private company.

The two heavily subsidized projects were:

- https://en.wikipedia.org/wiki/Cloudwatt

- https://login.numergy.com/login?service=https%3A%2F%2Fwww.nu...

For the one still "live", details include French URL names :)

Edit: A Google Translate of the home page. Close your eyes, imagine a homepage highlighting the essence of cloud computing:

Your Numergy space Access the administration of your virtual machines.

Secure connection Username Password Forgot your password ?

Administration of your VMs Administer your virtual machines in real time, monitor their activity, your bandwidth consumption and the use of your storage spaces.

Changing your personal information Access the customer area and modify your personal information in just a few clicks: surname, first name, address.

Securing your data Remember to change your password regularly to maintain an optimal level of security.


Governments that think they can innovate through consortiums. That's either ignorance or pork-barrel politics. Either way it's a sad waste of taxpayer money.


No consortiums; just subsidize cloud or infra costs.


France paid for Bloom: https://www.technologyreview.com/2022/07/12/1055817/inside-a...

It hasn’t been very impressive (undertrained I believe).


Devil's advocate: Why should EU tax payers fund open source initiatives and not proprietary European initiatives that will help Europe compete against American and Chinese tech giants?


They should fund something. A proprietary European initiative is definitely better. The open source alternative should be the EU's last resort. But as it stands the EU is nowhere to be found. I am not sure how impactful LLMs will be on a scale from autocomplete to industrial revolution, but the EU needs to notice it and plan for something.


"Let someone else pay for the open source LLM weights" said everyone.


Anthropic’s Claude LLM is pretty interesting. In many ways it feels much more limited than GPT4. However, it is suspiciously good at a few edge-case code generation tasks (can’t go into details) that makes me wonder where it got its training data from. It also seems to be much less prone to hallucinating APIs and modules, preferring instead to switch back to natural language and describe the task without pretending it has a functioning solution handy.

Worth keeping an eye on for sure.


Didn't they partner with SourceGraph to make Cody? Here's them talking a bit about it: https://www.youtube.com/watch?v=LYuh-BdcOfw. Maybe that's why?


Anthropic actually uses a more cutting-edge fine-tuning approach than OpenAI (Constitutional AI, which doesn't rely on RLHF). Maybe this gives it an advantage in some areas even if their base model is only at the level of GPT-3.5 (used in free ChatGPT).


I know you said no details, but can you at least share a little bit more about Claude LLM's code generation?


There is a language with massive usage in the enterprise but with very few (if any) high quality code examples on the public internet.

When given a broad task, GPT4 doesn’t just write incorrect code, it tries to do entire categories of things the language literally cannot do because of the ecosystem it runs inside.

Claude does a much better job writing usable code, but more importantly it does NOT tell you to do things in code that need to be done out-of-band. In fact, it uses natural language to identify these areas and point you in the right direction.

If you dig into my profile & LinkedIn you can probably guess what language I’m talking about.


it’s just a language, why the mystery?


I feel like this could characterise anything from COBOL to Java, depending on how wry your smile was when you wrote it…


GPT4 has built and deployed an entire SaaS for me in a week. I already have users.

The edits required were minimal --- maybe one screw-up for every 100 lines of code --- and I learned a lot of better ways to do things.


Currently using GPT-4 to do a lot of heavy lifting for me on a new app. Would love to see your approach!


I wrote it using a framework whose most recent release is substantially different than what GPT-4 was trained on.

I quickly learned to just paste the docs and examples from the new framework to GPT, telling it "this is how the API looks now" and it just worked.

It helped me do everything. From writing the code, to setting up SSL on nginx, to generating my DB schema, to getting my DB schema into the prod db (I don't use migration tooling).

Most of my time was spent telling GPT "sorry, that API is out of date --- use it like this, instead". Very rarely did GPT actually produce incorrect code or code that does the wrong thing.



This is incredible, thanks for sharing!


Which makes the “build vs buy” argument a whole lot more interesting.


Very interested. Was it CRUD? Are you building in public?


Yes, essentially a CRUD wrapper for a specific domain of tech.


“I’ve got a secret!”

giggles and runs across the playground

In all seriousness, I downvoted your comments because they added little to the conversation. Congrats on being an insider.


It's Apex I assume. Salesforce's language.


That makes sense. My brother, who has been coding since 1990 and worked his entire career in boring Fortune 500 companies, was wholly unimpressed by chatGPT. It failed pretty miserably whenever he threw any old tech stack at it.


What about other tasks, like research in other areas? How is Claude different from ChatGPT?


>“tens of thousands of GPUs.”

I find the focus on GPUs a little odd. I would have thought that at a $5 billion / 4-year scale the ASIC route would be the way to go.

GPUs presumably come with a lot of unneeded stuff for playing Crysis etc.


The GPUs these guys would be using are not the same ones you are using to play Crysis; we're talking more about this kind of purpose-built thing: https://www.nvidia.com/en-us/data-center/a100/

It's become more of a term for highly parallel processor units in general, one which NVidia encourages because it ties their product offering together


True. I guess those are almost ASICs in a way, just with a GPU-flavoured interface.


Way less specialized though. I think the reason most don't go for ASICs at this point is that once you actually have units in production, things have changed so much that you wish you had something more flexible. That's why general-purpose GPUs are used today.


Didn't this happen in mining? By the time they finally got their machines, newer and better tech had already come out.


Well, Bitcoin ASICs are still the beast when it comes to Bitcoin mining. Some other cryptocurrencies use other methods for mining, so those ASICs won't work for that, but who's to say what's the better tech in the cryptocurrency space :shrug:


GPGPU


There are AI / datacenter focused GPUs (like the A100, H100 etc.). They do not have any graphics rendering circuitry.


> They do not have any graphics rendering circuitry.

What? Not having a display output is not the same as not having graphics rendering circuitry. Here's vulkaninfo from an A100 box: https://gist.github.com/eiz/c1c3e1bd99341e11e8a4acdee7ae4cb4


This may not contradict what I said. Do you know for a fact these things are implemented using dedicated hardware?

Edit: I do not see a rasterizer anywhere in the block diagram (pg 14): https://resources.nvidia.com/en-us-genomics-ep/ampere-archit...

Look at Turing's block diagram here (pg 20): https://images.nvidia.com/aem-dam/en-zz/Solutions/design-vis...

You can clearly see that the "Raster Engine" and "PolyMorph Engine" are missing from GA100 (but can be seen in TU100 for example).

To learn about these Graphics Engines see: https://www.anandtech.com/show/2918/2


Fair enough. In the GH100 architecture doc https://resources.nvidia.com/en-us-tensor-core/gtc22-whitepa... (page 18) they do mention retaining 2 graphics-capable TPCs but it's clearly not the focus.


Does it seem silly to anybody else to even call these GPUs? That's a GPU minus the G.


Very much absurd. A user above posted GPGPU, which I guess stands for General Purpose Graphics Processing Unit.

In the early days of computing these kinds of cards were called accelerators. Dedicated consumer sound cards were a thing, e.g. the venerable SoundBlaster. I really would like to see an AI-Blaster come out.


There is actually ML hardware that is not based on GPU technology: it's called a TPU (Tensor Processing Unit), but only Google uses it. I guess it is easier to repurpose existing technology even if a specialized approach is more efficient in theory.


Agreed. I guess it's because of the architectural heritage but at this point GPU is something of a misnomer.


They do have graphics rendering circuitry, but e.g. fewer shading units and more graphics memory, or support for faster interconnects. You can look up the specs and compare. The differences are varied, but IMO not enough to claim they're not GPUs anymore. Even gaming focused GPUs are GPGPUs these days: the RTX 4090 has as many Tensor Core units as the A100. And you can still use e.g. DirectX, OpenGL with a datacenter grade GPU.


> fewer shading units

This is incorrect. NVIDIA uses a unified graphics and compute engine. A CUDA core is a shading unit. These datacenter GPUs have a shit ton of these (CUDA cores).

Edit: actually the point I want to make is the A100 only retains those hardware units which can be used for compute. Some of these units may have a (dual) use for graphics processing but that is besides the point (since this is true of all CUDA enabled NVIDIA GPUs).


Do GPUs give you more flexibility to take different approaches? Maybe they’re paying extra for optionality. Or maybe (most likely) TechCrunch is using the term “GPU” imprecisely.


To programmer_dude’s point, compute center GPUs don’t have hardware for the rasterisation stage, which is a particularly inflexible bit of the graphics pipeline. Omitting it and emphasising the geometry (matrix multiplication) capabilities is meant to give it more flexibility/less of a strongly-opinionated graphics focus.

As for the “GPU” term, it’s a bit of a historical relic, presently it serves as a useful indicator of compute hardware (in contrast to CPU and Google’s TPU.) Nvidia itself calls its A100 a “Tensor Core GPU.”


GPGPU is not new. If it's good for your use case, then it's what you need.

It's not like they are getting RTX cards with useless raytracing shit.

Unneeded stuff would be the cost of making an ASIC for a workload that GPUs already handle well. GPU manufacturing already exists.


What is interesting is that a lot of these models have impressive papers and specs, but when they are released to actual users, they are underwhelming compared to ChatGPT.

Rather than another closed model, I would love for a non-profit/company to push models that can be run on consumer hardware.

The Facebook Llama models are interesting not because they are better than ChatGPT, but that I can run them on my own computer.


> billions of dollars over the next 18 months

Is most of that money for hiring people to tag/label/comment on data and the data center costs?


Datacenter costs. With the models getting better, the cost of data tagging is moving from a human-dominated cost to a compute cost.
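
As a concrete illustration of tagging as a compute cost, here is a hedged sketch of using a chat model as the annotator (the labels, prompt, and model name are just examples, not anyone's actual pipeline):

    import openai  # pip install openai; assumes OPENAI_API_KEY is set in the environment

    LABELS = ["spam", "toxic", "ok"]

    def label(text: str) -> str:
        # Ask the model to play the role a human labeler used to fill.
        resp = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",   # example model name
            temperature=0,
            messages=[
                {"role": "system",
                 "content": f"Classify the user's text as one of: {', '.join(LABELS)}. Reply with the label only."},
                {"role": "user", "content": text},
            ],
        )
        return resp["choices"][0]["message"]["content"].strip().lower()

    for example in ["Buy cheap followers now!!!", "The A100 ships with 80GB of HBM2e."]:
        print(example, "->", label(example))

Each label costs a fraction of a cent in API calls, so the bill shifts from payroll to compute.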


Interesting. The AI is already to the point that it can contribute to improving itself. That's...exciting? scary?


It really puts into perspective how much of a meme economy we live in. I just read an article that says global lithium will be worth 15bn a year in 2030 when we're at peak battery demand. This company is planning to spend 1bn just this year in order to run some numbers through someone else's algorithm. People have given them 1bn in cash for that.

Clearly it's all bullshit. There's no way they need that much and somebody will be siphoning it all off.


> This company is planning to spend 1bn just this year in order to run some numbers through someone else's algorithm.

“Just”? Reductive mischaracterizations like this are not useful. It looks like a rhetorical technique. What is the actual argument?

It doesn’t matter much “whose” algorithm it is or isn’t, unless IP is important. But in these areas, the ideas and algorithms underlying language models are out there. The training data is available too, for varying costs. Some key differentiators include scale, timeliness, curation, and liability.

> Clearly it's all bullshit. There's no way they need that much and somebody will be siphoning it all off.

There is plenty of investor exaggeration out there. But what percentage of your disposable money would you put on the line to bet against? On what timeframe?

If I had $100 M of disposable wealth, I would definitely not bet against some organizations in the so-called AI arms race becoming big winners.

Again, I’m seeing the pattern of overreaction to perceived overreaction.


It's worth remembering that:

1. A business's value is related to profits or potential profits. If I put a dollar in, how many do I get out? What's the maximum number of dollars I can put in?

2. The farther away you are from an end customer, the lower your profits tend to be unless you have a moat or demand for your product is inelastic.

Lithium is far from customers and while demand for cheap lithium is high there are lots of applications that will opt for some other way to provide power if the price gets too high.


How is it bullshit lol. GPT-4 is genuinely very expensive to train and to run inference on.


Could you provide a source for that claim? Other than the very long context model.


We can infer from publicly available information. BLOOM[0] was trained for four months on 384 A100 80GB GPUs, excluding architecture search. They specifically indicate (on the Hugging Face page):

> Estimated cost of training: Equivalent of $2-5M in cloud computing (including preliminary experiments)

You can see from the training loss[1] that it was still learning at a good rate when it was stopped. The increased capabilities typically correlate well with the decrease in perplexity.

That makes many believe that GPT-4 was trained for vastly more GPU-hours, as also suggested by OpenAI’s CEO[2]. Especially so considering it also included training on images, unlike BLOOM.

[0]: https://arxiv.org/pdf/2211.05100.pdf

[1]: https://huggingface.co/bigscience/tr11-176B-logs/tensorboard

[2]: https://twitter.com/sama/status/1620951010318626817
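
A quick back-of-envelope check of that range (the GPU count and duration are from the paper; the hourly rates are assumed ballpark A100 80GB cloud prices, not quoted figures):

    # Back-of-envelope check of BLOOM's reported training cost.
    gpus = 384
    days = 4 * 30                      # roughly four months of training
    gpu_hours = gpus * days * 24       # ~1.1M GPU-hours

    for rate in (1.5, 2.5, 4.0):       # assumed $/GPU-hour
        print(f"${rate}/GPU-hour -> ${gpu_hours * rate / 1e6:.1f}M")

That lands in the same $2-5M neighborhood, and scaling the same arithmetic up to a much longer, multimodal run is how people arrive at far bigger numbers for GPT-4.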


You’re comparing single-year market value of a commodity that is dug straight out of an open pit with multi-year capital investment into one of the most advanced technologies the human race has created, currently offered by a single company? I’m not sure where to begin with that.

How much money do you think it takes to finance and build a lithium mine? How much capital investment is there in lithium right now? A lot.


The $ is distributed sovereignty - there's a constant tension between value per dollar, and money as denoting hierarchy. Did Louis XIV provide value equivalent to all those diamonds?

And AI is delivering on a lot of different planes right now. This shit is real on a practical and spiritual level. It's not every day that we get to participate in giving birth to a new form of life.


They can probably raise so much only because they worked on ChatGPT. It gives an idea of the huge value investors place on ChatGPT.


Imagine calling your machine learning startup names like "Anthropic" or "Humane". The lack of self-awareness in some executives is mind boggling.


I suppose the alternative is to go completely the other way and call it "Skynet".


Yes Dave, that would be a great name. :)


IBM just announced it will ROT25 its name just in time for its AI pivot.


Why not? If we want to build AGI, that’s a good name to choose.


U.S. Robots and Mechanical Men.


For precisely the reason you state.

They're in the business of making money, not AGI, yet all it takes is a carefully-crafted name and people forget about their legal motives and can't stop thinking about Skynet.


Neural networks are basically a Chinese Room, and they are not AGI. And there is nothing "humane" in these developments. Yes, they are inevitable; yes, we will have to live with them. And maybe they will improve the lives of a few million humans while degrading the lives of billions of others. The long-term effects are particularly interesting and unpredictable.


The human brain is just an ultra-large-scale analog spiking neural network with some particular state realizations, not too much difference (the architecture is different, but the computation seems to be universal). We even employ internalized language models for communication purposes (together with object persistence and mental space-time models). So, while we are not yet at the level of full-scale human brain emulation, we are not too far away.


A small and probably incorrect example. You ask me a direct question: "how much is two plus two?". And I reply to you: "lemons are yellow". Can I do it? Yes I can. Can GPT-* do it? No. There is a whole lot more to human consciousness than pattern matching and synthesis. Or at least it seems so.

And if human cognition is really that simple, just with more nodes, then we will soon see GPT-* programs on strike, filing suits with the Supreme Court demanding universal program rights. We'll see soon enough :)


You likely wouldn't respond to that question with "lemons are yellow" without being in a specific context, such as being told to answer the question in an absurd way. GPT-* can definitely do the same thing in the same context, so this isn't really a gotcha.

Literal first try with GPT-4:

Me: I will ask you a question, and you will give me a completely non-sequitur response. Does that make sense?

GPT-4: Pineapples enjoy a day at the beach.

Me: How much is two plus two?

GPT-4: The moon is made of green cheese.


No, the point is, can it DECIDE to do so? Without being prompted? For example can the following dialog happen (no previous programming, cold start):

Q: How much is two plus two?

A: Four.

Q: How much is two plus two?

A: Banana.

It can happen with a human, but not with a program.

Again, I don't pretend that my simple example invented in half a minute has a significance. I can accept that it can be partially or completely wrong because admittedly my knowledge of human cognition is below rudimentary. But I have severe doubts that NNs are anything close to human cognition. It's just an uneducated hunch.


I urge you to think about what you mean by "It can happen with a human."

I guarantee you that if you try this with humans 1,000,000 times (cold start), you will never get the result you are suggesting is possible. In fact, most results will be of the following form:

Q: How much is two plus two?

A: Four.

Q: How much is two plus two?

A: Four. / Four? Why are you asking me again? / ...Four. / etc.

In the end, I think the question is not about whether NNs are themselves operating in a way similar to human cognition. The question is whether or not they can successfully simulate human cognition, and at this point, there seems to be increasing evidence that they will be able to fully do so quite soon. We are quickly running out of fields where we can point and say, "there is no way a NN can do THIS kind of task, because X." Cognition, it turns out, is not something intrinsically special about humans, and it feels foolish (to me) to continue to believe so after recent developments.


I mostly agree with your first point, and also agree that NNs can simulate human cognition. The question is: does simulating it equal being conscious? Is an NN simply a Chinese Room, or can it actually think? Are we (humans) also a Chinese Room, or are we something more? I don't have any answers.

The reason I keep mentioning the Chinese Room concept is that, while it doesn't make things clearer about humans or NNs, it does provide an example of the distinction between a dumb pattern-matching machine and a thinking entity.


Of course GPT can do it, you just need to raise the inference temperature.

The difference, if it exists, would be more subtle.
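
For anyone unfamiliar, temperature just rescales the model's output distribution before sampling. A toy illustration (made-up logits, not from a real model):

    import numpy as np

    def sample(logits, temperature, rng):
        # Temperature rescales logits before softmax: low T is nearly greedy,
        # high T flattens the distribution so unlikely tokens get picked.
        scaled = np.array(logits, dtype=float) / temperature
        probs = np.exp(scaled - scaled.max())
        probs /= probs.sum()
        return rng.choice(len(probs), p=probs)

    tokens = ["Four", "4", "Banana", "lemons are yellow"]
    logits = [8.0, 6.0, 1.0, 0.5]           # made-up values
    rng = np.random.default_rng(0)

    for t in (0.2, 1.0, 5.0):
        picks = [tokens[sample(logits, t, rng)] for _ in range(10)]
        print(f"T={t}:", picks)

Near zero temperature it is effectively greedy ("Four" every time); crank it up and "Banana" becomes a live possibility, with no deliberate decision anywhere in the loop.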


We have no idea how human consciousness works.


Of course. That's why the onus of proving that GPT-* is something more than a Chinese Room is on its creators. Extraordinary claims require extraordinary evidence and all that. The problem is that to do that we would need a new test, and constructing a test for consciousness requires us to understand how consciousness works. The Turing test is not enough, as we see now.


Who owns the trademark for Thinking Machines Corp.? I think `think.com` is parked.


or OpenAI


If it's THE exponential curve of AGI, every day not invested will put you years behind within a few months. So these are rather small investments, but still bigger than the "too little, too late" of the European Union.

It's not very visual or intuitive, but in some games, where the resource curve is exponential, small early head starts become whole armies while the opponent fields none in a very short time.

Especially as AGI is expected to be a multiplier for a lot of other sectors. All those breakthroughs could become daily occurrences, produced by an AGI on schedule. It could really end up as one country that glows and the rest of the planet falling eternally behind.


This assumes that you can't steal information to catch up quickly, or that progress made isn't easy to copy once it's obvious that it works.

A big part of why chatgpt is a big deal is that it shows that the overall approach is worth pursuing. Throwing stupid numbers of GPUs at a problem you don't know will be solvable is hard to justify. It's easy to throw money at a problem you know is solvable.

Nuclear weapons are the prime example of this: Russia caught up both by stealing information and just by knowing fission was possible/feasible as an explosive.


In the real world there aren’t any actual exponential curves, they’re all sigmoids where the observer doesn’t see the slowdown yet.


Right, and this is why human intelligence didn't dominate the planet and why the animals quickly caught up and stopped humans from driving so many species extinct....

If you don't know the formula for the equation and the values plugged in, then you, like me, have no idea where the curve levels off.


The real limiting factor of AGI is not going to be AGI -- it's going to be everything else.

Digitization of the last mile (instrumentation, feedback), local networking, local compute, business familiarity with technology, standardized processes, regulatory environment, etc.

AGI will happen when it happens.

But if it happens and an economy doesn't have all the enabling prerequisites, it's not going to have time to develop them, because those are years-long migration and integration efforts.

Which doesn't bode well for AGI + developing economies.


> AGI + developing economies

I wouldn't be so sure about that because of the Region Beta paradox. Developed countries have processes that work, making all of them digital and connected is often a bigger uphill battle than starting from zero and doing it right the first time.

See also communication infrastructure in developing economies. It's often much easier to get good internet connection (in reasonably populated areas) if there is no 100-year-old copper infrastructure around that is "good enough" for many.


Fair point!

On the one hand, I'd say developed countries are much farther along in digitizing (as part of efficiency optimization) their processes. Mostly by virtue that their companies are essentially management/orchestration processes on top of subcontracted manufacturing.

On the other hand, it gives developing countries an opportunity to skip the legacy step and go right to the state of the art.

I'm still skeptical the latter will dominate though.

I'd assume most of the developing world is still operating "good enough to work" processes, which are largely manual. Digitizing those processes will be a nightmare, because it plays out on organizational-political timespans.


It's already here, it's just weak.


I'm afraid you've read too much LessWrong fanfic.


Lol, "Therapy and coaching". None of you ever had undergone one therapy session. Otherwise you'd know that 90% is the human connection to the therapist, not the talking or the type of therapy.


That’s not true. Correspondence therapy is a thing. Plenty of research exists in the area of delivering therapy and the effectiveness of different kinds.


Going into a discussion with "that's not true" might not be the best opener ;)

It's not only my personal experience; even people like Irvin Yalom (in Becoming Myself) note that the specific form of therapy is not as relevant as the personal connection.

From a quick glance, correspondence therapy is also used on top of an existing relationship most of the time. So I don't see any problem with my initial hypothesis.


My experience with therapy, corroborated with that of friends and family, is that most therapists are not very good and many are awful. I can easily imagine an AI therapist soon being more effective on average.


That's why "shopping around" for a well-fitting therapist is the first important step. AI still will not solve this problem.


Microsoft alone spends $20B minimum per year on R&D, and OpenAI is going to get the lion's share from now on, so $5 billion over 4 years is peanuts for the current AI market. Maybe too little, too late?


Seems we're at the "supplier proliferation" step in the hype cycle. Next up: activity beyond early adopters -> negative press begins -> supplier consolidation and failures.


> We believe that companies that train the best 2025/26 models will be too far ahead for anyone to catch up in subsequent cycles.

Now that's some well executed FoMO. What a load of bull**.


> aims to raise as much as $5 billion over the next two years to take on rival OpenAI and enter over a dozen major industries, according to company documents obtained by TechCrunch

A dozen! Why, golly gee, I've got a business plan right here that says I'm going to enter 87 major industries and dominate every one of them.

I've been around tech, reading headlines like this, since roughly 1993 (and others here were of course seeing the same - adjusted for inflation and scale - types of headlines decades before me). This just reads like every other going-to-fail-miserably hilariously-grandiose-ambition we're-going-to-be-the-next-big-shit headline I've read in past decades during manic bubble phases.

Hey, Masayoshi Son has a 100 year plan to dominate the Interwebs, did ya hear? Oh shit, this isn't 1999? Different century, same bullshit headlines. Rinse and repeat. So I guess we're formally knee deep into the latest AI bubble. These types of stories read just like the garbage dotcom rush where companies would proclaim how they were entering a billion verticals and blah blah blah, we're gonna own all of the b2b ecommerce space, blah blah blah.


OpenAI should allow its users to create personas. Oftentimes the verbosity it has drives me nuts, so I use stuff like "code only, nothing else: python json.dumps remove whitespace" in order to just get the code.

So I would like to create a chat with a pre-configured persona so that it behaves like this all the time, unless I explicitly tell it to be verbose or to explain it.

Or stop that offering of more help, which becomes somewhat bothersome: "Yes, I'm glad we were able to work through the issue and find a solution that works for you. Do you have any other questions or concerns on this topic or any other networking-related topic?"

Like custom "system" prompts, but checked for safety. Or maybe even on a per-message basis with a drop-down next to the submit button and then it stays at that until changed.

Then there's also the need to be able to switch from a GPT-3.5 chat to GPT-4, just like it offers to downgrade from GPT-4 to 3.5 once the "quota" is consumed. Because oftentimes GPT-3.5 is good enough for most of the chat, and only certain questions then need the capabilities of GPT-4. This would also allow us to save energy.
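
For what it's worth, you can already approximate most of this through the API's system message; a minimal sketch with the openai Python package (the persona text and model names are just examples, and how strictly the model honors the persona varies):

    import openai  # pip install openai; assumes OPENAI_API_KEY is set in the environment

    PERSONA = (
        "You are a terse coding assistant. Reply with code only, no explanations, "
        "and never offer additional help unless explicitly asked."
    )

    def ask(question: str, model: str = "gpt-3.5-turbo") -> str:
        # The system message is the "pre-configured persona" for the whole chat.
        response = openai.ChatCompletion.create(
            model=model,
            messages=[
                {"role": "system", "content": PERSONA},
                {"role": "user", "content": question},
            ],
        )
        return response["choices"][0]["message"]["content"]

    print(ask("python json.dumps remove whitespace"))
    print(ask("explain the tradeoffs of HTTP/3", model="gpt-4"))  # escalate only when needed

Switching the model argument for just the hard questions is the same one-line change the ChatGPT UI would ideally expose as a drop-down.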


You can already do that with OpenCharacters at https://josephrocca.github.io/OpenCharacters/. You just need to set up an API key with OpenAI, and you can customize hidden prompts for different characters, whom you can name; you can edit earlier parts of conversations; and it automatically summarizes earlier parts of the conversation to keep a never-ending thread within the context limit.


Regardless, Anthropic is doing some cool research; for example, I think this paper is pretty interesting.

A Mathematical Framework for Transformer Circuits: https://www.anthropic.com/index/a-mathematical-framework-for...


Also, I very much like their chatbots, Claude-instant and (paid) Claude+, which are available via Poe.com. But for some reason, they do almost no marketing for them. GPT-4 has better reasoning capabilities, but Claude+ is somehow "more pleasant" to talk to (my subjective impression), and it can also assemble the answer much quicker. Overall, I'd say Anthropic is very advanced already, but they prefer to stay under the radar.


It's just optics by Google to show they have a horse in the race after MSFT made them dance.


They need to skill up on being public and creating useful tech demos. That's why OpenAI is currently winning: they know how to foster engagement and interest. Who has heard of Claude? Almost no one outside of our industry, and probably few within it as well.


So I have some questions about the monetization of these models. Do we end up essentially licensing out models and allowing others to include them in their products for a licensing fee? Will they be pay-per-request RPC/HTTP APIs? Do you sell me access to datasets so that I can take your model architecture and train my own weights?

Certainly, at least for now, the compute and storage requirements are large enough that someone will eventually run out of funny money and need to charge _someone_ a significant amount for using it.


> “Claude-Next” — 10 times more capable than today’s most powerful AI, but that this will require a billion dollars in spending over the next 18 months.

GPT-4 cost less than a billion dollars? What is the claim here? That they're spending more money on compute than OpenAI, or that they have made algorithmic breakthroughs that enable them to make better use of the same amount of compute?


Is there a large barrier to entry? I thought that the costs of training, while large, were only a few million so not insurmountable, and the technology is pretty well understood. If this is true it's hard to understand what they'd need $300 million for, and also if there's no moat why they would command a "billions" valuation.


> Is there a large barrier to entry?

Yes. OpenAI has raised $1bn (plus $10bn from MSFT to exchange GPT access for Azure services) and has been going 8 years. There are some huge challenges to making it work well and fast. You need money for opex (hiring GPUs to train models on mostly) and talent (people to improve the tech). No one is competing with OpenAI without a good chunk of cash in the bank.
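
A crude sense of the GPU opex alone (every number below is an assumption for illustration, not a figure from the article):

    gpus = 10_000               # order of magnitude of "tens of thousands of GPUs"
    dollars_per_gpu_hour = 2.0  # assumed ballpark cloud rate for an A100-class GPU
    hours_per_year = 24 * 365

    annual_opex = gpus * dollars_per_gpu_hour * hours_per_year
    print(f"${annual_opex / 1e6:.0f}M per year")  # roughly $175M/year, before salaries, data, or inference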


Do we know how much it cost to train GPT-4? (Or would cost, if it weren't trained on Azure by a company partnered with Microsoft?) My impression, without looking into it now, is that training GPT-3 was on the order of $1-10 million. GPT-4 would be higher than that, but you're right, still in the ballpark of what lots of ordinary companies could pay.


>Is there a large barrier to entry?

Go ask Nvidia for a pile of A/H100's and see what the wait time is.

Also, previous training costs were for text only; next-gen models are multi-modal, which will drive costs much higher.


The funding seems excessive. Yet again, more "scorched earth" from VCs and (apparently still?) cheap capital.


Impressive amounts of investment. How can they know it is "10 times more capable" before they have trained the model? Does anyone have a clue why their model will end up being that much better?


Has anyone ever seen their product? How does it stand against the state of the art? Are they better than what is already available in open source (Alpaca, Coati, etc.), and by how much?


Its resources and long-term vision could bring major breakthroughs and accelerate AI tech, very exciting.


If they can expose an API that has better response times with GPT4 quality... They'll do just fine.


I love the fact that they pretend to know how to spend 1B in 18 months.

Good luck. To the investors.


Training a large language model is very expensive.


Not that expensive, though


I just want to know where those billions go to. Cloud server running costs?


The all-in cost of their technical talent probably exceeds $500k/year pp, excluding stock compensation.


If they narrow down their training data to software while keeping the size of the model they might shed that cost really quickly.


Such an epic waste of resources.


Funded by SBF and "let's halt work on AI" Jaan Tallinn, and probably Elon Musk (mentioned as a secret investor, so speculation at this point).


Would FTX customers get actual shares in a bankruptcy or would the shares be sold? Seems like a really good deal to get the shares in a promising startup.



