Saying Goodbye to GitHub (ersei.net)
504 points by donutshop on April 3, 2023 | 437 comments



I'm gonna get hated on for this, but I don't think "give back" is an open source concept.

I'm not aware of any Open Source license, or Free license for that matter, that has a give-back clause. Source code is available to -users-, not prior authors.

Some Open Source licenses (MIT, BSD, etc.) can be used in proprietary code with little more than simple attribution.

Those developers chose that license for a reason, and I've got no problem with commercial entities using that code.

There is a valid argument to be made about the training of models on GPL code. (Argument in the sense that there are two sides to the coin.) On the one hand, we happily train humans on GPL code. Those humans can then write their own functions, but for trivial functions they're gonna look a lot like the GPL source.

If the AI is regurgitating GPL code as-is, then that's a problem- not dissimilar to a student or employee regurgitating the same code.

But this argument is about Free software licenses, not really (most?) Open Source licenses.

Either way, OSS/Free is not about "giving back", it's about giving forward.

In the specific case here of Copilot making money, I'd say A) you're allowed to make money from Free/OSS code, and B) no one is forcing you to use this feature.


> I'm not aware of any Open Source license, or Free license for that matter, that has a give-back clause. Source code is available to -users-, not prior authors.

In essence, copyleft licenses are exactly that. They oblige the author of a derived work to publish the changes to all users under the same terms. The original authors tend to be users. So, a license which would grant this directly to the original authors would end up providing the same end result since the original authors would be both allowed to and reasonably expected to distribute the derived work to their users as well.

This aligns with the reason why some people publish their work under copyleft licenses: You get my work, for free, and the deal is that if you find and fix bugs then I get to benefit from those fixes by them flowing back to me. Obviously, as long as you only use them privately you are not obliged to do anything; the copyleft author gives you that option. But once you publish any of this, we all share the results.

That's the spirit here and trying to argue around that with technicalities is disingenuous. That's what Copilot does since it ignores this deal.


> That's the spirit here

not really.

This whole Free Software movement started with something really similar to "right to repair": a firmware bug in a printer whose software was proprietary. Free Software is about being in control of the software you use. The spirit was never "contribute back to GNU"; the spirit was always "if you take GNU software, you can't make it non-free". Those GNU devs at the time just wanted a good and actually free/libre OS, one that would remain free no matter who distributed it.

You are projecting the expectations of modern-day devs, formed in a world of heavily social development thanks to GitHub.

You might claim that GP was using the technicalities of the licences, but you can actually check the whole FSF philosophy and you'll note that it aligns perfectly with "giving forward", not "giving back".

Free Software is about users' freedom. Not dev rights, or politeness, etc. Now obviously, some devs picked up copyleft licenses with the purpose of improving their own software from downstream changes (Linus states that is the reason he picked the GPL), but that's a nice side effect, not the purpose. Which, of course, with popular social sharing platforms like GitHub, is how those things get confused.


> This whole Free Software movement started with something really similar to "right to repair": a firmware bug in a printer whose software was proprietary. Free Software is about being in control of the software you use. The spirit was never "contribute back to GNU"; the spirit was always "if you take GNU software, you can't make it non-free". Those GNU devs at the time just wanted a good and actually free/libre OS, one that would remain free no matter who distributed it.

Distinction without a difference. The end result is the same.


A very distinct difference.

GPL software is a box that must be kept open, so that everybody would be able to take from it.

If you pick the box and build an altered version of it, you must keep it open, you are legally prohibited from attaching a lid to it.

There's nothing about any expectations, let alone obligations, to put anything back into the original box. Usually that's not very easy (you must follow strict contribution standards) or even impossible (see e.g. SQLite, which does not accept outside patches).


If you leave the box open, the original author is free to take your modification and put it into their box.


Good for them! As long as all the resulting boxes remain open, it's fine and in accordance with the (spirit of the) license.


It is a pretty big distinction with different end results in practice. Look at Android, you can use the source in a "right to repair" manner but Google doesn't take patches so you can't give back even if you wanted to.

The same goes for Apple and Google's OSS browser. The source is there, but there is more or less no way to give back, and they certainly don't.


> Look at Android, you can use the source in a "right to repair" manner but Google doesn't take patches so you can't give back even if you wanted to.

Well, a license obliging the original author to take patches back would be a weird one.

But Google could pull any such change back into their own version and make it better.

> The same goes for Apple and Google's OSS browser. The source is there, but there is more or less no way to give back, and they certainly don't.

That's a different problem that's a bit orthogonal to licensing and has more to do with project leadership. Like, you don't even need to have an OSS license to allow users to contribute to a project.


The four freedoms[1] of free software specifically state that:

> The freedom to distribute copies of your modified versions to others (freedom 3). By doing this you can give the whole community a chance to benefit from your changes. Access to the source code is a precondition for this.

Emphasis on "give the whole community a chance to benefit from your changes".

1. https://www.gnu.org/philosophy/free-sw.en.html#four-freedoms


"giving whole community a chance to benefit from your changes" is very different from "upstreaming your changes"


I'm not sure I agree with this as a general point of view.

Speaking generally, I'm not sure that one can claim

>> The original authors tend to be the users

There are endless forks of, say, Emacs, and I expect RMS is not a user of any of them.

Of course RMS is free to inspect the code for all of them, separate out bug fixes from features, and retro-apply them to his build. But I'm not seeing anything in any license that requires a fork to "push" bug fixes to him.

>> This aligns with the reason why some people publish their work under copyleft licenses: You get my work, for free, and the deal is that if you find and fix bugs then I get to benefit from those fixes by them flowing back to me.

I think you are reading terms into the license that simply don't exist. I agree a lot of programmers -believe- this is how Free Software works, and many do push bug fixes upstream, but that's orthogonal to Free Software principles, and outside the terms of the license.

>> That's the spirit here and trying to argue around that with technicalities is disingenuous.

Licenses are matters of law, not spirit. The original post is about this "spirit". My thesis is that he, and you, are inferring responsibilities that are simply not in the license. This isn't a technicality; it goes to the very heart of Free Software.


> Licenses are matters of law, not spirit.

But also, I think even the spirit of the original copyleft movement is being misunderstood. As the GP said, the spirit was about centering users, requiring developers to be responsible to _users_, in order to create the kind of society where our _use_ of technology would be unconstrained in certain ways.

It was not about anything owed to the "original" developers, and it was not about developers' responsibility to other developers -- in the original spirit, even. It was definitely not about creating a system where people could make an adequate income from writing software. That was not even the spirit in which the licenses were devised.

(To be fair, it also imagined/hoped that a large portion of (but not all) users could also be "developers" in the sense they could tweak software to meet their needs -- for their own and others' use, though, not for money. Even if users would be coding, the "spirit" still centered them as users, and centered the conditions of use, their needs and desires for how that software would work, not conditions of profit or income from charging people for software use).


Most people don't actually want to maintain a fork. They would prefer that their patches are mainlined.

Consider Linux. It's huge, and most vendors really don't want to maintain a fully independent fork. One reason they might do it anyway is if they could keep their patches private. But the GPL means they can't, so most just choose to upstream patches.


> Most people don't actually want to maintain a fork. They would prefer that their patches are mainlined.

But that's a downstream decision regarding efficiency in their development process. That's not what free software is about; there is nothing about that in its principles or licences.

That's just a development-process and maintenance decision, offloading patch integration to upstream, which they might or might not accept depending on your changes. None of that is about Free Software. You can see similar decisions/trade-offs taking place in any org with multiple software dev teams with ownership over libs etc., regardless of whether it is free software or not.


It's like everyone here is a lawyer nitpicking the license as it exists today. But before the licenses even existed, Open Source was about those principles: to share code, to share bug fixes, to publish any improvements, so everyone would get better. To say the "license" doesn't make this explicit is really missing the point.


It’s more like everyone here is very definitely IANAL and _still_ nitpicking (and also missing the point).


You'd think so, but there's also a good chunk of copyleft code that's just "here's our source code, go figure out how to deploy lol".

You can try to fork it into something workable, but that can sometimes literally mean trying to figure out what the actual deployment process is and what weird tweaks were done to the deploying device beforehand. In addition, forking those projects is also unworkable if the original has pretty much enterprise-speed development. At best you get a fork that's years out of date where the maintainer is nitpicking every PR and is burnt out enough to not make it worthwhile to merge upstream patches. At worst, you get something like Iceweasel[0] where someone just releases patches rather than a full fork (and having done that a few times, it's a pain in the neck to maintain those patches).

FOSS isn't at all inherently community-minded; it can be and can facilitate it, but it can also be used as a way to get cheap cred from people who are naïve enough to believe the former is the only place it applies.

[0]: "Fork" of Firefox LTS by the GNU Foundation to strip out trademarked names and logo's. It's probably one of their silliest projects in term of relevancy.


People do release software as open source for the "street cred" (perhaps unsurprisingly, given the school of thought that you don't deserve a developer job if you can't show off a GitHub repo). A lot of people also create something and figure "why not?" They may just not be interested in doing any serious community development and maintenance. There are even significant open source projects that run a pretty much closed development process.


> They oblige the author of a derived work to publish the changes to all users under the same terms. The original authors tend to be users. So, a license which would grant this directly to the original authors would end up providing the same end result since the original authors would be both allowed to and reasonably expected to distribute the derived work to their users as well.

I might be in the wrong, but this is not how I understand GPL [0]. Care to correct me if I'm wrong.

What I get from the license is that you have to share the code with the users of your program, not anyone else.

AFAIK you could do an Emacs fork and charge money for it. Not only that, but the source code only needs to be available to the recipients of the software, not anyone else.

A company could have an upgraded version of a GPL tool and not share it with anyone outside the company. Theoretically employees might share the code outside, but I doubt they'd dare.

[0] https://www.gnu.org/software/emacs/manual/html_node/emacs/Co...


> What I get from the license is that you have to share the code with the users of your program, not anyone else.

You're correct, but it's sort of a meaningless distinction because those users are entirely within their rights under the GPL to share that code on with anyone they want, which is why we don't really see the model of "secret GPL" you describe in the wild.


I'd argue that this is exactly what you see in the wild. And it's why the AGPL license was created.

GPL code has to be shared with users who receive binaries. SaaS happily didn't ship binaries, so quite legally didn't ship source code.

AGPL redefines this in terms of "user", not "binary". That refinement exists entirely to cater for unexpected use cases. No doubt new licenses (AIGPL?) will be needed to address this issue.

The whole need for Open Source protection played out with the (Apache-licensed) Elasticsearch. Switching to the ELv2 and SSPL licenses was controversial and in some ways "not open source", and certainly not "free", because it limits what a user can do with the software.

So the distinction is far from meaningless and in some ways rendered GPL obsolete.


> That's the spirit here and trying to argue around that with technicalities is disingenuous.

First, I am not a lawyer, but don't licenses exist precisely for their technicalities? This is not like a law on the books, where we can consider the "letter and spirit of the law" because we know the context in which and for whom it was written. With a written license, however, someone chooses to adopt a license and accepts those terms from an authorship point of view.


Exactly. We all benefit from sharing contributions to the same code base. I use your library, you use mine, we fix each others bugs, add features, etc... The code gets better.


No need to hate on you for a valid response.

I think we're in a new enough situation that we can look beyond what's legal in a license. When many of us started working on open source projects, AI was a far-off concept. Speaking for myself, I thought we'd see steady improvement in code-completion tools, but I didn't think I'd see anything like GPT-4 in my lifetime.

Licenses were written for humans working with code. We can talk about corporations as well, but when I've thought about corporations in the past, I thought about people working on code at corporations. The idea of an AI using my open source project to generate working code for someone or some corporation feels...different.

Yes, I'm talking explicitly about feelings. I know my feelings don't impact the legalities of a license. But feelings are worth talking about, especially as we're all finding the boundaries of new tools and new ways of working.

I don't agree with everything in the post, but I think this is a great conversation to be having.


> Yes, I'm talking explicitly about feelings. I know my feelings don't impact the legalities of a license.

They don't impact the current legality of a licence, but they will affect future ones.

GPL/BSD/Apache/proprietary, they are all picked for ideological concerns which all stem from feelings. It is good to discuss these things, and it is good to recognise that these are emotionally driven.


> They don't impact the current legality of a license, but they will affect future ones.

Don't they? Even the most liberal licenses require that you at least keep the license and attributions. Which are exactly the parts that AI systems remove. I would have no problem with an AI system trained on GPL code if the output was still covered by the GPL.


Whether or not copyright applies at all to model training is an entirely open question, and where rulings have come down, it's likely closer to these situations being fair use (e.g. the Google Books case, which was ruled transformative and not a direct replacement for the works in question).

The reality is, these models don't copy or distribute anything directly, which makes applying copyright a bit of a stretch. Many people feel like it is a use that should have some sort of IP law applying to it, which is why I think there's some chance that courts or legislators will decide to screw the letter of existing law and just wedge new interpretations in, but it's not super simple: they'd have to thread the needle and not make things like search illegal, and that's tricky. Besides that, these models are out there, they're useful, and if they're ruled infringing they'll just be distributed illegally anyways.

I don't envy the people who will have to decide these cases, I suspect what's better for the world overall is to leave the law as-is and clarify that fair use holds (nobody will stop publishing content or code just because AI is slurping it up, a few weirdos like the article author excepted), but there are going to be a lot of pissed off people either way...


Would I be able to train an AI only on Microsoft's leaked Windows code to write a Windows clone with no copyright (since it comes from an AI) and be safe from legal repercussions because it was trained, under fair use, on code I just happened to find online?

If they rule that it's ok to do that, I might be ok with AI being ruled as fair use.


Exactly. I'm betting that if you asked GPT to create a Windows clone, for sure MS would not let you distribute it. This will go like every other law/license: big corp can sue the little guy into the ground. When big corp uses your code it will be "that's just model-generated code, not yours". But in the other direction, if the little guy creates a Windows clone, "sorry, it's off to jail for you, matey".


Even if it's the opposite direction, big guy losing and small guy coming out ahead, it's still drama.

Just like Covid introduced epidemiological terms to the general public, this issue can introduce design choices around licensing, copyright and watermarking to more people.

I assume there is a group of researchers building tools to provide fine-grained historical views into AI output. And yes, for billions of parameters trained on billions of documents, linking every letter to a source document is a UX nightmare.

But what a cool problem. That's the interesting part. Yeah, something like TileBars[1] or Seesoft[1] seems like the right tool. But maybe keeping it all text with some graphical marker of authenticity is the better choice.

So many cool problems. But, that authenticity marker is the hard sell. Can reasoned discussions with others be enough to introduce that, or is drama required?

https://people.ischool.berkeley.edu/~hearst/irbook/10/node7....


Is it even fair use under the law to consume that Windows source code?


People who train AI models claim that anything they can find on the internet is fair use.

It's the whole point of the discussion… is it really?

And if it's not fair use to train on windows source code because of copyright… doesn't that same copyright law cover everything else as well?


I think there's a reasonable distinction to make between "you can train AI models on any code that you are legally allowed to have and read" and "you can train AI models on any code that you are able to feed into it, regardless of whether you have permission to possess/read it".


You're not legally allowed to have and read it if you knowingly violate its license terms.


You misunderstood. Their emotions do not impact current licences, but their choice of licence is an emotional act.

There are plenty of arguments for choosing a licence, and as the world changes they will evolve. See how the GPL itself evolved to handle TiVo. These things aren't static.

I personally agree that GPL-trained code should produce GPL code. I see a distinction between teaching people and teaching computers, but that isn't my call.


The MIT No Attribution licence https://choosealicense.com/licenses/mit-0/ doesn't require an attribution.

It's the licence I choose for new projects.


That's very generous of you. Don't get angry though if I go ahead and re-publish all your code with your name replaced by mine. That's exactly what you are allowing me to do with such a license.


That would be a copyright violation, not a licensing issue. GP is giving you a _license_ to use his code without having to say "GP wrote this"; he is NOT giving up his rights to the code.

edit: to clarify, he's allowing you to _use_ it however you like, including making a derivative work, including it wholesale, etc. However, claiming authorship of the original code would still run afoul of the original copyright.

edit2: oh - if you mean relicensing the code - that is allowed.


They are emotionally driven, but that is not all there is to it. They are also driven by an insight: that most people/organisations do not give back to the community unless obligated to do so. It is about the advancement of society and goods everyone can use, if only people give back to society when it is their turn to share.

This is exactly what code laundering ANNs circumvent and that might open up a dystopian future for all of us, not only us code monkeys, but society in general.


>they are all picked for ideological concerns which all stem from feelings

Or, perhaps more commonly, they were picked for business model reasons or (related) because building a community that includes commercial interests tends to favor more permissive licensing.


Yes. People here seem to be forgetting that Open Source was a community-driven ideal first. The license came later as "protection". Corporations were stealing code and there was no recourse. The variety of open source licenses was created to provide a framework for the community, to fight off stealing, to keep it open. So GPT is very much "laundering" the code just like criminals "launder" money.


I agree that AI usage of code is somewhat murky with current licenses, which obviously don't mention it either way.

Free software has a principle of "freedom to run, to do whatever you wish" (freedom 0), so arguably training an AI is OK. (We could quibble over the word "run", but gnu.org and RMS clearly say freedom 0 does not restrict how you use the program.)

GPL code can be used by the military to develop nuclear weapons. Given that this is a guiding principle of the FSF, it's hard to argue that the current usage is not OK.


I have no problem with Copilot being trained on AGPL code and then getting released under an AGPL-compatible license. Free to do whatever they want with it.

The problem is Copilot training on source code and then discarding any restrictions of the licenses. Maybe it is legal right now, but I'm sure this case will find its way into open source licenses pretty soon.


Even if usage is legal right now, the other obligations of the license need to be adhered to as well. You can't just pick and choose one tiny aspect of FSF philosophy and run with that. AGPL is clearly about sharing and spreading free/libre software as well.


Do we know if Copilot X was trained on AGPL code, not just GPL?

Additionally I'm not sure if AGPL does anything.

I suspect the ethics and such of licensing when large fractions of work are training AI and using AI need to be worked out rather than getting mad at any individual.

What does copy left look like for AI?


There is indeed a problem of transparency right now. Companies, AFAIK, did not release the complete training data set. That might even be intentional, because they do not want to risk finding that they trained it on stuff they should not have, without building license and attribution into the output of their models. Or it might be that they already know that to be a fact.

I can only hope that lawmakers hurry to catch up with reality and impose transparency obligations for AI models.


I largely agree with you, but I think there is one question that hasn't been addressed yet: Are the weights learned by an LLM a derivative work?

When a person learns from GPL code this question doesn't arise. The state of a person's brain is outside of copyright. But is the state of an LLM also outside of copyright or outside of the terms covered by the GPL? I'm not sure.

An LLM can not only emit source code derived from code published under the GPL, it can also potentially execute a program and could therefore be considered object code.

This isn't necessarily a problem as long as the model isn't distributed and does not include any AGPL code.


> the state of a person's brain is outside of copyright

It clearly isn't. Which is why clean-room reverse engineering always requires at least two people. Or why a musician who accidentally recreates a chord progression they heard years ago but doesn't remember the source might still get sued.


No, you're missing the very distinction I'm trying to highlight.

When I read and remember some text, possibly also learning from it, I'm not making a copy and I'm not creating a derivative work. The state of my brain is outside of copyright. Only at the point where I create a new representation based on what I have read may I be violating someone's copyrights.

But is it the same for an AI? Is the act of reading, remembering and learning (i.e. adjusting model weights) not in itself tantamount to creating a derivative work?


> The state of my brain is outside of copyright.

Is it actually? If we could fully pull out the state of your brain, and understand that you stored a copy of a copyrighted work, I think you could be on the hook for licensing it, paying fees every time you remember the work as a performance of it.


The state of your brain is moot wrt copyright as you cannot distribute your brain.

Copyright is not just the exclusive right to make copies; it is the exclusive right to distribute them.

As a simple example, reading a book aloud in your home or singing in the shower is not copyright infringement; not even if you record it.

If you sell tickets to these performances or stream them on twitch it becomes copyright infringing.

Similarly it cannot be in violation of copyright for GitHub to train copilot on any random code they can legally access. It can be in violation to sell access to the model trained in this way.


The problems are attribution and licensing. Not use.


Fully agree.

This may seem a bit nitpicky and philosophical, but anyway: these feelings you mention are about things, and the things the feelings come from are what is most important. Feelings are never standalone; if they are, they are just moods, which are so personal it's hard to have a conversation about them.

Let's call 'the things' values. I'd say feelings are perceptions of values, and as such they invariably have a conceptual element to them. And exactly that conceptual aspect makes them suitable for conversation and sometimes even debate, insofar as they can be incorrect. We can acknowledge the subjective, emotive aspect of feelings as highly and inalienably personal, respect the individual opinion behind them and contest the implicit truth-claims all at the same time.


> I'm not aware of any Open Source license, or Free license for that matter, that has a give-back clause.

§5.c of the GPL Version 3 states explicitly:

> You must license the entire work, as a whole, under this License to anyone who comes into possession of a copy. This License will therefore apply, along with any applicable section 7 additional terms, to the whole of the work, and all its parts, regardless of how they are packaged. This License gives no permission to license the work in any other way, but it does not invalidate such permission if you have separately received it.

As in, all modifications must be made available. Is that not meeting your definition of giving back? The GPL (all variants) is one of the most widely distributed of the free software licenses and has an explicit "give back" clause as far as I can see -- which is part of why some people referred to the GPL as a "cancer".

FWIW, the issue I've come to have with Copilot is that you're not explicitly permitted to use the suggestions for anything other than inspiration (as per their terms); there is no license given to use the code that is generated. You do so at your own risk.


>> As in, all modifications must be made available. Is that not meeting your definition of giving back?

Available to all users. Not previous authors. There may be overlap, or there may not be overlap.

Plus, I would say it's giving forward, not back. If there are public users then the original authors can become users and get the code. But there will be bug fixes and features smooshed together.

Which is why I posit that there's no "give back" concept in the license. Only "give forward".


Man, that is really splitting some fine hairs


Wow! Just wow... Apparently, some people don't get the idea of common good... as if it didn't exist...


Modifications you distribute have to be given back. You don't have to distribute the modifications though.


Not given back to the original authors, but given to the users.


Only if you give those users your modified version.

If I host GPL software on a webserver and my users use that webserver, I don’t have to give them the source code for modified GPL programs. This is fairly common.


Depends on which GPL you’re using.


The issue is that many large corporations use FLOSS internally with heavy extensions/modifications and never give back to the community. They don't have to, since all users are in-house, and those tend to have access to the source code.

But that's ok. If upstream is somewhat active then it's just a pain to keep maintaining your in-house patches, compared to sending them upstream. So that's automatically a motivator. If upstream is not active, then it does not matter anyway.


I would say that open source as a movement sprung up from the principles of early netiquette[0]. Which themselves were built on the foundations of sharing your knowledge with your peers.

Whether you were trawling Usenet or just a presence in your local BBS scene, "teach it forward" was always a core concept. Still is. It's difficult to pay back to the person who taught you something valuable, so you can instead pay it forward by teaching the lessons - along with your own additions - to the later newcomers.

Of course the Eternal September changed the landscape. And now we can't have nice things.

0: https://en.wikipedia.org/wiki/Etiquette_in_technology#Netiqu...


I think the big push toward the formalization of GNU was that no free C compiler existed. RMS rightly saw this as a problem and did what he thought was needed to get a universal free C compiler.


Co-pilot spits back protected expressions, not novel expressions based on ideas harvested from code. It is therefore violating the licenses of numerous free and open source projects. The author is right to be pissed.


That's not the case; there's a probability it may "spit back" the protected expression. There's also a probability that I, as a human, "spit back" protected expressions. This could either be by pure chance or from past learnings: reading the protected code and internalizing it as a solution, my subconscious forgetting I actually saw it elsewhere.

In uni, students run their theses through plagiarism checkers, even if it's novel research, because such overlap naturally occurs.

As the thought experiment goes, given infinity, a monkey with a typewriter will inevitably write Shakespeare's works.


...except you don't need an infinite number of monkeys. It has been trained to produce protected expressions by virtue of being trained on protected expressions. The probability of it producing a protected expression at some point is 1.


The same truth holds for you or me writing up that code.


No it doesn't. My mind contains information derived from expressions I've read which I can rearrange into novel expressions. I don't regurgitate protected expressions verbatim. Co-Pilot does.


That's exactly what Copilot does; ask it to rearrange it if what it comes up with is the same. That's what code plagiarism checkers are for.


You are correct. The problem is that the GitHub Terms of Service probably (guessing) have a clause which invalidates your license if you upload your code there. And that's exactly why you shouldn't use GitHub.


The terms of service explicitly say that GitHub is not allowed to use your code for commercial purposes.


This seems to be what people imagine about it, not what it actually does, although I don’t doubt you could cherry-pick some snippet after a lot of trial and error to try to claim that it had regurgitated something verbatim. But certainly let’s see the examples.


You never know if a snippet it created came from another project verbatim or not… unless you claim you know all of the code that exists?


That's a bit extreme. In theory, an LLM's proclivity for plagiarism could be studied by testing it with various prompts and searching its training data for its responses (maybe with some edit distance tolerance).
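To make that concrete, here is a minimal sketch (my own illustration, not anything Copilot or OpenAI actually runs) of what such a check could look like, assuming you have the generated snippet and a local mirror of candidate training files; the 0.9 similarity threshold and the *.py glob are arbitrary illustrative choices:

    # Illustrative sketch only: scan a local corpus for near-verbatim matches
    # of a generated snippet, with a crude "edit distance tolerance" expressed
    # as a difflib similarity ratio.
    import difflib
    from pathlib import Path

    def near_matches(generated, corpus_dir, threshold=0.9):
        gen = " ".join(generated.split())               # normalise whitespace
        size = len(gen)
        for path in Path(corpus_dir).rglob("*.py"):     # file glob is an arbitrary example
            text = " ".join(path.read_text(errors="ignore").split())
            # slide a window of the snippet's length across each file
            for i in range(0, max(1, len(text) - size + 1), max(1, size // 2)):
                ratio = difflib.SequenceMatcher(None, gen, text[i:i + size]).ratio()
                if ratio >= threshold:
                    yield path, ratio
                    break

Of course, without access to the real training set, this can only ever show matches against whatever public subset you happen to mirror.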


Except the training data is secret…


You can search github and other open sources to find at least a likely subset of the training data though.


You suggest doing this, by hand, for every suggestion?


Just try it out for some code you have on GitHub where you know yours is the only solution out there. You'll be pleasantly surprised to see that it does not suggest a verbatim copy/paste of your code or anything close to it, unless you try this with a one-liner like how to do an fopen(), which would not be a good test and would not be the only solution out there. And then, seeing the result, you can adjust your theory. So, in short, I suggest simply testing your theory, not anything absurd like what you are coming up with.


What would that prove? I still have no access to all the proprietary code generated from copilot, and no idea if it did copy paste or not in all those cases.

You suggest I try it twice and since it will probably not copy paste in those 2 tries, assume it never copy pastes (despite existing evidence that it does copy paste in some other cases).

What problem would this exercise solve? I can't see it.


Search tools are a thing. Grep, Google, Github, etc.

But yes, exactly, you never know if a snippet came from another project or not, so let's not assume it did without some convincing evidence.


Why assume it didn't?


I’ve done tests and it passed with flying colors so it’s not an assumption. So the premise of your question is flawed.


It has been shown that github copilot does copy paste.

The fact that you tried it a couple of times (or 10 or 20) means absolutely nothing.

1 copyright infringement is enough for a lawsuit.


You're allowed to make money from Free/OSS code, and plenty of companies have (Google, Amazon etc.), but they have always also at least given back something to the community to earn some good will. The situation with AI is new because it not only doesn't give anything back, it actually takes something away by threatening developers' jobs etc.


> they have always also at least given back something to the community to earn some good will

The busybox authors disagree: https://busybox.net/license.html


One possible problem is if Copilot gets good enough that you can rather easily sidestep the GPL (or any other license) by having Copilot implement functionality X for you instead of using a license-bound library providing X. Not only may this be questionable regarding the license, but it would also tend to reduce contributions to the library which otherwise would have been used.


ChatGPT already can rewrite functions to use different algorithms.


Shouldn't be different from a human ripping or regurgitating parts of the code.

If it's not a hard derivation, then it's difficult to prove or even notice.


It shouldn't matter. If your model touched X during training, it should be seen as producing derivative work. This is the reason humans use clean room implementation techniques.


> If your model touched X during training, it should be seen as producing derivative work.

I as a programmer touched X during training (learning how to code). Is all my work now derivative because of that?


If you examined the source code of a library for a specific purpose Y, then shortly afterwards went implementing another library for purpose Y, there's a high probability of your code being infringing. That's the entire premise and purpose of clean room design (https://en.wikipedia.org/wiki/Clean_room_design).

Now factor in that machine models don't have a fallible or degrading memory and I think the answer is quite clear.


The opposite of clean room is probably much more common.


My point was more about the possible negative effects than about the legality.

An important difference is that AIs are much cheaper than a human. Having a library reimplemented by a human usually isn’t cost-effective, but having it done by an AI may become viable. That could cause a major change in the open-source dynamics, while possibly also reducing average software quality (because less code is publicly scrutinized).


It would be interesting to have a Free Software license that requires that anything which ingests the source code must be Free Software running on Free Hardware. If you train a model on such inputs, your model would need to be Free Software and all the hardware the model runs on would need to be Free Hardware. This would create a massive incentive to either not use such software in your model or to use Free Software and Free Hardware.

Taken to its logical conclusion, you could add the notion of Free Humans who are legally bound to only produce Free Ideas. One could imagine this functioning like a sort of monastic vow of charity or chastity: "a vow of silence on producing anything which is not Free (as in freedom)."

Would you take such a vow if offered 100,000 USD/year for the rest of your life (adjusted for inflation)? I would.


This idea ("make a stronger license") has come up in previous discussions of Copilot as well[0].

The problem is that the Copilot project doesn't claim to be abiding by the license(s) of the ingested code. The reply to licensing concerns was that licensing doesn't apply to their use. So unfortunately they would just claim they could ignore your hypothetical Free³ license as well.

[0]: https://news.ycombinator.com/item?id=34277352


> The problem is that the Copilot project doesn't claim to be abiding by the license(s) of the ingested code. The reply to licensing concerns was that licensing doesn't apply to their use.

I think GitHub is largely correct in their view on licenses. However, I would argue that you could create a stronger legal binding than, say, a GPLv3 license. For instance, you could require and enforce that anyone who wishes to read the repo must sign a legal contract or EULA: "By decrypting this git repo you are agreeing to the following licenses, restrictions, contractual obligations, ..."


From the MIT license:

> The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

Does GPT spit out the copyright notice when it regurgitates my code?


Is GPT spitting out “substantial portions” of your code?


It might. The ethical dilemma here seems to be:

1) We don't know when/if GPT will. GPT in its current form can't seem to guarantee safety (either from "substantial" verbatim snippets or from complex "hallucinations" of random pachinko output).

2) GPT doesn't know when/if it has. GPT in its current form likely cannot know this. (In part because it doesn't really "know" anything; that's too anthropomorphic a word for what is still mostly just a casino full of pachinko machines.)

3) Define "substantial portions" in a way that a jury of your peers can understand it in a court of law.

4) Can you define "substantial portions" again, only this time in code as guide rails for something like GPT? "Substantial portions" is barely a human term designed for human lawyers and courts. There's a fascinating challenge here on quantifying it.


Trivial functions aren't copyrightable, so it doesn't matter whether they are GPL or not.


> If the AI is regurgitating GPL code as-is, then that's a problem- not dissimilar to a student or employee regurgitating the same code.

Not "if". We know it does.

And since it doesn't show citations, it might be the case that you use it and mistakenly end up making your entire software GPL, because of including copy-pasted GPL code.


Yeah on the one hand, isn't opening your source all about not really minding what happens to it after that? It's intended to be copied and used. On the other hand, something about the term "laundering" kind of resonated for me. It's kind of like automated plagiarism where you spread your copying out over millions of people. But plagiarism only has meaning as an offense when the thing being copied isn't intended to be copied. But for copyright purposes, is there a difference between copying exactly and the type of blending an LLM does? I'm too confused. That feeling when you hit on something society has never thought about before.


If we go by how many people explain open source, you would be right; but if we go by how people who actually know what their licenses are supposed to do explain it, then no. You give a license for a specific reason. One might be to allow others to copy, but there is usually a condition, and that is to leave the license information intact. If we go further towards free/libre software licenses like the GPL or AGPL, then we have more conditions. For example, if you distribute software using that code, you need to distribute the source of your software as well (a bit imprecise).

If you want to get a better picture of the situation, read up on the licenses and what they do, specifically the term "copyleft".


That's why OSS is not about the contents of more restrictive licences (as you say, copyleft is a good example), but about the broadest definition that applies to all of them. More specialised sub-types can have more conditions, but then we should speak about those sub-types directly.

I make this point because it causes confusion. Saying "my OSS project is being run by others for money and I get none of it" is confusing, as it uses the supertype. Using the subtype is clearer, and even self-explanatory: "my project I licenced so others can make money from it while giving me none is being run by others to make money from it and I get none".

That may be a bad situation, but at least the reasons are now clear.


> Yeah on the one hand, isn't opening your source all about not really minding what happens to it after that?

No! That's a gross misrepresentation of what open sourcing is. It's the offer of a deal. You publish the source code, and in return for looking at it and using it for something, I have obligations, like attribution and licensing requirements regarding derived works.


No, open source isn't about practically giving up your rights; it's about restricting use of your code and software in exactly such a way that it gives every user as much freedom as possible.


This actually has been thought about before, in the context of remixes, collages, etc. The essential question is how much of the originality of the original work(s) constitutes the originality of the new/derived work. If it is little enough, then it’s okay. The issue with AI models is that they have no way of assessing originality and tracking the transfer of originality.


The term is being used here to imply that the generated code is somehow bypassing the licensing requirements, which isn’t necessarily true, and certainly isn’t a substantiated claim.

You can read licensed code, learn from it, and then write your own code derived from that learning, without having committed a copyright violation.

You can also read licensed code, directly copy paste it into your codebase, and still not have committed a copyright violation, as long as you did so in a way that constituted fair use (which copy-pasting snippets certainly would).

There’s no copyright issue here at all, and rationally speaking there aren’t any legitimate misuse of open source concerns either. If these people were honest they’d just admit to feeling threatened by AI, but nobody would care about that, so they just try to manufacture some fake moral panic.


I agree that copyleft is more about "giving forward", and I think it's a confusion a lot of people make. Reading through the thread, I get the impression that some think as soon as one "distributes" the licensed material, original authors should get a copy. I'm extrapolating of course, but even then I feel some people would agree with that statement.

GPL, for instance, merely states that distributed sources or patches "based on" the program should be "conveyed" under the same terms. In other words, anyone who gets their hands on it will do so under the same license.

If anything, I would be worried that GitHub trained itself on publicly-available but not clearly licensed code, because then it would have no license to "use" it in any way[0]. GPL provides such a right, so there is no problem there. It would be even more worrying if the not clearly licensed code was in a private repository but I think I remember reading that private repositories were not included in the training data.

However, would you consider a black-box program whose output can consistently produce verbatim, or at the very least slightly modified, copies of GPL code to be transformative? The problem does not lie in how the code is distributed but in how transformative the distributed code is. Not only does the same apply to any program besides AI-powered software, it applies to humans[1].

Given how unpredictable the output of an AI is, one should not be able to train it on GPL code if one cannot reliably guarantee it will not produce infringing code.

[0]: https://docs.github.com/en/site-policy/github-terms/github-t... (https://archive.ph/susi0#4-license-grant-to-us)

[1]: One such example would be how Microsoft employees allegedly prevented themselves from reading refterm source code, cf. https://github.com/microsoft/terminal/issues/10462#issuecomm...


Perhaps I'm out of the loop on this, but I always thought the concept of open source was primarily about the opportunity for personal professional development. The ability for someone not connected with a corporation to stay relevant and continuously update his skills in a way that was not dependent on proprietary systems. That is a huge asset, not only for oneself but also for the world.


That may be a benefit, but the primary concept of open source is that it's open source.


Time will tell, and the trend is destined to be more devs closed-sourcing their code, no matter what curious angle you take to justify large firms exploiting AI for money, and I doubt you are working for one of them.


IMHO, just like there was a robots.txt file made for the web, there needs to be a NOAI.txt for git repos. Sorry, this repo does not permit you to ingest the code for a learning model. Seems completely reasonable.
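Purely to illustrate the idea: no such standard exists today, and like robots.txt it would rely entirely on crawlers choosing to honour it, but a hypothetical NOAI.txt at the repository root might look something like this, with every directive name invented here for the sake of the example:

    # Hypothetical NOAI.txt -- not a real standard, directives invented for illustration
    User-Agent: *
    Disallow-Training: /          # nothing in this repo may be ingested for model training
    Allow-Training: /docs/        # except, say, the documentation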


What would be the point of that?

If we were somehow able to prevent AI models from ingesting a codebase, that would mean everyone else who wants to produce similar code would have to re-invent the wheel, wasting their time repeating work that has already been done.

All because... the person who did it first wants attribution? They want their name to be included in some credits.txt file that nobody will ever read? That's ridiculous.


> All because... the person who did it first wants attribution? They want their name to be included in some credits.txt file that nobody will ever read?

Yes, and yes. Those would be the terms that person publishes their code under. If you can't agree to those terms - maybe because including a single name in a credits.txt file that no-one reads is somehow too onerous for your process - then you are always free to re-implement that code on your own.


That wouldn't work. robots.txt is not enforceable.


I wonder if the GPLv4 will be coming out soon with an anti-AI-training clause (unless your output model and its outputs are also GPLv4, of course).


People keep bringing this up. It's not as straightforward as a clause that says "you can't use this to train AI" (which is what I suspect many people think).

Licenses operate on a continuum of permissiveness. They can only relax the restrictions that you as a creator are given by default. You can't write a copyright license that adds them. You could write a legal instrument that compels and prohibits certain behavior(s), but at that point you're talking about a contract. (And there's no way to coerce anyone to agree with the contract.)

Harry Potter has even more restrictions than the GPL or any other open source license. It's "All Rights Reserved"; it enjoys the maximum protections that a work can. And yet it would still be possible to feed it into an AI model, even if Rowling, Bloomsbury, and Scholastic all didn't want you to. They don't get a say in that. Nor do open source software developers in their works, which selectively abandon some of the protections that Rowling reserves for herself and her business partners.

The only real viable path to achieve this using an IP license alone would be a React PATENTS-like termination clause: if your company engages in any kind of AI training that uses this project as an input, then your license to make copies (including distributing modified copies) under the ordinary terms is revoked, along with revoking permission for a huge swathe of other free/open source software owned by a bunch of other signatories, too. This is, of course, contingent upon the ability to freely copy and modify a given set of works being appealing enough to get people to abstain from the lure of building AI models and offering services based on them.


I amend my wondering to prepend the GPL thing with "I wonder how long it will take Disney et al. to make it illegal to train AI on their stuff", which then opens that door for the GPL.


> I'm gonna get hated on for this, but I don't think "give back" is an open source concept.

You're right. It's a politeness law some people have invented.

It's also a value people have, but that's for themselves. I like contributing to OSS projects. But, as soon as it's imposed on others, and there are punishments for disobeying, it's a politeness law.


Politeness keeps society civil and sane.


Politeness laws don't. They calcify and oppress.


Could you link to a politeness law on the books that you feel is oppressive?


On the books? I mean laws like "you shouldn't make money from OSS made by someone else". The context of this chat.


oh, so those things are what people call "opinions", which they are completely allowed to have, just as you have yours. They aren't oppressing you, they don't expose you to penalties, you can't get thrown in jail.


No, not just opinions. I don't understand why you'd redefine what I'm saying, as though that invalidates the original statement. It's not exactly difficult to spot.


I’m not redefining what you’re saying. I’m pointing out that you are exaggerating the magnitude and impact of Internet opinions by characterizing them as oppressive laws.


    then that's a problem- not dissimilar to  
                          ^
Discontinuity here.


> On the one hand, we happily train humans on GPL code. Those humans can then write their own functions, but for trivial functions they're gonna look a lot like the GPL source.

Exactly. People are getting mad that Microsoft is making good money while the people who made all that free software available mostly did it for free (as in no money and no recognition). It can sound unfair, but that's the deal. If you didn't want people or AI to learn from your code, open source was not the right option.


> If you didn't want people or AI to learn from your code, open source was not the right option.

There's nothing wrong with other people using - learning and creating derivative works of - one's open-source code, provided they respect the terms of the license. It seems to me that the real issue is the fact that these licenses don't have enough teeth.


Most people I know who contribute to or host open source projects, me included, do this for references. And the most successful ones find a way to generate revenue. "Giving back" is a nice additional thing, but I don't know anybody who does that _primarily_ to "help the world".


If we are being honest as a community, open source developers are pretty far down the list of groups with valid grievances against this current wave of AI for how it is trained. There is at least a debatable case that these systems are operating in the spirit, if not the exact letter, of general open source licenses. It is a much harder argument to make for the AI trained on writing and art that is clearly copyrighted. If you have ethical questions about Copilot, you really should be against this entire crop of AI systems.


So you're suggesting that developers shut up and let the artists talk first? I'm not sure what the "you're suffering less than these other people" thing is actually intended to translate into? What do we do with that?

All software licences are based on copyright, same as writing, art, music, etc. Some software licences are permissive. Some writing is permissive (e.g. Cory Doctorow). Some music is permissive (e.g. Amanda Palmer). It entirely depends on what the author wants. The fact that more software is permissive is a good thing, right?

I entirely agree that there are ethical problems with training AI on copyrighted training data. But please let's not start gatekeeping this. We need to have a serious discussion as a culture about it, and saying "you're way down the list of victims" isn't helping.


> So you're suggesting that developers shut up and let the artists talk first? I'm not sure what the "you're suffering less than these other people" thing is actually intended to translate into? What do we do with that?

The tech community has a tendency to not care about issues like this until it affects us. I'm not telling people to shut up about this. I'm saying don't be a hypocrite. If this is the wrong approach for GitHub, it is a problem with the way all these AI are trained.


This is a pattern of argument I see all the time in minority communities, and it really rubs me the wrong way.

You are allowed to care about the things you care about and not have a concrete, well-informed opinion on something related that might be more pressing. It only becomes hypocrisy as soon as you actively dismiss the other thing as if you were well-informed. And I don't see anyone here doing that.


There is indeed a problem with the way all these models are trained, and too many people want to pretend that colour can be laundered out <https://ansuz.sooke.bc.ca/entry/23>


> If this is the wrong approach for GitHub, it is a problem with the way all these AI are trained.

Broken logic.


What I agree with is that the typical open source dev, who goes "I MIT license all my things, because I have seen it elsewhere and I don't want to think about licenses a lot," is pretty far down the list of groups of people with cause to complain.

What I disagree with is the idea that they should therefore not complain, or that there could not be an AI system that does not do code laundering, but keeps licenses in place and does this ethically and in an honest way. Adding "ethically" and "honest way" because I am sure that companies will try to find a way around being honest, if they are ever forced to add back the licenses.

In fact, artists might not be the group that grasps the impact of training on that corpus as quickly as the dev communities do. Perhaps it is exactly the devs who need to complain loudest and first, to have a signal effect.


>I'm gonna get hated on for this, but I don't think "give back" is an open source concept.

Well I guess you already know why you may be hated for this. Anyone who has surfed HN since ~2010 would know, or should have noticed, that the definition of open source has changed over the past 10-15 years. Giving back and community are the two predominant open source ideals now, with making lots of money on top of OSS code being a somewhat contentious issue, to say the least.

But I want to sidestep the idealistic issue; I think this is more of an economic one, something that could be attributed to a zero-interest-rate phenomenon. You now have developers (especially those from the US) who, for most if not all of their professional life, lived in a world where money and investment were easy, comparatively speaking, and where giving back was expected because money (or should I say cash flow) wasn't an issue. A world where $200K total comp was supposed to be the norm for a fresh grad joining Google, where management thought $500K was barely enough and they needed to work their way up to $1M, where senior developers believed that if juniors were worth $200K then asking for $1M total comp was perfectly sane, or, at some other extreme, where everyone in the company should earn exactly the same.

If Twitter or social media are any indication, a lot of those ideals are now completely gone from the conversation. Although this somehow started before the layoffs.

It is somewhat interesting to see sociological and ideological changes follow economic changes. But then again, economics is perhaps the largest field study in psychology.


> The code that was regurgitated by the model is marketed as "AI generated" and available for use for any project you want. Including proprietary ones. It's laundering open-source code. All of the decades of knowledge and uncountable hours of work is being, well, stolen. There is nothing being given back.

Leaving GitHub won't change that; OpenAI is training its models on every bit of code they can get: sourcehut, codeberg, etc. If it's public, they will train on it.

Also from my experience of trying to leave GitHub, you just end up having a couple of projects on your alternative platform, and everything else on GitHub. You are still active on GitHub, probably even more than your new alternative.

And if you want to build a community, you will quickly find out that the majority want to stick to GitHub, and leaving it can kill your project's chances of getting contributions.

Personally, if the courts decide it's fair use, that's it, I'm going back; it's the best git platform out there, gitlab doesn't even compare in free features. However, I have been eyeing Gitea and Gitea Actions; with them, Codeberg could become a realistic choice for me.

To end it, here is a hot take: I really hate Sourcehut.

It's hard to use, the UI is... not great, and trying to browse issues or the latest commits is a nightmare.

Every time a project uses it, it's a pain to deal with.


> Also from my experience of trying to leave GitHub, you just end up having a couple of projects on your alternative platform, and everything else on GitHub.

> And if you want to build a community, you will quickly find out that the majority want to stick to GitHub, and leaving it can kill your project's chances of getting contributions.

That's a defeatist attitude and a self-fulfilling prophecy at the same time. As more and more people leave GitHub (hopefully not to go to the same alternative), it becomes less and less of a must-have. The reason these things are somewhat true today is because of the network effect, and it's precisely that effect which we must actively attempt to squash by leaving.


Parent is talking about a fundamental feature of networks. A denser and larger network has much more useful network-related features, and if one company has a significant majority of the total addressable market for a network, it's a massive ask for people to extricate themselves and rebuild a network somewhere else.

It's why Facebook is still on top even though everyone hated it for a while; YouTube is the *only* video platform, etc.


But we are developers, not my grandma. We ‘know better’ but haven’t been doing enough about it.


You are overestimating how many developers care about this really.


> Leaving GitHub won't change that; OpenAI is training its models on every bit of code they can get: sourcehut, codeberg, etc. If it's public, they will train on it.

Not every bit of code, they are respecting proprietary licenses.

When MS puts the code for Windows, Office, Azure and everything else in front of ChatGPT, Copilot, whatever other AI learning model they have, then perhaps they have a leg to stand on.

Otherwise, they're just being hypocritical to claim that no injury is being done by using code for training, because they are refusing to train on any of their code.

Right now it just looks like they are ripping off open source licenses without meeting the terms of the license.

https://www.lelanthran.com/chap7/content.html


AFAIK that has nothing to do with the license, it has to do with whether the code is public. You don't want the AI accidentally revealing proprietary non-public information (e.g. imagine someone had a secret API key in a private repo and copilot leaked it; that'd be a huge incident), so you don't train it on that information, regardless of what it's licensed under.

You could make a similar argument for not training on GPL code, but it's a lot easier to programmatically determine whether or not code is public than it is to programmatically determine what it's licensed under, particularly when you're training on massive amounts of unlabeled data. Not to mention it's way easier to delete an accidentally-added snippet of GPL code from a codebase than it is to "unleak" company secrets after they've been publicly revealed.


> Not to mention it's way easier to delete an accidentally-added snippet of GPL code from a codebase than it is to "unleak" company secrets after they've been publicly revealed.

How often do you think anyone will notice that some part of a proprietary codebase is copied substantially from GPL code? I think it's going to be very rare and a lot of this code will fly under the radar. The GPL was always a kind of legal jiu-jitsu, turning copyright against itself and allowing non-commercial entities to protect themselves from uncompensated exploitation. Models like copilot, if they're legal, upend the status quo tremendously. Even though your code isn't (always) used directly, a commercial entity like Microsoft will slurp it up and sell the resulting model back to you for $9.99/mo.


> Every time a project uses it, it's a pain to deal with.

Sorry, but I consider that a plus.

One of the primary problems with GitHub right now is the "drive by" nature. Everybody is on Github because a bunch of idiotic big corporations made "community contribution" part of their annual review processes so we now have a bunch of people who shouldn't be on GitHub throwing things around on there.

Putting just a touch of friction into the comment/contribute cycle is a good thing. The people who contribute then have to want to contribute.


I like sourcehut, I'm just not a fan of the email-oriented collaboration workflow, so I don't use it. And the rest of the world isn't either, if the success of github is anything to go by. I get that Drew likes it, the greybeards are used to it, it works, it's adequate, and it keeps things simple, but I just never could do it. I don't like git either tbh, I grumble while I use it. IMO the perfect collaboration suite would be something like fossil with RSS feeds for every action.


I believe the goal is to build a minimal UI for those who don't prefer the email workflow, which is fine, but email and pull requests aren't the only models here. Look how much tooling is created to try to fit stack-based diffs atop Git+GitHub instead of using a different platform.


I'm mostly familiar with gitlab, what does github provide for free above and beyond that? I like that I can run my gitlab pipeline on my machines and sync to a free gitlab instance. I like that I don't read about security vulnerabilities in gitlab pipelines nearly as often as github actions. I like gitlab issues as they are fairly minimal.


GitHub registry, GitHub actions and GitHub Codespaces are unlimited for public repos, in addition to all enterprise features.

That's without talking about nice-to-have features like GitHub Sponsors, the For You tab, and the (arguably) more popular UI layout. It's simply a better platform for open source projects.


> gitlab doesn't even compare in free features.

What features is GitLab missing? I don't know, I'm curious.


Unlimited package registry, unlimited Actions run time, premium features unlocked, and more. Also, the free tier on GitHub gives more for private repos too: unlimited orgs, 2000 CI minutes, etc. It's just plain better, and it's because Microsoft can afford to play the long game; GitLab can't anymore.


I believe he just wants to do his bit by removing his activity from github towards lowering their dominance numbers in the space. I don't think he intends to stop those LLM code models.


This whole open source thing is the biggest farce on planet Earth. Someone with good knowledge of geeks and their behaviour concocted this open source bullshit. So now talented people give their skill to the "whole" and they have to beg for contributions and donations to get by. And other geeks (not suits with ties) finance the ones they sympathise with. It's ridiculous.

And faceless entities use their hard work for who knows what, but mostly to fatten up their already oversized corp and give back NOTHING.

And people, seemingly without common sense suck up to companies that rob them, and even disseminate their shiny new "free" tools.

This would be a Hugo/Nebula-award-winning novel if it weren't reality.


This is such a misrepresentation of the open-source landscape. Yes, there are people working on open-source projects who beg for donations; but there also are open-source projects maintained by full-time employees (Eleventy, paid by Netlify; React, paid by Facebook; Angular, paid by Google; Next.js, paid by Vercel; Linux, paid by various companies; etc.). If a person thinks that his efforts will be better compensated elsewhere, he can always start looking for a paid job.


People should not have to beg for their work, for a start. If you use it, you should pay for it. A small fee, or a fee that is set by the dev.


I use a lot of open source software and directly pay for approximately none of it.

Do you behave differently and pay for it today? What do you use to accomplish (track and manage) that?


That's the problem there are no systems in place for it. So if there is no UI for payments on github, people will never stop for a moment to think about what they take for granted.


You just described proprietary software. It's a philosophy incompatible with free software.

You aren't required to pay for a gift, which is what free software is.


Nah, nope. You usually don't get the source for proprietary software. Just like you get medical care when you are in an accident, the same should apply to work done on open source software. Somehow devs should be compensated for their work on OSS.


> So now talented people give their skill to the "whole" and they have to beg for contributions and donations to get by. And other geeks (not suits with ties) finance the ones they sympathise with. It's ridiculous.

Is it? I can't think of a single professional dev making money right now who isn't making that money, at least in part, because they did not have to reinvent the entire tech stack that they are skilled in.

If there was no open source, we'd all be making a lot less, and the state of tech would be far far smaller than it is right now.


If there were no open source, people would pay for libraries. Now we have open source, and a lot of devs are not compensated. End of story. No proper solution. That's all.

Roughly the same applies to newspapers: oh, please do not turn off advertisements so we can keep the lights on.

Digital beggars everywhere.


> If there were no open source, people would pay for libraries.

Nonsense. The cost of creating non-trivial software (say, 20+ dependencies, all needing payment) would put software out of the reach of ordinary people, meaning that there would only be a small niche of developer jobs.

Which means that most people making a non-zero income from writing software today would have been making a zero income from writing software in your hypothetical alternate universe.

There are a lot of butterfly-effect-type results as well: due to how capitalism works, the majority of people who are capable of writing software would never be able to compete. Whoever the big players are, they could simply buy them out, shut them down or even product-dump.

FOSS levels the field somewhat: FOSS is a force multiplier, in that whatever FOSS creates can be used to create more software (even non-FOSS), reducing the dependency on one or two incumbents who were lucky enough to get there first and cornered the market.

Without FOSS, we'd all be running IE6 on Windows 98, because there'd be no competition.


I think you have issues with interpreting the idea as a whole, so you cling to one sentence and base some totally out of touch assumption on that very sentence.


While this is a pretty harsh take, I can't help but to feel that articles like "Software below the poverty line" support at least aspects of it: https://staltz.com/software-below-the-poverty-line.html


I don't think open source per se was a mistake, but permissive licenses like Apache certainly were. They've just allowed businesses either to get free things to make a profit while contributing nothing back, or to literally create a business by selling the Apache-licensed programs in the cloud.


Yikes. You sound very bitter. Is there a story behind that bitterness?

There's a wide variety of people in the open source community at large. And a wide variety of motivations for contributing. I for one am happy that open source software is a thing. It's been a net good for mankind. Sure, there are abuses, and I'm sure many things could be improved. But I'm glad it's there all the same.


Nah, not at all, I just don't like power structures that feed on benevolent naivety. Of course it's good, but compensating those people should be the norm. Where is the payments side of github for example? So it's open source, but in order to clone the repo you should spend a buck on it, or if you want a bug/feature addressed you could name a price or the dev can set the price on it?

Because the holy sacred cow must not be agitated... suuuuuuuuuuuuuuuuure.

And people rationalizing the all devouring machine, hell, it is just bonkers.


> compensating those people should be the norm. Where is the payments side of github for example? [..] if you want a bug/feature addressed you could name a price or the dev can set the price on it?

Github does have monetization. It has "sponsors", and you can create a "sponsor" level that is basically "I will consult with you and prioritize bugs you choose".

It's totally normal for a developer who wants to monetize a popular open source project to offer consulting or "pay for me to work on your bug". That's already there.

... However, I would like to provide an alternative view. I am personally very happy that monetary compensation is not the norm in free software. I find joy in coding, but I find far more joy in coding when there's no money involved. When I am able to work as much or as little as I want, without feeling any form of financial obligation to others (which inescapably comes from being paid), I am happier.

If the norm was to pay or be paid in free software, I would not find joy in it. I would likely not participate.

By analogy, let's say that me and some friends get together to eat food, and each bring a meal. You might say "oh, that is a waste, the person who made a meatloaf could have sold that for money. Everyone at this meal should be paying each other for their cooking, and the person who cooked the most ends up making some money". Do you not see how that would ruin the feeling of cooking for your friends and enjoying time together?

To me, the free software community has a similar thing. Because the norm is assuming people are just trying to build stuff, not make money, it makes it a far more pleasant activity.

> And people rationalizing the all devouring machine, hell, it is just bonkers.

To me, the truly bonkers thing is people letting capitalism eat them. "You have to have your grindset, optimize your time to make money", it seems bonkers to me. People trying to rationalize their existence not by finding communities and trying to help others, but by trying to make their wealth as large as possible, often at the expense of happiness.


It should be the default. Sponsor links are nowhere to be found when you click on Code to get the clone URL, for example. It's still up to you whether you pay or slack; the code will be there, it's free, it would just weigh on your conscience. It's a safety net and a way to say thanks to the devs.

There are a lot of people who do it because they like what they do (especially in the beginning, while it's not a maintenance nightmare), but who would also like to have some side income from it; they are just too timid or shy to ask for it. So the burden should be on the service provider to provide these services and not on the developer. The dev can even opt out of it (like you) if he wants to, but I think that would be a small minority.

You pour your heart into a project, others use your project like it's a free service, but in the end nobody gives you anything for it. All you get is stars and forks, and some stats. Wow, thank you for the exploitation of your naivety.

You can buy the favorite beer, coffee, hamburger from the money flowing in, and that's your tangible reward for your efforts.


I disagree that compensating FOSS contributors should be the norm. That would open people up to a huge can of worms (labour laws, liabilities, etc).

> Because the holy sacred cow must not be agitated... suuuuuuuuuuuuuuuuure.

I have no idea what you're talking about.


Yup.

FWIW, I keep thinking about some kind of dual licensing, FOSS and something-something-royalties. (Sorry, IANAL, so haven't gotten any further.)


I tend to disregard articles that default to the "Stochastic Parrot" argument. These tools are useful now, I don't personally care about achieving actual intelligence. I want additional utility for myself and other humans, which these provide now, at scale.

By a lot of measures many humans perform at just about the same level, including confidently making up bullshit.

This post reads like one of the "Goodbye X online video game" posts. I'll cut them some slack because this is their own blog they're venting on, and it was likely posted here by someone else rather than them doing some attention seeking, but meh.


Being useful and being a stochastic parrot are not mutually exclusive. And I in fact think the opposite: it's necessary to remind people what it really is, especially in this phase of "enthusiasm", because I see too many people attributing some meaning, some hidden insight, and especially some innate infallibility to AI nowadays, maybe confused by the name "AI".


Right, but most arguments, including the one here, go something like "AI is a Stochastic Parrot so it's a lie and now I think it's bad and we shouldn't do it."

Which is a pretty dumb position imo. Not that I personally think these newer LLMs are a stochastic parrot, or at least not to the degree proponents of the Stochastic Parrot argument would have you believe.


The argument of the Stochastic Parrot is not that we shouldn't do it. It's just "Do not attribute any meaning to it". The one in this article is in the same vein: the author thought the AI was learning to program, when in reality it was just repeating the most statistically probable combination of the code it had seen. That is, "correctness" is not part of the "considerations" that the model makes. If the majority of the code that it has scanned contains a particular form of logic bug, it will suggest the same logic bug. The trap is in the fact that the AI will write perfect syntax, because that is its bread and butter, and people seeing this perfect syntax attribute perfect logic to it as well. As long as people are aware of this kind of problem, LLMs are a very useful tool that will save a lot of time. But if applied blindly "because AI knows best", they will create more problems down the road.


That's not what I said the Stochastic Parrot is arguing, I said people use the Stochastic Parrot argument to justify not doing it.


It’s pretty ironic that the argument that LLMs are stochastic parrots actually sounds like the actual parrot.

I think we're way past that now, with LLMs quickly taking on the role of a general reasoning engine.


> quickly taking on the role of a general reasoning engine

And this right here is why it's important to emphasize the "stochastic parrot" fact. Because people think this is true and are making decisions based on this misunderstanding.


Or maybe they just disagree with you?


Who? See https://dl.acm.org/doi/pdf/10.1145/3442188.3445922, a 2021 research paper warning (among other things) precisely about this confusion.

> The ersatz fluency and coherence of LMs raises several risks, precisely because humans are prepared to interpret strings belonging to languages they speak as meaningful and corresponding to the communicative intent of some individual or group of individuals who have accountability for what is said

Is there any Researcher who maintains that LLM models contain Reasoning and intent?

Those who are working on these models are not confused; they know what they are. The public is confused.


How is that different from "This evidence for X raises the risk that people falsely believe that X"? That's an argument for X, not against. And nothing in that paper, even if I discard the dross (ie. everything except one section on page 7), seems to actually make an argument against X of any strength beyond "it is wrong because it is wrong".

My point is this: I disagree with you. This is not because I have "misunderstood" something; it is because I understand the stochastic-parrot argument and think it is erroneous. And the more you talk about "the risk that people will come to falsely believe" rather than actual arguments, the less convincing you sound. This paternalistic tendency is a curse on science and debate in general.


> it is because I understand the stochastic-parrot argument and think it is erroneous.

Okay then, what exactly about it is erroneous? Because stochastically sorting the set M of known tokens by likelihood of being the next one is literally what LLMs do.
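
To make that concrete, here is a minimal, purely illustrative sketch of that sampling step (the vocabulary, logits and probabilities are invented for the example; a real model works over tens of thousands of tokens and billions of parameters):

    import math, random

    def softmax(scores):
        # Turn raw scores (logits) into a probability distribution.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        return [e / total for e in exps]

    # Toy vocabulary and invented logits for some context like "Mike leaves the elevator ..."
    vocab = ["first", "last", "Moon", "garglsnarp"]
    probs = softmax([4.0, 2.5, -1.0, -6.0])

    # "Sorting known tokens by likelihood": every token gets *some* probability,
    # none of them is marked true or false.
    print(sorted(zip(vocab, probs), key=lambda p: -p[1]))

    # Sampling picks the next token in proportion to probability, not by checking facts.
    print(random.choices(vocab, weights=probs, k=1)[0])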


There's a class of statements that can be either interpreted precisely, at which point the claim they make is clearly true but trivial, or interpreted expansively, at which point the claim is significant but no longer clearly true.

This is one of those: yes, technically LLMs are token predictors, but technically any nondeterministic Turing machine is a token predictor. The human brain could be viewed as a token predictor [1]. The interesting question is how it comes up with its predictions, and on this the phrase offers no insight at all.

[1] https://en.wikipedia.org/wiki/Predictive_coding


> The human brain could be viewed as a token predictor

No, it really couldn't, because "generating and updating a 'mental model' of the environment" is as different from predicting the next token in a sequence as a bee's dance is from a structured human language.

The mental model we build and update is not just based on a linear stream, but many parallel and even contradictory sensory inputs that we make sense of not as abstract data points, but as experiences in a world of which we are part of. We also have a pre-existing model summarizing our experience in the world, including their degradation, our agency in that world, and our intentionality in that world.

The simple fact that we don't just complete streams, but do so with goals, both immediate and long term, and fit our actions into these goals, in itself already shows how far a human's mental modeling is from the linear action of a language model.


But the human mental model is purely internal. For that matter, there is strong evidence that LLMs generate mental models internally. [1] Our interface to motor actions is not dissimilar to a token predictor.

> The mental model we build and update is not just based on a linear stream, but many parallel and even contradictory sensory inputs

So just like multimodal language models, for instance GPT-4?

> as experiences in a world of which we are part of.

> The simple fact that we don't just complete streams, but do so with goals, both immediate and long term, and fit our actions into these goals

Unfalsifiable! GPT-4 can talk about its experiences all day long. What's more, GPT-4 can act agentic if prompted correctly. [2] How do you qualify a "real goal"?

[1] https://www.neelnanda.io/mechanistic-interpretability/othell...

[2] https://github.com/hwchase17/langchain


> For that matter, there is strong evidence that LLMs generate mental models internally.

Limited models, such as those representing the state of a game that it was trained to do: Yes. This is how we hope deep learning systems work in general.

But I am not talking about limited models. I am talking about ad-hoc models, built from ingesting the context and semantic meaning of a string of tokens, that can simulate reality and allows drawing logical conclusions from it.

In regard to my example given elsewhere in this HN thread: I know that Mike exits the elevator first because I build a mental model of what the tokens in the question represent. I can draw conclusions from that model, including new conclusions whose token-representation would be unlikely in the LLM's model, which doesn't explain anything about reality, but explains how tokens are usually ordered in the training set.


The relevant keyword you want is "zero-shot learning". (EDIT: Correction; "in-context learning". Sorry for that.) LLMs can pick up patterns from the context window purely at evaluation time using dynamic reinforcement learning. (This is one of those capabilities models seem to just pick up naturally at sufficient scale.) Those patterns are ephemeral and not persisted to memory, which I agree makes LLMs less general than humans, but that seems a weak objection to hang a fundamental difference in kind on.

edit: Correction: I can't find a source for my claim that the model specifically picks up reinforcement learning across its context as the algo that it uses to do ICL. I could have sworn I read that somewhere. Will edit a source in if I find it.

edit: Though I did find this very cool paper https://arxiv.org/abs/2210.05675 that shows that it's specifically training on language that makes LLMs try to work out abstract rules for in-context learning.

edit: https://arxiv.org/abs/2303.07971 isn't the paper I meant, since it only came out recently, but it has a good index of related literature and does a very clear analysis of ICL, demonstrating that models don't just learn rules at runtime but learn "extract structure from context and complete the pattern" as a composable meta-rule.

edit: I think I was thinking of https://arxiv.org/abs/2212.10559 , which asserts that ICL acts equivalent to gradient descent.

> In regard to my example given elsewhere in this HN thread: I know that Mike exits the elevator first because I build a mental model of what the tokens in the question represent. I can draw conclusions from that model, including new conclusions whose token-representation would be unlikely in the LLM's model, which doesn't explain anything about reality, but explains how tokens are usually ordered in the training set.

I mean. Nobody has unmediated access to reality. The LLM doesn't, but neither do you.

In the hypothetical, the token in your brain that represents "Mike" is ultimately built from photons hitting your retina, which is not a fundamentally different thing from text tokens. Text tokens are "more abstracted", sure, but every model a general intelligence builds is abstraction based on circumstantial evidence. Doesn't matter if it's human or LLM, we spend our lives in Plato's cave all the same.


> In the hypothetical, the token in your brain that represents "Mike"

Mike isn't represented by a token. "Mike" is a word I interpret into an abstract meaning in an ad-hoc created, and later updated or discarded model of a situation in which exist only the elevator, some abstract structure around it, and the laws of physics as I know them from knowledge and experience.

> built from photons hitting your retina, which is not a fundamentally different thing from text tokens.

The difference is not in how sensory input is gathered. The difference is in what that input represents. For the LLM the token represents...the token. That's it. There is nothing else. The token exists for its own sake, and has no information other than itself. It isn't something from which an abstract concept is built, it IS the concept.

As a consequence, a language model doesn't understand whether statements are false or nonsensical. It can say that a sequence is statistically less likely than another one, but that's it.

"Jenny leaves first" is less likely than "Mike leaves first".

But "Jenny leaves first" is probably more likely than "Mario stands on the Moon", which is more likely than "catfood dog parachute chimney cloud" which is more likely than "blob garglsnarp foobar tchoo tchoo", which in turn is probably more likely than "fdsba254hj m562534%($&)5623%$ 6zn 5)&/(6z3m z6%3w zhbu2563n z56".

To someone reaching the conclusion that Mike left the elevator first by drawing that conclusion from an abstract representation of the world, all these statements are equally wrong. To a language model, they are just points along a statistical gradient. So in a language model's world a wrong statement can still somehow be "less wrong" than another wrong statement.

---

Bear in mind when I say all this, I don't mean to say (and I think I made that clear elsewhere in the thread) that this mimicry of reasoning isn't useful. It is, tremendously so. But I think it's valuable to research and understand the difference between mimicking reason by learning how tokens form reasonable sequences, and actual reasoning by abstracting the world into models that we can draw conclusions from.

Not in the least because I believe that this will be a key element in developing things closer to AGIs than the tools we have now.


> an ad-hoc created, and later updated or discarded model of a situation in which exist only the elevator, some abstract structure around it, and the laws of physics as I know them from knowledge and experience.

LLMs can do all of this. In fact, multimodality specifically can be shown to improve their physical intuition.

> The difference is not in how sensory input is gathered. The difference is in what that input represents. For the LLM the token represents...the token. That's it. There is nothing else. The token exists for its own sake, and has no information other than itself. It isn't something from which an abstract concept is built, it IS the concept.

The token has structure. The photons have structure. We conjecture that the photons represent real objects. The LLM conjectures (via reinforcement learning) that the tokens represent real objects. It's the exact same concept.

> As a consequence, a language model doesn't understand whether statements are false or nonsensical.

Neither do humans, we just error out at higher complexities. No human has access to the platonic truth of statements.

> So in a language model's world a wrong statement can still somehow be "less wrong" than another wrong statement.

Of course, but so with humans? I have no idea what you're trying to say here. As with humans, in a LLM token improbability can derive from lots of different reasons, including world model violation, in-context rule violation, prior improbability and grammatical nonsense. In fact, their probability calibration is famously perfect, until RLHF ruins it. :)

> Bear in mind when I say all this, I don't mean to say (and I think I made that clear elsewhere in the thread) that this mimicry of reasoning isn't useful.

I fundamentally do not believe there is such a thing as "mimicry of reason". There is only reason, done more or less well. To me, it's like saying that a pocket calculator merely "mimics math" or, as the quote goes, whether a submarine "mimics swimming". Reason is a system of rules. Rules cannot be "applied fake"; they can only be computed. If the computation is correct, the medium or mechanism are irrelevant.

To quote gwern, if you'll allow me the snark:

> We should pause to note that a Clippy² still doesn’t really think or plan. It’s not really conscious. It is just an unfathomably vast pile of numbers produced by mindless optimization starting from a small seed program that could be written on a few pages. It has no qualia, no intentionality, no true self-awareness, no grounding in a rich multimodal real-world process of cognitive development yielding detailed representations and powerful causal models of reality; it cannot ‘want’ anything beyond maximizing a mechanical reward score, which does not come close to capturing the rich flexibility of human desires, or historical Eurocentric contingency of such conceptualizations, which are, at root, problematically Cartesian. When it ‘plans’, it would be more accurate to say it fake-plans; when it ‘learns’, it fake-learns; when it ‘thinks’, it is just interpolating between memorized data points in a high-dimensional space, and any interpretation of such fake-thoughts as real thoughts is highly misleading; when it takes ‘actions’, they are fake-actions optimizing a fake-learned fake-world, and are not real actions, any more than the people in a simulated rainstorm really get wet, rather than fake-wet. (The deaths, however, are real.)


> I fundamentally do not believe there is such a thing as "mimicry of reason". There is only reason, done more or less well.

    if transaction.amount > MAX_TRANSACTION_VOLUME:
        transaction.reject()
    else:
        transaction.allow()
Is this code reasoning? It does, after all, take input and make a decision that is dependent on some context, the transaction's amount. It even has a model of the world, albeit a very primitive one.

No, of course it isn't. But it mimics the ability to do the very simple reasoning about whether or not to allow a transaction, to the point where it could be useful in real applications.

So yes, there is mimicry of reasoning, and it comes in all scales and levels of competence, from simple decision making algorithms, purely mechanical contraptions such as overpressure-valves, all the way up to highly sophisticated ones that use stochastic analysis of sequence probabilities to show the astonishing skills we see in LLMs.


I feel this is mostly going to come down to how we define the word. I suspect we agree that there's no point in differentiating "reasoning" from "mimicked reasoning" if the performed actions are identical in every situation.

So let's ask differently: what concrete problem do you think LLMs cannot solve?


> what concrete problem do you think LLMs cannot solve?

From the top of my head:

Drawing novel solutions from existing scientific data, for one. Extracting information from incomplete data that is only apparent through reasoning (such as my code-bug example given elsewhere in this thread), a.k.a. assuming hidden factors. Complex math is still beyond them, and predictive analysis requiring inference is an issue.

They also still face the problem of, as has been anthropomorphized so well, "fantasizing", especially during longer conversations; which is cute when they pretend that footballs fit in coffee cups, but not so cute when things like this happen:

https://eu.usatoday.com/story/opinion/columnist/2023/04/03/c...

--

These certainly don't matter for the things I am using them for, of course, and so far, they turn out to be tremendously useful tools.

The trouble, however, is not with the problems I know they cannot, or cannot reliably, solve. The problem is with as-of-yet unknown problems where humans, me included, might assume they can solve them, and suddenly it turns out they can't. What these problems are, time will tell. So far we have barely scratched the surface of introducing LLMs into our tech products. So I think it's valuable to keep in mind that there is, in fact, a difference between actually reasoning and mimicking it, even if the mimicry is to a high standard. If for nothing else, then only to remind us to be careful in how, and for what, we use them.


I mean, do you think a LLM cannot draw a novel solution from existing data, fundamentally, because its reasoning is "of the wrong kind"? That seems potentially disprovable. - Or do you just think current products can't do it? I'd agree with that.

What's the easiest novel scientific solution that AI couldn't find if it wasn't in its training set?


> because its reasoning is "of the wrong kind"?

No, because it doesn't reason, period. Stochastic analysis of sequence probabilities != reasoning. I explained my thoughts on the matter in this thread to quite some extent.

> That seems potentially disprovable.

You're welcome to try and disprove it. As for prior research on the matter:

https://www.cnet.com/science/meta-trained-an-ai-on-48-millio...

And afaik, Galactica wasn't even intended to do novel research; it was only intended for the time-consuming but comparably easier tasks of helping to summarize existing scientific data, ask questions about it in natural language, and write "scientific code".


Alright, I'll keep an eye open for instances of networks doing scientific reasoning.

(My own belief is that reasoning is 95% habit and 5% randomness, and that networks don't do it because it hasn't been reflected in their training sets, and they can't acquire the skills because they can't acquire any skills not in the training set.)


Well what you are saying is that the argument of the creators of the model is erroneous and you know better. More power to you.


Bad argument is bad argument, doesn't matter who says it.


> Is there any Researcher who maintains that LLM models contain Reasoning and intent?

Researchers with jobs that depend on promulgating this belief. E.g., "Open"AI employees.


>Is there any Researcher who maintains that LLM models contain Reasoning and intent?

That's the funny thing about the (in)famous OpenAI letter; the first sentence kind of does this:

>AI systems with human-competitive intelligence can pose profound risks to society and humanity, as shown by extensive research[1]

'human-competitive intelligence' sounds like reasoning to me. What's even funnier is that [1] is the stochastic parrot paper, which argues exactly the opposite!


> 'human-competitive intelligence' sounds like reasoning to me.

Yes, and when AIs reach that level of intelligence, we can revisit the question.

However, as long as LLMs will confidently try to explain to me why several footballs fit in an average coffee mug, I'd say we are still quite some way away from "human-competitive intelligence".


Yes, I fully agree with you! There's no reasoning or intelligence in modern LLMs, but the OpenAI open letter and recent comms strongly imply there is.


LLMs cannot be considered general reasoning engines, because they do not perform any kind of reasoning.


Since ChatGPT I've become much more aware of my own thoughts and written text. I'm now often wondering whether I'm just regurgitating the most frequently used next word or phrase, or whether it could actually be described as original. Especially for things like reacting with short answers to chat messages, I am confident that these are only reflexive answers without alternatives, which could have come from ChatGPT trained on my chat log. I feel like knowing and seeing how ChatGPT works can elevate our own thinking process. Or maybe it is only similar to awareness meditation.


In this vein, ChatGPT is a nice way to start thinking about any topic; ask it about its opinion on anything and it will give you the most run-of-the-mill, middle-of-the-road text that is possible, standing for nothing. If you find yourself agreeing with ChatGPT it may be time to reconsider your own thinking!


LLMs may be an approximation of our knowledge, but understand that there’s more to reasoning than a language model. GPT understands how tokens relate to each other, but humans understand how objects, places, and abstract concepts relate to each other as well. We have a little further to go for AGI.

I do believe that GPT4 is a really good approximation of our language though, and feel similarly to you when I respond off the cuff.


Now that is a thought that could not possibly have come from GPT. Thanks, this has given me something to think about.


Logical reasoning is the one thing LLMs are really bad at.


LLMs suck at deduction and factual reasoning. How are they reasoning engines?


> I think we're way past that now, with LLMs quickly taking on the role of a general reasoning engine.

No we're not, and no they are not.

An LLM doesn't reason, period. It mimics reasoning ability by stochastically choosing a sequence of tokens. A lot of the time these make sense. At other times, they don't make any sense. I recently asked an LLM:

    "Mike leaves the elevator at the 2nd floor. Jenny leaves at the 9th floor. Who left the elevator first?"
It answered correctly that Mike leaves first. Then I asked:

    "If the elevator started at the 10th floor, who would have left first?"
And the answer was that Mike still leaves first, because he leaves at the 2nd floor, and that's the first floor the elevator reaches. Another time I asked an LLM how many footballs fit in a coffee mug, and the conversation reached a point where the AI tried to convince me that coffee mugs are only slightly smaller than the trunk of a car.

Yes, they can also produce the correct answers to both these questions, but the fact that they can also spew such complete illogical nonsense shows that they are not "reasoning" about things. They complete sequences, that's it, period, that's literally the only thing a language model can do.

Their apparent emergent abilities look like reasoning, in the same way that Jen from "The IT Crowd" can sound like she's speaking Italian, when in fact she has no idea what she is even saying.


> but the fact that they can also spew such complete illogical nonsense shows that they are not "reasoning" about things

Have you ever seen the proof that 2=1 ? It looks convincing, but it's illogical because it has a subtle flaw. Are the people who can't spot the flaw just "looking like they are reasoning", but really they just lack the ability to reason? Are witnesses who unintentionally make up memories in court cases lacking reasoning? Are children lacking reasoning when you ask them why they drew all over the walls and they make up BS?

You can't just spout that an LLM lacks reasoning without first strictly defining what it means to reason. Everybody keeps going on and on about how an LLM can't possibly be intelligent/reasoning/thinking/sentient etc. All of these are extremely vague and fuzzy words that have no unambiguous definition. Until we can come up with hard metrics that define these terms, nobody is correct when they spout their own nonsense that somehow proves the LLM doesn't fit into their specific definition of fill in the blank.


> Are the people who can't spot the flaw just "looking like they are reasoning", but really they just lack the ability to reason?

Lacking relevant information or insight into a topic, isn't the same as lacking the ability to reason.

> You can't just spout that an LLM lacks reasoning without first strictly defining what it means to reason.

Perfectly worded definition available on Wikipedia:

    Reason is the capacity of consciously applying logic by drawing conclusions from new or existing information, with the aim of seeking the truth.
"Consciously", "logic", and "seeking the truth" are the operative terms here. A sequence predictor does none of that. Looking at my above example: The sequence "Mike leaves the elevator first" isn't based on logical thought, or a conscious abstraction of the world built from ingesting the question. It's based on the fact that this sequence has statistically a higher chance to appear after the sequence representing the question.

How does our reasoning work? How do humans answer such a question? By building an abstract representation of the world based on the meaning of the words in the question. We can imagine Mike and Jenny in that elevator, we can imagine the elevator moving, floor numbers have meaning in that environment, and we understand what "something is higher up" means. From all this we build a model and draw conclusions.

How does the "reasoning" in the LLM work? It checks which tokens are likely to appear after another sequence of tokens. It does so by having learned how we like to build sequences of tokens in our language. That's it. There is no modeling of the situation going on, just stochastic analysis of a sequence.

Consequently, an LLM cannot "seek truth" either. If a sequence has a high chance of appearing in a position, it doesn't matter if it is factually true or not, or even logically sound. The model isn't trained on "true or false". It will, more often than not, say things that are true, not because it understands truth, but because the training data contain a lot of token sequences that, when interpreted by a human mind, state true things.

Lastly, imagine trying to apply a language model to an area that depends completely on the above definition of reasoning as a consequence of modeling the world based on observations and drawing new conclusions from that modeling.

https://www.spiceworks.com/tech/artificial-intelligence/news...


You must have missed the part where I said:

> Until we can come up with hard metrics that define these terms, nobody is correct when they spout their own nonsense that somehow proves the LLM doesn't fit into their specific definition of fill in the blank.

"Consciously", "logic", and "seeking the truth" are not objectively verifiable metrics of any kind.

I'll repeat what I said: Until we come up with hard metrics that define these terms, nobody can be correct. I'll take investopedia's definition for what a metric means, as that embodies the idea I was getting at the most succinctly:

> Metrics are measures of quantitative assessment commonly used for assessing, comparing, and tracking performance or production.[0]

So, until we can quantitatively assess how an LLM performs compared to a human in "consciousness", "logic", and "seeking the truth", whatever ambiguous definition you throw out there will not confirm or deny whether an LLM embodies these traits as opposed to a human embodying these traits.

[0]: https://www.investopedia.com/terms/m/metrics.asp


To elaborate a bit on my own post here:

The sequence "Mike leaves the elevator first" has a high statistical probability. The sequence "Jenny leaves the elevator first" has a lower probability than that. But it still probably has a much higher probability than "Michael is standing on the Moon", which in turn may be more likely than "Car dogfood sunshine Javascript", which is still probably more likely than "snglub dugzuvutz gummmbr ha tcha ding dong".

Note that none of these sequences are wrong in the world of a language model. They are just increasingly unlikely to occur in that position. To us, with our ability to reason by logically drawing conclusions from an abstract internal model of the world, all these other sequences either represent false statements or nonsensical word salad.
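
As a purely illustrative sketch of that gradient (the per-token probabilities here are invented stand-ins for whatever a real model would assign), scoring candidate continuations looks roughly like this; the score only orders the sequences from likely to unlikely, it never marks any of them true or false:

    import math

    # Invented per-token probabilities for each candidate continuation.
    candidates = {
        "Mike leaves the elevator first":  [0.9, 0.8, 0.9, 0.95, 0.7],
        "Jenny leaves the elevator first": [0.05, 0.8, 0.9, 0.95, 0.7],
        "Michael is standing on the Moon": [0.02, 0.1, 0.2, 0.3, 0.01],
        "catfood dog parachute chimney":   [0.001, 0.002, 0.001, 0.001],
        "snglub dugzuvutz gummmbr":        [0.0001, 0.0001, 0.0001],
    }

    def sequence_log_prob(token_probs):
        # Sequence score = sum of log token probabilities (higher = more likely).
        return sum(math.log(p) for p in token_probs)

    for text, probs in sorted(candidates.items(), key=lambda kv: -sequence_log_prob(kv[1])):
        print(f"{sequence_log_prob(probs):9.2f}  {text}")
    # The output is only an ordering from likely to unlikely; "wrong" and
    # "nonsense" are not categories the model has.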


GPT-4 reasons about a lot of gotcha logic puzzles correctly; any pre-GPT-4 opinions should be reconsidered. Those are effectively two different epochs in the history of AI effectiveness and reasoning.

> Me: Mike leaves the elevator at the 2nd floor. Jenny leaves at the 9th floor. Who left the elevator first?

> GPT-4: Mike left the elevator first, as he got off at the 2nd floor, while Jenny left at the 9th floor.

> Me: If the elevator started at the 10th floor, who would have left first?

> GPT-4: If the elevator started at the 10th floor and went downward, then Jenny would have left first, as she got off at the 9th floor, while Mike left at the 2nd floor.

> Me: How many footballs fit in a coffe-mug?

> GPT-4: A standard football (soccer ball) has a diameter of around 22 centimeters (8.65 inches), while a coffee mug is typically much smaller, with a diameter of around 8-10 centimeters (3-4 inches). Therefore, it is not possible to fit a standard football inside a coffee mug. If you were to use a mini football or a much larger mug, the number of footballs that could fit would depend on the specific sizes of the footballs and the mug.

It easily answered all of your questions and produced explanations I would expect most reasonable people to make.


That changes exactly nothing about the validity of my statement.

Yes, GPT-4 is better at this mimicry than GPT-3 or GPT-3.5. And GPT-3 was better at it than GPT-2. And all of them were better than my just-for-fun home-built language model projects that I trained on small <10GiB datasets, which in turn were better at it than my PoC models trained on just a few thousand words.

But being better at mimicking reason is still not reasoning. The model doesn't know what a coffee mug is, and it doesn't know what a football is. It also has no idea how elevators work. It can form sequences that make it look to us like it does and knows all these things, but in reality, it only knows that "then Jenny would have left first" is a more likely sequence of tokens at that point, given that the sequence before included "started at the 10th floor".

Bear in mind, this doesn't mean that this mimicry isn't useful. It is, tremendously so. I don't care how I get correct answers, I only care that I do.


> But being better at mimicking reason is still not reasoning

How do I know people are not using a similar process when they perform "reasoning" but with a way more elaborate model?

Can you prove to me that the two are inherently different in the type of output they produce, regardless of how large an ML model is or can be?

Because if you can't, and they produce the same type of output, the processing could be similar enough to be considered reasoning.


> but with a way more elaborate model?

Simple: I know that humans have intentionality and agency. They want things, they have goals both immediate and long term. Their replies are based not just on the context of their experiences and the conversation but their emotional and physical state, and the applicability of their reply to their goals.

And they are capable of coming up with reasoning about topics for which they have no prior information, by applying reasonable similarities. Example: even if someone has never heard the phrase "walking a mile in someone else's shoes", most humans (provided they speak English) have no difficulty figuring out what it means. They also have no trouble figuring out that it is a figure of speech, and not a literal action.


>Simple: I know that humans have intentionality and agency. They want things, they have goals both immediate and long term. Their replies are based not just on the context of their experiences and the conversation but their emotional and physical state, and the applicability of their reply to their goals.

This all seems orthogonal to reasoning, but also who is to say that somewhere in those billions of parameters there isn't something like a model of goals and emotional state? I mean, I seriously doubt it, but I also don't think I could evidence that.


> but also who is to say that somewhere in those billions of parameters there isn't something like a model of goals and emotional state?

No one, but as is well established, absence of proof of nonexistence isn't an argument for existence. https://en.wikipedia.org/wiki/Russell's_teapot


Correct, but the problem is that the way you prove that for humans is by using the output and inferring it. You can apply the same criteria to ML models. If you don't, you need some other criteria to rule out that assumption for ML models.


For humans I can simply refer to my own internal state and look at how I arrive at conclusions.

I am of course aware that this is essentially a form of ipse dixit, but I will do it anyway in this case, because I am saying it as a human, about humans, and to other humans, and so the audience can just try it for themselves.


> I know that humans have intentionality and agency.

You assume that. You can only maybe know that about yourself. But my question was a bit different. How do you know that the ML model doesn't?

> about topics for which they have no prior information, by applying reasonable similarities.

This is a contradiction. If you have no prior information about a topic you can't know even what topic is similar.

> Even if someone has never heard the phrase "walking a mile in someone else's shoes".

Same for ML models. They don't have a representation of every possible prompt.


> You assume that. You can only maybe know that about yourself.

I can also only say with certainty that planetary gravity is an attracting force on the very spot I am standing on. I haven't visited every spot on every planet in the universe after all.

That doesn't make it any more likely that my extrapolation of how gravity works here is wrong somewhere else. Russell's Teapot works both ways.

> How do you know that the ML model doesn't?

For the same reason why I know that a hammer or an operating system don't. I know how they work. Not in the most minute details, and of course the actual model is essentially a black box, but its architecture and MO are not.

It completes sequences. That is all it does. It has no semantic understanding of the things these sequences represent. It has no understanding of true or false. It doesn't know math, it doesn't know who person xyz is, it doesn't know that 1993 already happened and 2221 did not. It cannot have abstract concepts of the things represented by the sequences, because the sequences are the things in its world.

It knows that a sequence is more or less likely to follow another sequence. That's it.

From that limited knowledge, however, it can very successfully mimic things like math, logic, and even reasoning to an extent. And it can mimic them well enough to be useful in a lot of areas.

But that mimicry, however useful, is still mimicry. It's still the Chinese Room thought experiment.


I agree about the utility part.

However, I don't really accept the idea that this isn't reasoning, but I'm not entirely sold either way.

I'd say if it mimics something well enough then eventually it's just doing the thing, which is the same side of the argument I fall on with Searle's Chinese Room Argument. If you can't discern a difference, is there a difference?

So far GPT-4 can produce better work than maybe 50% of humans, and better responses to brain-teaser questions than most of them too. I'm at least in a bubble, so I don't run into people that stupid very often, which makes it easier for me to see the gaps.


> I'd say if it mimics something well enough then eventually it's just doing the thing

Right up to the point where it actually needs to reason, and the mimicry doesn't suffice.

My above example about the football and the coffee mug is an easy one; the objects are well represented in its training data. What if I need a reason why the service ping spikes every 60 seconds: here is the code, please LLM, look it up. I am sure I will get a great and well-written answer.

I am also sure it won't be the correct one, which is that some dumb script I wrote, which has nothing to do with the code shown, blocks the server for about 700ms every minute.

Figuring out that something cannot be explained with the data represented, and thus may come from a source unseen, is one example of actual reasoning. And this "giving up on the data shown" is something I have yet to see any AI do.


I could say the same about most second-rate software engineers. That's why I'm not moved by your arguments. There are plenty of people who are just as stupid and who will give you confidently wrong answers.


I like that idea a lot. "General reasoning engine" sounds much more sensible than "AGI", especially given that the "I" in AGI is so poorly defined.


> These tools are useful now

Guns are also useful tools because you can take them into a store and get things for free as a result. But that doesn't make it okay to do.


Robbing people does not benefit anyone but yourself. This is a terrible analogy.


I can take the things I stole and sell them to other people, which is exactly what Github is doing here.


I think AI is here to stay (obviously) but we do need a much better permission model regarding content, whether this is the writing on your blog, your digital art, your open source code, video, audio...all of it.

The current model basically says that as soon as you publish something, others can pretty much do with it as they please under the guise of "fair use", an aggressive ToS, and the like.

I stand by the author that the current model is parasitic. You take the sum of human-produced labor, knowledge and intelligence without permission or compensation, centralize it with tech that only about two companies have or can afford, and then monetize it. Worse, in a way that never even attributes or refers to the original content.

Half-quitting Github will not do anything, instead we need legal reform in this age of AI.

We need training permission control as none of today's licenses were designed with AI in mind. The default should be no permission where authors can opt-in per account and/or per piece of content. No content platform's ToS should be able to override this permission with a catch-all clause, it should be truly free consent.

Ideally, we'd include monetization options where conditional consent is given based on revenue sharing. I realize that this is a less practical idea as there's still no simple internet payment infrastructure, AI companies likely will have enough non-paid content to train, plus it doesn't solve the problem of them having deep pockets to afford such content, thus they keep their centralization benefits. The more likely outcome is that content producers increasingly withdraw into closed paid platforms as the open web is just too damn hostile.

I find none of this to be anti-AI, it's pro-human and pro-creator.


An important legislative step for this is that anyone creating and publishing an AI learning model needs to be able to cite their sources - in this case, a list of all the github repositories and files therein, along with their licenses.

If that is made mandatory, only then can these lists actually be checked against licenses.

There will also need to be a trial license, to establish whether an AI learning model can be considered derived from a licensed open source project - and therefore whether it falls under the license.

And finally, we'll likely get updated versions of the various OSS licenses that include a specific statement on e.g. usage within AI / machine learning.


In the age of reposts and generative AI, "attribution" is irrelevant. Nobody cares who originally made some content, and it truly doesn't matter.

>The more likely outcome is that content producers increasingly withdraw into closed paid platforms

Nah. You didn't get paid to write that post, did you? You did it for free. People nowadays are perfectly willing to create free content, and often high quality content, sometimes anonymously, even before generative AI.

There's no need for financial incentives anymore. As content creation becomes easier, people will start creating out of intrinsic motivation - to express themselves, to influence others and to inform. It's better that way.

Restricting content so that others can't benefit from it is not pro-human or pro-creator, it's selfish and wasteful. We should get rid of licenses altogether and feed everything humanity creates into a common AI model that is available for use by everyone.


I maintain a popular OSS project whose code is hosted on GitHub [1].

The entire "Github doesn't give back" argument is wrong. For "free", Github lets me host our code, run thousands and thousands of hours of free CI (and we are aggressively using it), host releases and docker images, and lets us manage thousands of issues. Also, Copilot is free when you are eligible to it, so we are fortunate enough to not have to pay for it as well.

Yes, they monetize our attention and train Copilot with the code, but the only argument which can't be used against this company is that they don't give back.

[1]: https://github.com/monicahq/monica


Why hasn't someone just changed the GPL license already:

"If you train an AI on this code, you must release the source code and generated neural net of that AI as open source" or something to that effect.

It won't stop it, but it will slow it down, and it seems like the right T&Cs to put on training against GPL code because it gives an advantage to open source AIs, however minor.


Implying the "AI" community respects intellectual property rights.


Also implying software companies do respect open source licenses


I'm not sure why you're being downvoted. Many companies ignore open source licenses because it's difficult to prove they aren't following them.


Aren't they claiming that it's fair use? IANAL, but wouldn't that make the licence irrelevant if training AI/ML models was found to be fair use? And if not, it's a licence violation anyway?


It will be difficult to claim fair use if training an AI model is explicitly mentioned in the license, I think.

Currently GPL says:

> To "modify" a work means to copy from or adapt all or part of the work in a fashion requiring copyright permission, other than the making of an exact copy. The resulting work is called a "modified version" of the earlier work or a work "based on" the earlier work.

> A "covered work" means either the unmodified Program or a work based

If in addition it said something like "Generative AI models trained on the program source code, as well as the text produced with such models, are also a work "based on" the Program", then there would be little room for a fair use claim, I think.


> It will be difficult to claim fair use if training an AI model is explicitly mentioned in the license, I think

Fair use is a statutory right (codifying what courts had previously found to be an aspect of Constitutional free expression rights limiting the copyright power) that limits the exclusive rights of copyright owners. It can't be reduced in scope by license terms, because it deals with what the owner has no right to control in the first place.

(You may be confusing “fair use” with “implied license”, and, yes, explicit license terms would be a powerful argument against an implied license argument.)


I don't mean "implied license".

My impression is that they claim fair use only because there is no other ground for ML training on GPL code. The license only describes other uses of the code, so the first question is on what grounds OpenAI uses the code at all. And the only thing OpenAI comes up with is fair use.

ML training does not fall under free expression or similar basic rights, I think. It's only because it seems too restrictive to forbid scanning code that is publicly available for people to read, and because the license itself does not mention such use, that fair use may be considered as some kind of justification.

I, of course, may be mistaken, especially since I haven't studied this subject deeply.

Also, for authors who are against ML training on their GPL code, it's better to prove it violates even the current GPL version, rather than just introduce a modification to the license, because a new license will only cover new code, while old versions are already available under the old license.

On the other hand, such an amendment to the license will not be harmful, even if later proven to be useless. And it will clearly state the copyright holder's position regarding the ML training, instead of some subtle reasoning we should apply currently.

Some people may want an opposite amendment to their license - ML training does not produce a "work based on the Program" and can be freely performed by anyone.


To me, that's like saying there would be little room for a fair use claim for news reporting/parody/[insert legitimate fair use here] if the licence expressly forbids it. IANAL though.


IANAL either, but the license still applies to the end-user (the person who trained the AI) so it would seem like it would add at least 1 non-trivial license violation for that user?

Edit: I googled "fair use copyright US" and have now decided that US copyright law is stupid.


> I googled "fair use copyright US" and have now decided that US copyright law is stupid.

Fair use helps artists, journalists, and (indirectly) the general public. It prevents censorship of critical or opposing views, among a lot of other uses that are beneficial to society (see sibling comment). US copyright law is backwards in a lot of ways but this isn't one of them.


> Edit: I googled "fair use copyright US" and have now decided that US copyright law is stupid.

Don't be like that. Fair use is what allowed VCRs to continue existing, what allows Google Images and Books to exist, what allows the development of emulators... I could go on.


"I googled "fair use copyright US" and have now decided that US copyright law is stupid"

How long exactly did you study the issue? It is very complex.


Yes, if it's fair use then the license is irrelevant (at least in the US).


When GH devised Copilot, they could have (internally, at GH) decided to make a two-tier model, one tier trained only on unrestrictively licensed code, the other bringing in more restrictively licensed code too. And then offer them to the GH-using public as two different functionalities. An intelligently differentiated product line for intelligent people.

But: NOOOO.

In order to close off this possibility, which would restrict Copilot revenue, they instead rolled out a single undifferentiated product with lots of "gee whiz!" and associated hoopla, and made sure to offer it for free for a while to suck everyone in and head off criticism.


They are really not going to care about what you put into your license file, they are just going to claim that the use of GitHub binds you to their terms of service and that this supersedes your own license. Good luck fighting that.


ToS doesn't supersede copyright though, does it?

The real rub will be the first court precedent on whether GPTs infringe on source data IP.

Could see it going either way: fundamentally transformative or not.


You agree to Microsoft's ToS before you can put any code on Github, regardless of the license.

You can't opt out of those terms, regardless of your license, just as you can't opt out of Facebook's terms, which give them the right to use your content for their business or marketing purposes, even if the various chain posts that have spread there might claim otherwise.


The vast majority of the code I've written that is on GitHub isn't on GitHub because I would ever have stooped to putting it there, but because it is open source under a license that lets other people redistribute and edit the code (most often GPL or AGPL; maybe some older code under BSD) and they have chosen to use GitHub (which makes me sad, but as far as I'd have been concerned is totally within their rights). Are you claiming that people should not be allowed to clone other peoples' open source projects and put copies up on GitHub?


> Are you claiming that people should not be allowed to clone other peoples' open source projects and put copies up on GitHub?

If the GitHub ToS supersedes the author's own licence, then I guess the uploader is effectively relicensing the code without the author's permission. That would mean the author has a cause of action against the uploader, but not against GitHub.

I personally dislike git; I find it too complicated, because of features I don't need. Microsoft has always disliked FOSS and the GPL, and I suspect that Copilot is a deliberate effort to undermine it.


Well upthread the discussion changed to using a non-open license that prevents people from training AI on it. If you released software under such a license, someone re-uploading to Github would probably be violating their terms or yours. Regardless, Microsoft would probably remove the repo if you contacted them to let them know you're the copyright holder, and the software license is incompatible with their terms.

It remains to be seen if they have a way to then clean their training data of the influence.

It would be the same situation if someone uploaded any other proprietary code.


You shouldn't have to, though; they have a responsibility of their own to check that they have the rights to someone else's copyrighted work before they do what they want to do, rather than doing it anyway and waiting for the rights holder to come to them.

Copyright isn't 'opt in'.


I mean, if someone uploads a repo that contains proprietary code that also contains CI actions from the proprietary codebase, formatted the same as github actions, they're going to run those actions under the assumption that they are allowed to (even though they aren't, because it means they're running proprietary code). It's all automated. The person uploading the proprietary code would be the one infringing in that case.


Yes, but that's a different discussion. In this case the person does have the rights under the GPL to do what they do, but GitHub does not have an automatic right to assume that that gives them the right to enforce their ToS on the original copyright holder, which they effectively do.


> Copyright isn't 'opt in'.

Before 1989, copyright protection was opt-in in the US.

https://en.wikipedia.org/wiki/Copyright_notice


It's 2023.


If code with a license, say GPL, goes out somewhere else, by someone else (and therefore I don't have the right to change the license) and then I fork it, as per the license I keep the license and put it on github, and Github violates that license, aren't they violating the law? Don't they then have to remove that code?

If that's the case then in that scenario the license supersedes the ToS right?

Now imagine this: I write some code, license it, but don't publish it yet. It's licensed. Then I upload it to github. Does the license supersede the ToS? Doesn't github have to remove the code as a ToS violation? What if I show my roommate the licensed code first, does that count as publishing?

The whole thing is absurd on its face. All code is licensed before it ever goes on github. The license always supersedes the ToS. All licenses violate the ToS. All code on github should be removed by Microsoft for ToS violations or because Microsoft cannot abide their licenses. Their ToS is fucking illegal.


> If code with a license, say GPL, goes out somewhere else, by someone else (and therefore I don't have the right to change the license) and then I fork it, as per the license I keep the license and put it on github, and Github violates that license, aren't they violating the law? Don't they then have to remove that code?

You are violating the law, probably. The ToS would say something like "I hereby declare that I have the right to submit this software under the ToS".


I'm not violating the law, I'm violating the ToS. They should then remove my account and the offending code, lest they go on to violate the law, no?


Yes but I guess it won't happen until someone complains. Similar to other content, e.g. YouTube, but in reality nobody requests takedowns of forks/copies.


Alright, now suppose someone does. Doesn't that mean Microsoft has to rework all work they made with these codebases using this legal argument and not a fair use one? Doesn't even doing this set them up for a potentially very expensive compliance action?


I think what you say is true: either they trained on any open sourced code under fair use, no matter whether it was published on github or anywhere else, ignoring the license, OR they trained on data that potentially does not comply with their ToS (e.g. uploaded by someone who is not the author; regardless of license, they couldn't legally agree to a ToS that gives away additional rights to the work).

However, the reality is that this is all extremely muddy, far from proving that software A has copied some code from software B where you can just compare the source code. There are too many muddy steps, and you can bet that Microsoft will just get away with it.


> If code with a license, say GPL, goes out somewhere else, by someone else

If the code is GPL licensed, you can't relicense it under a non-FOSS license like you're talking about

> Don't they then have to remove that code?

Yes, if code gets uploaded whose license is incompatible with Microsoft's terms, they probably do have to remove it.

> Now imagine this: I write some code, license it, but don't publish it yet. It's licensed. Then I upload it to github. Does the license supersede the ToS? Doesn't github have to remove the code as a ToS violation

Again, yes, they probably do, and they also probably have an obligation to clean their training data of it. However, if you're the copyright holder of the code and you agree to their ToS before uploading, they might make the case that you agreeing to their ToS does grant them the license to use it in training data.


Not re-license. I upload with the same license I got it with, as per the license.

So then Github is breaking the law a significant portion of the time, at the very least?


Microsoft/Github makes a reasonable attempt to remove infringing code. If you obtain a copy of proprietary source code owned by Apple on the dark net, and upload to github, they'll definitely remove that. If companies were responsible for user-uploaded content that the company takes reasonable steps to remove, no one would be able to accept user-uploaded content in the first place.

Facebook wouldn't be able to allow users to upload photos.

Hacker news wouldn't allow me to post this comment (someone else could own the copyright, right?)


If Github's ToS says that they have carte blanche to do whatever they want with FLOSS-licensed code, including relicensing it, then either every single codebase on github violates their ToS or their ToS violates every single license and therefore the law. A reasonable attempt under these circumstances would be to remove every single FLOSS-licensed repository on github, so I'd argue no, they do not make a reasonable attempt to remove infringing code.


You are conflating many different things.


This is true, but a ToS should not be able to override an important law such as copyright, which provides you with several inalienable rights that you can only contract out of with your explicit consent. Doubly so if the ToS are changed after you post your code there without your explicit approval. I could put text in the ToS of my website that you owe me your firstborn, but that wouldn't make it legal or enforceable.


Microsoft cannot opt out of common law either


It shouldn't, but good luck fighting the 500 lb gorilla on its home turf. Most people will avoid the fight, even if they are in the right.


The ToS is necessary for GitHub to provide their services; the wording is pretty carefully constructed so that GitHub are safe to change their service and develop new things. Loosely speaking: "by uploading code you grant GitHub permission to blah blah with that code".

I'm not sure how well it will go down in court fighting this... since we agreed to it. But the more interesting question will be whether the result is a complete "you lose" and GitHub walks away, or whether they are forced to take actions in order to defend the copyright of users producing content... a "Code ID" type system that warns you if the code you're uploading is too similar to someone else's, in order to allow you to use the fun new AI tools to make code and pay GitHub, but also simultaneously defend users' legal intellectual property rights.


I just want to make sure you appreciate that if you really believe this argument then GitHub can only be used by the people who actually directly own the copyright on projects; and if you, for example, want to clone and edit my software (the vast majority of which I explicitly never uploaded to GitHub) then you wouldn't be allowed to (which doesn't seem like either the intention or the way it is commonly used)... and like, it would essentially be impossible to use GitHub to work on an open source project that has some long storied history with many hundreds of contributors without going back and getting all of them to agree.


Ah yes, good point: plenty of the people that fork projects do not actually have the copyright to that code to begin with; they just use github while they themselves are in compliance with the license. That definitely does not give GitHub rights that they would otherwise have to negotiate with the original copyright holders for. 'Open source' does not equate to 'public domain', and GitHub effectively seems to be making that claim.


I'm pretty sure that's a narrower interpretation than GitHub are aiming for. I'm just paraphrasing the parts of GitHub's ToS that I can remember, since current debate on the topic has led to me remembering a few important parts reasonably well, but I've certainly not memorised them. So this is a good opportunity for me to go re-read them and quote them directly... (also, in case anyone is about to mention it... I am aware that I'm linking to the current incarnation of the ToS and it may have changed... but there have been equivalent sections in the ToS for years, and this is pretty standard stuff for User Generated Content licenses, and digging up Internet Archive links to specific historical versions is a bit further than I feel necessary for the purposes of this specific reply)

The relevant section is this:

GitHub Terms of Service: Section D, Sub-Section 3 - https://docs.github.com/en/site-policy/github-terms/github-t...

The phrase relevant to your point is "If you're posting anything you did not create yourself or do not own the rights to, you agree that you are responsible for any Content you post; that you will only submit Content that you have the right to post; and that you will fully comply with any third party licenses relating to Content you post."

It's fair to interpret that as GitHub not going to be the copyright police. The bit at the end where I suggest a "code ID" is more of a thought experiment as to how they could continue to offer the service while complying with a potential adverse ruling that doesn't ascribe blame to them or the service. There's another section of the ToS that I, with my "knows slightly more about law than average but absolutely not a lawyer" hat firmly on, feel will be how GitHub's legal team at least try to make short work of the lawsuit; their success with this tactic is a matter for the Courts, and I'd love better legal scholars to weigh in.

GitHub Terms of Service: Section D, Sub-Section 4 - https://docs.github.com/en/site-policy/github-terms/github-t...

Which reads thus:

"We need the legal right to do things like host Your Content, publish it, and share it. You grant us and our legal successors the right to store, archive, parse, and display Your Content, and make incidental copies, as necessary to provide the Service, including improving the Service over time. This license includes the right to do things like copy it to our database and make backups; show it to you and other users; parse it into a search index or otherwise analyze it on our servers; share it with other users; and perform it, in case Your Content is something like music or video.

This license does not grant GitHub the right to sell Your Content. It also does not grant GitHub the right to otherwise distribute or use Your Content outside of our provision of the Service, except that as part of the right to archive Your Content, GitHub may permit our partners to store and archive Your Content in public repositories in connection with the GitHub Arctic Code Vault and GitHub Archive Program."

For me the key quote is "including improving the Service over time. This license includes the right to do things like copy it to our database and make backups; show it to you and other users; parse it into a search index or otherwise analyze it on our servers; share it with other users;". Now there's some legal arguing to be done about whether charging for the AI constitutes an infringement of the second paragraph, which opens with "This license does not grant GitHub the right to sell Your Content. It also does not grant GitHub the right to otherwise distribute or use Your Content outside of our provision of the Service", but that's a very different argument to what I see a lot of people making. People are arguing (generally speaking) from the standpoint of "it's not right, this violates my rights as the author, having selected this license for my code/commits and published it under that license for others to share" ... not "I didn't agree to GitHub selling my content and this constitutes a violation of GitHub Terms of Service: Section D, Sub-Section 4, where they told me that they would not sell my content".

But broadly speaking, unless the argument shifts to GitHub Terms of Service: Section D, Sub-Section 4 and classifying this as an unapproved sale of the user's content, I don't see how GitHub are not well within their rights to have trained the AI model and offered it as a service. By agreeing to the ToS we agreed to Section D, Sub-Section 3, where we promise to only post code we have the rights to post, that we the users will comply with the legal complexities of third-party licenses, and that we are basically responsible for not posting stuff to GitHub for which we can't grant GitHub the requested legal rights. Combined, that means we gave them permission to use our code and commits, regardless of any license files we may have put in the repos, to train the AI model. We can definitely argue derivation and what justifies a sale, and I'd be inclined to say they may actually have breached that term, but no one I've read is talking about that; it's all about copyright infringement for AI-generated code and moral rights with respect to using the code to train the model, not a clear-cut contractual breach of the Terms of Service that GitHub may or may not have perpetrated on us as the other party agreeing to be bound by the contract.


The key distinction I'm interested in is providing the GitHub (or any similar product) "Service" vs selling a separate, derived product (Copilot / ChatGPT).

A: Common ToS to say that a product's owner obtains a license to user content for purposes of providing that user the product service.

B: Somewhat common ToS to extend that to providing the product service to third party users (i.e. use your content for other users of the service), but depends on business model (e.g. most social* businesses).

C: A lot less common ToS to obtain a right to distribute user content in derived products.

A number of sites have gotten into hot water with their userbase over trying to update their ToS from B to C. From memory... Adobe Cloud, DeviantArt, maybe some others?

Typically this gets flak in creative communities, given that it is many people's business, and they're more concerned about distribution rights than your average coder.

At its base, OpenAI/Microsoft/etc. will eventually run into the exact same issues that bedeviled the Linux kernel in the 1990s, except with a much thornier IP ownership question (given the greater number of parties).


But... we're all aware - as is GitHub - that plenty of the content there is not posted by the original copyright holders, who are the only parties that are able to enter into such a contract. That was the reason for GitHub coming into existence in the first place. You can't turn around a couple of years later and start arguing that the use of GitHub allows for a blanket exemption on copyright law, which is effectively what this amounts to.

The GitHub ToS is written by GitHub; it's not a contract in the sense that no consideration has been given to the other party, and as such it isn't legally binding on that other party. But regular law, such as copyright law, still applies to GitHub.


It's the same as other user generated content sites... The ToS is to legally shift blame from GitHub to the users... and that's what made me think of "code ID" actually, since GitHub have a firm defence in the form of "users doing illegal things isn't our fault, we asked them not to and tried to kick people off when we found out they were violating the terms". But they might still get slapped around a bit by the Court and need to implement some form of safeguards, the way YouTube was forced to, because your point about how binding the terms of service are when the consideration is "use of this service in exchange for agreement" is true: there is not a super strong contract here. It's nominally more binding than the average clickwrap pre-install EULA, since the consideration in exchange is use of the service itself, but as case law around things like scraping and other internet activity has shown, it's definitely not as binding as a physically signed sale contract would be...


It shouldn't matter if the copyright holder agreed to it directly, if they've published the original code under an open source license. Since open source licenses all allow people to use the code for "whatever"

Even GPL doesn't (yet) include a clause saying the code can't be used to train AI unless the AI itself is open source


> Since open source licenses all allow people to use the code for "whatever"

That's not what they allow. Copyright being a 'right', it allows you to pass those rights on to others and to retain some for yourself. If not explicitly passed on, the right still rests with the original author; there's plenty of precedent for that.


To take an example: someone who used MIT licensed code but doesn't reproduce the license.

Therefore isn't following the terms of the copyright grant, ergo doesn't have a license for use, ergo is violating copyright.

Now what does that look like when I take 100 different open source licenses, including MIT, put them in a GPT blender, and then productize my output without following any of the licenses?

... makes you think there might be a legal component to why OpenAI switched to a SaaS model. Although I believe they'd still be in hot water over any AGPL et al. code.


I can't wait for this stuff to be legislated to establish once and for all what the legal status is.


This is why I only put code on GitHub if I want it to be seen by everyone (including Big Data).


It is a heavy gorilla.

1. Huge company.

2. Impossible to prove that a weight of -0.7 in a neural net means they used your code.

3. The code spat out by the bot isn't your code.


Apple, Google, Meta, Amazon, Nvidia use Github. They together are a bigger gorilla.


They are not your gorilla


And all of them would have jumped at that chance if they had seen it.


> ToS doesn't supersede copyright though, does it?

It does unless you can afford to sue Microsoft.


> ToS doesn't supersede copyright though, does it?

But in what way does reading a copyrighted work and then producing a mass of numbers as a result infringe copyright?


The copyright infringement comes about later, when that mass of numbers is used to produce a topically related work. The same rules apply for humans -- see the concept of "clean room implementation".


It really doesn't. Prose isn't source code. Learning something, then later writing something else isn't copying.


My limited understanding of case law is that transformative use is still judged very human-centrically.

E.g. the courts take a dim view of any attempt to create a machine (in the abstract sense) that takes in copyrighted works and churns out similar-but-uncopyrighted works


> This license does not grant GitHub the right to sell Your Content. It also does not grant GitHub the right to otherwise distribute or use Your Content outside of our provision of the Service, except that as part of the right to archive Your Content, GitHub may permit our partners to store and archive Your Content in public repositories in connection with the GitHub Arctic Code Vault and GitHub Archive Program.


By the same logic: if I read your code and learnt something from it, do I need to open source all my thoughts?


"And the code generated by the model also falls under GPL"


I abandoned github the day that they (and others, including people here) started arguing that their ToS trumps your code's license. That's absurd. It's authoritarian, it's hostile, it's an act of enmity. Fuck all that bullshit. I do business with no entity, period, that treats me with that level of disdain.


Yeah, and it's, like, nonsensical for the service: if you are working in open source, the reality is that a lot of the time you are working with software you don't own. I develop a lot of software, and the vast majority of it was open source... but I've only ever put two projects of mine on GitHub (and one only because I was working with some other people and I essentially got outvoted ;P). And yet, if you search for my code, I'm sure you can find almost all of it on GitHub, because it was open source and other people wanted to be able to edit it or even merely redistribute it... which I'd have said is their right, but I guess not if supposedly that overrides the license on the software? Or, if this were the case, how would one expect some large/old open source project with a ton of prior contributions--which is normally fine, as everyone has the same rights under the license and so you just all mix your code together and are happy: you don't actually need some central organization with ownership until you want to change the license (which is something many people explicitly don't want to ever happen)--to be hosted on GitHub? Even simpler: most of Google's code is open source--such as the Android Open Source Project, or Chromium--but they don't host it officially on GitHub... I guess it isn't OK for anyone to work on this stuff on GitHub either, right?


Yup, and this contrived conundrum is proof positive of a truth: all code is licensed before the ToS is agreed to, whether published or not. Licenses override ToS, which means that Microsoft needs to remove all code on github either due to ToS violations, since their ToS directly contradicts all licenses, or because the licenses contradict their ToS.

What it comes down to is that the Github ToS is illegal.


Absurd is communities like Elm where all identity and package management must be published on Microsoft GitHub or you and your code can't be a part of the community repository.


their ToS trumps your code's license

wait, what?

do you have more details on this? what rights does the ToS claim that would violate an existing license?


Is this their official stance? I'd love to read about it but web searches lead me back to this comment section.


Never heard of this, that's quite shocking. Do you have any source?


If someone were to break Microsoft licensing terms and give me a copy of windows code

Then if I read that and built my own understanding

Then if I used that knowledge to implement my own version of windows that was compatible with Microsoft’s and distributed it under my own license

Would that be legal?

WINE etc are built in clean room environments for good reasons.


This. It shouldn't matter whether copyright violation was generated by a human brain or via an instrument (AI). ReactOS developers also rely on clean room approach: https://reactos.org/wiki/ReactOS_FAQ#Is_ReactOS_based_on_Mic...


You can find the Windows XP source code publicly on github right now. So I'm not sure how serious MS actually is about this...


"When I was young I found X. It was wonderful and full of stars! As I grew up, things changed, but I stayed with X because I was used to it. Finally, Y happened and I have to leave X because I really dislike Y. Goodbye! (ps. I'm not actually leaving because I want to be involved with things that are still going on there, but I want it to be known I don't really like X anymore)"

Replace X with GitHub, Facebook, Twitter, Instagram, Usenet, IRC, ... this is an archetypal growing up and facing change in the world story.


> Replace X with GitHub, Facebook, Twitter, Instagram, Usenet, IRC, ... this is an archetypal growing up and facing change in the world story.

It might have the same structure, but that is merely a grammatical structure, or maybe a structure of reasons. It says nothing about the actual reasons themselves. We should not dismiss an argument just for following a well-known structure. For all of the things you listed there can be valid reasons to leave them, and those could be put in that structure or another typical one. I think it becomes even more natural to have that structure if we consider the "flow" of time. Of course things are going to change. But that does not invalidate the argument at all.

Maybe you only wanted to highlight the structure. I don't know. I merely want to mention that arguments should not be dismissed purely based on their structure.


I have seen this type of advertisement for product Y a lot lately. You don't have to bash one product just to advertise another one. AI is not going away; you either embrace it now or after a few years. With usage, it will evolve and get better over time, both as a product and in the solutions it provides.


While I agree with the general observation of the article, the tone and wording feels needlessly harsh and subjective. Sentences like „I could go to GitLab, but they have their own fair share of issues and a disgusting UI/UX.“ really diminish the article and by association, also the author.


It's a personal blog. The author has no obligation to keep it to the standard of a trade magazine. Blogs are supposed to be subjective and it's okay for a blog post to be harsh if the author feels so.


Sure. Just saying that for me personally, it’s a turn-off and makes me take the article less seriously.


For me it's not just the lack of professionalism; the writing also makes the author sound pretty junior and inexperienced with how the world works. Not that there's anything wrong with that (we've all been there). But it can cheapen the discussion, and it feels like what really needs to be untangled gets overlooked by stuff like "I was upset because they changed the UI and then I got used to it." Not very interesting details and not a very useful thing to discuss.


The author is a college freshman, and that seems pretty par for the course for someone in their late teens. It's not really an "article" targeted for HN critical review, just a blog post on a personal site.


This is what I noticed too.

We have a kid with no experience in the real world telling us what they think. Why should we care? I actually don't


Nice display of arrogance there.


I appreciated the specific word choice there; 'disgusting' implies a strong, visceral reaction, as if to say GitLab's work is so far from the author's taste that it blocks them from considering the product, and didn't seem all that harsh if that's how the author actually feels.

In a counterintuitive way, it was nice to read -- it says that UI is important; that it shouldn't be an afterthought; that it has the power to actively invite people in or greatly repulse them instead.


What is so bad about the UI/UX? I'm not a frequent GitLab user but it doesn't strike me as anything out of the ordinary.


A few things I personally don't like:

- Use of space: GitLab has more empty space across the site. Elements have a bit too much padding for my tastes and are spaced a bit too far apart. Valuable screen space (top center) contains a lot of needless information on GitLab (table headers, project ID, language breakdown, a prompt to add a CONTRIBUTING file) which is either not present or located off to the side on GitHub.

- Color and typography: GitHub is one of the best sites at this, using pops of color for important buttons and really fun typography. I love how GitHub has your avatar "speak" a commit description, it's a great little touch. GitLab's design feels like it has less personality. Quick comparison: https://imgur.com/a/Ejd1Q1X

- Attention to detail: A bit less of it on GitLab -- icons and text don't vertically align in some places; several of the dropdown menus could use a bit of visual improvement, or have an ill-fitting item or two; in my screenshot above, GitLab's "Commit message" and "Target Branch" have inconsistent capitalization.


The main page has a huge prominent “Get Free Trial” button, while the “Sign In” button is small and hidden in the menu. A common dark pattern.

The performance is sluggish.

There's always a lot of buttons and stuff on the screen that most users will probably never use, making the UI cluttered.


I actually agree with the author. Using GitLab is unpleasant. I've also tried cgit, Rhodecode, Gitea, Bitbucket, &c. GitLab just feels kludgey, personally.

I find comments on “professionalism” about a personal blog to be rather snooty and gatekeepy in tone. Reminds me of the unpleasant atmosphere of Lobste.rs.


I don't understand why GitLab (as in gitlab.com) doesn't focus more on the discovery part in order to make people feel more welcome.

I was using github.com/explore almost daily while it lasted. The trending repositories by language were an amazing tool to discover new tools and ideas.

I really wish GitLab had something equivalent to this!


GitLab team member here.

In our 15.10 release (March 22), we added a new section called Explore that helps with content discovery and includes a tab for Trending projects that can be filtered by language. You can read about it in the release notes [0] or just go check it out [1].

0 - https://about.gitlab.com/releases/2023/03/22/gitlab-15-10-re...

1 - https://gitlab.com/explore/projects/trending


Gitlab is a bit weird in that regard. Their standalone self-hosted use of it is great, but overall discovery is extremely poor on both self hosted and cloud editions.

They seem to have based the whole thing on the way Bitbucket works and completely ignored that being able to find a repo you've not been directly added to would be slightly useful.

If someone from gitlab is reading this and thinking "But you can find things" - you're blinded to the issue by being an internal user. There might be ways of doing it, but it's a convoluted mess and doesn't come close to github in that regard.


I would probably agree that GitHub is better for discovery than GitLab, but the OP's comment is funny to me, because having worked extensively with both, I'm a much bigger fan of GitLab's UI and UX for builders. Groups and nesting are massive when it comes to being able to work with teams, and the fact that you can set options across groups at any level eliminates whole classes of things that are frustrating problems on github, or anything else that uses flat hierarchies.


I wrote a (public) uBlock filter list to hide this social media-like stuff. The “you might also like” genre exists to keep you on the platform and I have enough distractions.


I think gitlab just gave up on trying to compete with github. They don't care about anything other than enterprise use cases, which are mostly on private instances.


Yeah, I agree that some words could have been chosen better, but overall it changed my mind about the use of GPL-licensed code. Copilot and other ML tools should not use GPL-trained models to create proprietary-licensed code.


If someone misuses code people put a lot of time into, they may eventually get a cease & desist and then be sued, especially for open source code under licenses like the GPL.

I don't care for AI ethics; it's simply a breach of copyright to take my code and reuse it somewhere (there's a lot of my code on GH that isn't licensed at all). Since OpenAI can't guarantee it won't breach licenses or copyright, it should simply not allow code questions, the same way it doesn't allow sexual or illegal content "just in case".

But wait, no, that one's too much of a cash cow.

I'm sure it would happily reproduce 1:1 a piece of leaked source code some company owns, and OpenAI would tell you their hands are tied and there's nothing they can do.

Of course, when it comes to making sure it doesn't say "fuck" or any variation of it, that instead seems to be a high priority.

And people go on defending it, as if there's nothing OpenAI could do. Yes, people are also trained on code, but people generally don't have the capacity to remember a 50+ line function 1:1, and if they do, they can likely remember the license, too. It's a non-argument.

And then, to top it all off, tell me with a straight face that people have not gotten in trouble for writing a similar enough work, e.g. in academia.


> That there was nothing being given back.

Out of curiosity: How much does keeping github afloat cost per annum?

And not just afloat, but fast, reliable, secure and convenient.

And now we factor in how many users github has, and how many of them use it for free. Hell, I'm sure more than a few companies use it for free.

Microsoft is a corporation. Corporations want to make money. This is hardly a secret, and they can hardly be blamed for it. If people want to use a free offering of a company, well, then they have to be aware that the company in all likelihood benefits in some way from that free offering as well. Because all that compute, all that bandwidth, all that storage, and all the people developing & maintaining it, don't come for free either.

If I don't like it, I either have to use an alternative run by a different company, or setup my own git server.
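
For what it's worth, the "set up my own git server" option is less work than it sounds. A rough sketch, using Python only to drive the same git commands you would type by hand; the server, user, and repository names are placeholders, and it assumes a bare repo (e.g. created with `git init --bare repos/myproject.git`) already exists on the server:

    import subprocess

    # Hypothetical self-hosted remote; replace with your own server and path.
    REMOTE = "user@myserver:repos/myproject.git"

    def git(*args):
        # Run a git command in the current repository, failing loudly on errors.
        subprocess.run(["git", *args], check=True)

    # Point the existing clone at the self-hosted remote and push everything.
    git("remote", "set-url", "origin", REMOTE)
    git("push", "-u", "origin", "--all")
    git("push", "origin", "--tags")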


I think the "laundering" charge works both ways. While it is true that it will help some leverage work that went into GPL projects for new non-GPL projects, it is more important to know that it provides cover from accidentally importing copyrighted material from Microsoft, et al, into GPL projects.

While the value of the trade might not be fair, the costs that legions of corporate "Intellectual Property" lawyers can force upon the lone open source developer are a far larger drag on development.


I am someone that also left GitHub. [1]

To the author, yes, it's worth it.

I didn't go to another service, though; I set up Gitea on my own server. Because of drama there, I'm going to have to replace Gitea as well.

But GitLab itself is going down the drain, I'm not welcome on SourceHut, and I haven't heard the best things about Codeberg. Also, git needs a replacement.

[1]: https://gavinhoward.com/2020/04/i-am-moving-away-from-github...


I'm out of the loop. What drama has there been with Gitea?

(I'm not the least bit willing to put my code on the likes of SourceHut or Codeberg. They are just new iterations of the Sourceforge/GitHub pattern that haven't gone bad yet. Besides, I like to keep the software I don't publish on my own computers.)



Oh, interesting.

The silver lining seems to be that it forced the creation of a fork that promises to stay free, has actual documentation, and doesn't force the Docker bullshit upon installation.

Looks like I'll finally upgrade from Gogs.


Has HN become an airport now that we are broadcasting departures? This article is interesting on a personal level if you know the author, but his reasons are not something most users care about and him moving on is not something I will think about at all


I vaguely get what the author is saying, but I think "give back" is the end result, not the issue. The vast majority of people don't give back. It's exposure.

When GPT uses code from stack overflow, you don't see the community, so there isn't any possibility to engage with it; essentially starving the community.


> I feel like GitHub has lost that magic.

Am I the only person that used Github for what it was? A place to push and pull my repos from my CLI? That hasn't changed. I rarely use the website.


People are so melodramatic. Yes, this is the future; yes, AI is scary; but this is it. Whether you like it or not, technology and life can continue without you. We could very well be living in what intelligence from another era led to, and it is happening all over again. Life is short, enjoy the ride.


"We could very well be living in what Intelligence from another era lead to and it is happening all over again."

Very unclear statement...


The timing of this post is a bit uncanny.

I left Github last week (removed all of my repositories) and made contributions private. I went further and removed all public mirrors on my personal site (along with all writing and projects on my site).

I have been growing tired of the open source community in general and with a particular distaste for the Github community (and company as a whole).

I won't get into all of my grievances and experiences simply because it doesn't matter and nothing will change.

I decided to focus on creating things for myself. I don't need your stars, green check marks, crappy pull requests and endless issues. Maybe some day I will find the spark to open my personal site back up, but it feels unlikely.


Unfortunately moving to Gitlab or Sourcehut doesn't really help, because the underlying model (GPT-x) is trained on the entire internet, so that includes all scrape-able websites. The only way for your data not to be used in GPT (and therefore Copilot) is to not to put it on any website or make it very difficult to access, like encrypting it.
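
For anyone who actually wants the "make it very difficult to access" route, one hedged sketch is to publish only an encrypted bundle of the repository. The filenames here are illustrative, and Python is just wrapping ordinary git and GPG commands:

    import subprocess

    def run(*cmd):
        subprocess.run(cmd, check=True)

    # Pack the whole repository (all refs and history) into a single file,
    # then encrypt it symmetrically; only people with the passphrase can
    # clone from the decrypted bundle.
    run("git", "bundle", "create", "myproject.bundle", "--all")
    run("gpg", "--symmetric", "--cipher-algo", "AES256", "myproject.bundle")
    # Publish or back up the resulting myproject.bundle.gpg instead of the repo.

Of course that also locks out the humans you might have wanted to share with, which is the trade-off.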


> Unfortunately moving to Gitlab or Sourcehut doesn't really help, because the underlying model (GPT-x) is trained on the entire internet, so that includes all scrape-able websites. The only way for your data not to be used in GPT (and therefore Copilot) is to not to put it on any website or make it very difficult to access, like encrypting it.

Having the entire git history decorates specific chunks (at least entire commits) with context from the commit message. So you can process not only the entire repo at one specific state in time, but its entire history up to this point in time. There is valuable knowledge in making sense of it, but this is not accessible to us. It resides in the knowledge base of one company (or two).


Not sure about this, but training a model on the website that displays the code is not quite the same as training it specifically on just the code. Moreover, (raw) repo content files might not even be included in crawled datasets (e.g., look at https://gitlab.com/robots.txt). I think there is something specific to GitHub being part of Microsoft that makes processing that data much easier.


Is that such a bad thing? I write code to get it out there, not for it to be exclusive or something. Any code that I write that helps somebody else in some way is a huge win for me and what keeps me going. If I could, _all_ of the code that I write would be open source or public but as-is the best way to make a living for me is to write closed-source code.

I see my code getting scraped by these AI tools as me having contributed to something greater than the sum of its parts. And I use it! My code helps you, your code helps me.


Have to say, I agree with the author. It should be super easy for us to select "don't use my data" on anything we own - Github accounts, websites, anything that has our real identity attached to it. With 2FA our identities are attached to Github accounts. Further it should be equally as easy to check training data across all models to ensure our data is not included.

Glad to learn about SourceHut. I'll check it out.


I have also been using GitHub since the early octocat days (2009?). But I never saw it as this magical place the blog describes. Consider that we concurrently had access to really magical-feeling collaborative open source tools like RubyGems and npm. By comparison, SourceForge felt way more magical during its generation.

What WAS and IS magical is git: a fully distributed, has-everything-you-need-but-not-too-much, beautiful repository with easy-to-use history editing to cultivate sensible code trees. The only magic from GitHub, imo, was that it made git the de facto standard when we could easily be stuck using svn today.

As for his points about AI being against the philosophy of open source, I imagine he will have the ear of RMS and a few other absolutists, but in my personal experience AI assistance has lit a fire of individual empowerment for the little guy wanting to take on big projects so bright that I think it is the single best thing we have left to save open source. It is a new wave of open source, and we can collaborate with 100% of the population instead of 5%.


I stopped a while back. And now I'm unsure what I think given the AI revelations.

On the one hand MS gave a great product to programmers for free; on the other they've used the code therein to do a lot of junior programmers out of their future jobs; and then again, this will increase the productivity of many programmers, for a profit.

It's hard to be too critical yet I'm no fan either.


Not trying to Ad Hominem, but why should anyone care that university freshman is moving away from Github? This is essentially blogspam.


Have to read it yourself to decide if the reasoning appeals to you.


> I feel like GitHub has lost that magic.

I'm willing to bet that GitHub itself hasn't lost "that magic". What's happened is:

- internet and development communities have matured. The types of 'fun' that's happening is different, but it's no less magic

- author got older


I recognize your journey. Most of my GitHub work has gone stale and although I didn't say goodbye, most of my work is now on GitLab where it receives more love and care and where I am certain I could always move to a self hosted instance if GitLab decided to go in a direction that doesn't match my philosophy. GitHub has been an interesting ride, a place where I landed after sourceforge and codeplex went upside down. I felt at home until I didn't. Thank you GitHub for the great things we achieved.


> For just $10 a month or $100 a year, you too can get programming help with no obligation to give back to anyone!

I would say the use of trained data is "giving back" to all users of Copilot?

The only thing that has to be considered is whether the pricing is fair, and with all the competition in the space it will soon be, or may already be, below cost.

Also in my opinion, not using open-source for programming aids is less freedom (libre)--without getting into specifics of compatibility of doing so with individual licenses.


My instincts are giving me similar feelings. It’s time to move on.

This has probably been the play for longer than we’d like to admit.

Ingest everyone’s code and sell it back to us at a price.

Fools will soon be voluntarily uploading whole code bases and any novel solutions they have left. These solutions will then be Alt+Tab'ed into the competition's editors for a small fee… wow and lol.


Q: What about the social features of GitHub? I enjoy the discovery and star features. There just isn't any community like it. Where can I get that?

It would be great if there was something that had the discovery & social aspects but was agnostic to the actual site hosting the code. Does something like that exist?


Forgejo [1] is working on integrating activitypub [2] which will let instances federate with each other.

[1] https://forgejo.org/ [2] https://codeberg.org/forgejo/forgejo/issues/59


+1 for Forgejo


Maybe some day Software Heritage (https://www.softwareheritage.org/) will expose enough metadata between repositories so that people will be able to train their own recommenders for better discovery.


You don't really give up GitHub as long as you're employed at one of the many companies that have accepted it and you're looking for information on pretty much most open source software these days. The only thing is to just host your own projects elsewhere. I did this a long time ago because I was unhappy with GitHub's interface. I am realizing further that all I need is a web page and a link on my personal web site. If I really wanted to publish a project, I'd probably find other avenues. I think SourceHut does look nice, if I cared enough.

In some cases, "GitHub" has become a required skill in some jobs now (not git, mind you, but _GitHub_). I personally fumbled a little bit when I was asked that - what is it? The CI/CD stuff specifically, or the "using git" part which is the skill? :) I was a bit befuddled.


I have heard from some people that many companies specifically ask for your GitHub profile - in many places only the user tag, not a link that could point to GitLab or something else.

So it will affect your job chances, even if the job has nothing to do with managing GitHub.


Thankfully that hasn't happened for me, yet. Or maybe that could be my own selection criteria - an active GitHub account should not be a prerequisite to get any type of computer job. If the hiring manager is serious about me, they can find the URL in my GitHub profile.


I've done a little bit of helping with hiring and I always interject that it should be links to code forges.


My thoughts on the co-pilot thing remain as vague as ever but I made something of a connection reading this:

> GPL violations by soulless corporations stealing the hard work of independent programmers and assimilating it into their own proprietary software

"stealing"? Where's it gone? It vanished!? It's been erased from the internet?

Why do we play these semantic artificial scarcity games with code that has deliberately been made publicly accessible? If I read your code and copy it to a file on my computer it hasn't been destroyed, it hasn't gone anywhere. If my computer reads the code it hasn't vanished into the ether.

I don't pirate films/games/TV/books but I think open source code is an entirely different beast to the former.


I'm not sure people need to announce their departure from GitHub. You may not like their use of AI, but legally speaking it's entirely within their rights to update their TOS for this. Is it sketchy? Absolutely. But given that over the years they've provided billions of dollars of hosting and compute services to the open-source community for free, I'm not sure there can be much of an argument that they don't have a right to create a sustainable income source.

They undoubtedly should have handled it better, perhaps even offering an option for individual repos to opt out entirely. But if they hadn't done it, someone else would have.


>I'm not sure people need to announce their departure from GitHub.

I think people should if they are leaving for the reasons described in the article, or other reasons due to Microsoft. If enough people leave there is a very remote hope that Microsoft changes its ways. FWIW, I have been waiting for that to happen for well over 30 years :)


To be honest I never really believed in the original business model:

- subscriptions for companies and proprietary software

- deals with 3rd party integrations

It's cheaper and not more difficult to self-host most parts nowadays, and the tooling keeps improving (k8s, Helm charts, things like Flux and GitLab). The hard part is actually writing those integrations (GitHub Actions). You also have more flexibility, as you can run whatever code you want on your infra.

So it was only a matter of time before things changed (fewer features in free tiers, ads, or another way to make money off the user base, like with Copilot).


I think the author hasn't considered this: I believe Copilot is trained on not only OSS on GitHub but also proprietary software. In terms of reproduction by Copilot, Copilot is not licensed to distribute OSS without license just as it is not licensed to distribute proprietary software. To any extent that OSS authors have a case against Copilot, so too do proprietary software authors, and likely moreso. If there is indeed a case against Copilot reproducing code, I'd expect it to be championed by well-financed corporations foremost.


Recently I noticed if you try to search a project’s “code” on Microsoft GitHub while unauthenticated, you’ll get `?` results and be prompted to log in to see anything.


I've replaced most of my repositories on github with notices like this one (for the same reason): https://github.com/tpapastylianou/process_optargs

I wonder if there's a licence out there already that enables use for humans, but restricts use by robots. (GPLv4 perhaps?)


> Well, there just aren't many places I can go. I could go to GitLab, but they have their own fair share of issues and a disgusting UI/UX.

Not sure when the author last checked it out; GitLab used to have a bad UI, but they've made some improvements recently. IMO it's pretty OK now (still not as good as GitHub's, though I don't even think GitHub has a good UI).

But SourceHut is a nice pick anyway :)


GitLab team member here. Thanks for the feedback.

Beautifying our UI [0] is an ongoing effort and usability improvements [1] are specifically mentioned in the product investment themes for this fiscal year.

0 - https://gitlab.com/groups/gitlab-org/-/epics/7781

1 - https://about.gitlab.com/direction/#world-class-devsecops-ex...


well, GitHub isn't as "raw" as it used to be, that's for sure. It's much more user friendly now, much easier to make a single click and do what you need, like "copy to clipboard" or "download (a single file from a repo)" and so on.

as things get easier to use, the coolness factor definitely goes down, and the pride you get from learning the tool goes down, because the easier something is to learn, the less special any given person is for having learned it. There are software packages which intentionally remain complex to preserve this feeling, and that's considered bad.

From personal experience, if you want GitHub to feel like magic again, host an enterprise server instance on AWS and have over 10k active users. You will feel the complexity in your soul as you work out ways to monitor that thing effectively and react to problems before the symptoms get noticed by your extremely demanding users.

I realize that very few people can do this, as GitHub Enterprise licenses are expensive, but hoo boy GitHub has not lost its magic for me.


There's a GitHub for the decentralized era: https://radicle.xyz/


Github was always private and their founders had already sold other companies to Microsoft, so it's hard to understand how anyone could be shocked and disappointed when they sold Github as well.

Copilot is great IMO, so good it feels almost like cheating. But if you are so broke that you cannot come up with $10 a month, then don't use it - you can still use GitHub.


Is there really anything ‘unique’ about the code we write on a line or even function level? It all feels like plumbing to me. It’s how you put it all together that makes what you create with it unique and special.

Copilot so far has generally just figured out what I was going to type next anyways without it, nothing mind blowing, but super helpful.


While I don't think MS has the right to just ignore copyright, I think this is closer to the idea of code reuse than previous approaches, where someone had to choose a specific level of granularity and abstraction and hope the result could be effectively used without any change.


See I've been doing Open Source for a long time and I love that GitHub/OpenAI/Microsoft is training on my code.

I want it to be helpful.

I'd prefer to run my own version, but we are I guess between 3 and 6 months from having competitive open offerings.

When I write open source code I'm serious about the open, non-discriminatory nature of it.


FWIW, that's great for your intentions with "open source code", but some people--like myself--in fact explicitly placed licenses on our code (and additionally did not even upload it to GitHub ourselves: our code is open source and of course anyone can edit it in public and redistribute it) that add "discriminatory" clauses as we aren't merely being purely altruistic: we are attempting to help provide code as ammunition in a war against developers who would attempt to lock users out of control over their own technology.

When you put your code under GPL (as I explicitly and actively chose to; or to put your content under something like the Creative Commons ShareAlike license, which I've explicitly used for many photos I've distributed over the years) you are purposefully choosing to help only those people who are willing to agree to the same level of open-ness in their products. And, as a reader, you know that if you read my code and "learn" from it, you are tainting yourself in a way that might make it difficult to later defend yourself from claims I might make.

To the extent to which it is legal for Copilot to train a model off my code as it might be fair use as some kind of transformative work, I want to be clear that I do not at all believe it is legal for someone to USE Copilot to write software that might be similar to my software... at least without having their lawyers carefully vet the resulting code it generates to determine that none of the expressive intent of my code has managed to leak through Copilot's attempt to launder my code as the user attempts to "autocomplete" something similar.


The main problem is that GitHub is becoming unreliable like the rest of MS's software (led by Azure).

Everything else is just business.

At the same time I don't think it's smart to abandon GitHub for networking / CV reasons.

Once I'll be rich enough, I'll delete my account and use a self hosted git server.


The US Copyright Office seems to say anything written by AI cannot be copyrighted. Also, a work that does not disclose which parts are AI-generated cannot be copyrighted. All open source licences and Creative Commons licences are based on copyright. If you do not own the copyright you cannot release code under MIT.

As far as I can tell no one is labeling which parts of code used AI. Most devs I know are using ChatGPT at least for inspiration in solving problems. Pretty sure that soon almost every file will have snippets of AI code. Companies have little control to stop this. So to me it looks like every company is unknowingly making its codebase public domain.

OpenAI caches its responses, so it can scan private codebases to know which use AI. Copilot also knows which codebases use AI. If they find a codebase that uses AI but does not disclose where it is used, then that codebase can be used for further AI training. Major companies will need to use Copilot X, VS Code and GitHub to remain competitive. So Microsoft could end up sucking up all the proprietary knowledge of every industry.


Competitive at what? Templating?


OP: "it's a stochastic parrot" GPT-5: "I know you are, so what am I"


I kept my github account to report issues on the projects still on github.

Even though GitHub is owned by the untrustworthy MSFT, it (still) does work with noscript/basic (x)html browsers (well, I created my account probably a decade ago).

gitlab: I cannot even create an account.


Stop reporting issues to github.


Tell valve to move to another noscript/basic (x)html friendly source code and issue reporting site then.

Yep, I have a sin: I play native elf/linux games.


Just had to report a bug in alsa-lib... and it has to be done on GitHub.

Ok, we have a situation here. Critical/significant open source components going on sites which should be avoided. There are noscript/basic (x)html alternatives.

Why are those not chosen?


Why do people believe that giving a gift (releasing free software) obligates the recipients of those gifts?

This idea that open source code can be "stolen" is insane. You can't steal a gift. Once you give it away, it is no longer yours.


Maybe for the same reason people believe in reciprocity with actual gifts and in other areas of human interaction?


If someone says "stochastic parrot" as an argument I know they have no idea what they are talking about.

That's calling a human a bunch of cells. It's true, but you are losing track of the (astronomical amount of) emergent properties.


Github has too many open source resources for people to get up and quit. Code repositories on github are akin to google services; the logo may not be there but the foundations seep through to our existing products.


Code on github is not open source, end of story.


Why don't the alternative projects just copy the good old GitHub interface instead of starting from scratch with a worse one, scaring off the devs who are transitioning?


I'm sure there are others, but Codeberg is exactly this: https://codeberg.org/


Yes, that's a good example (and the author also mentioned this), but then there is sr.ht


Copyright could be an issue. Also, they may not agree that the GitHub interface is “good”.


I'm not sure copyright blocks that - see Codeberg, which is pretty close. Have you seen an example of an interface that is better?


I haven’t really looked around, but the GitHub interface is rather bad as far as I’m concerned.

I’m not sure how Codeberg would fare if GitHub sued them for copying their design. I guess GitHub doesn’t really care as long as they’re the king of the hill.


Microsoft eventually shits on everything, so this was predictable. The spirit of open source is orthogonal to that of corporate interests.


My only concern is whether GitHub is training on private repos or not. That would be the main problem, but I guess we will never know.


> It's a stochastic parrot

Stopped reading right there. The author clearly has no clue of how neural networks work or what makes them tick.


"stochastic parrot" is an interesting shibboleth for a certain school of thought on the capabilities of LLMs, but i think it is uncharitable to make the leap straight to "has no clue of how neural networks work". on some level, all of us who spend our free time enough to know what "stochastic parrot" refers to have some idea how NNs work, and on another, none of us know how NNs really work.

we could all do with a bit more humility dealing with this topic and each other.


That's fair, but it is a negative take that disregards all emergent properties. If you strip all emergent properties from it, there is nothing left. The same thing is true of all biological systems.

Why bother being "human". We are all just a bunch of cells exchanging chemicals and electrical signals. That's all there is to it. There is no reasoning, just a bunch of signals going back and forth.


How can you honestly say that "emergent properties" are real if you haven't really seen the training data or don't actually know how the thing works?

It stands to reason that the bigger the model, the more likely you'll get an answer to the question you're looking for. Even the apparently "tricky questions".

Even things like translating code from one coding language to another...

Anyway, maybe we are ALL stochastic parrots (including ChatGPT-10) and that's all we'll ever be...bravo.


Emergent properties are never "real". They just are and you can see them happening, but "underneath" it's nothing.

Edit: I meant to say I don't need access to the training data. By experimenting with inputs/outputs you can get a basic picture. I don't need to see biological scans to say something about your personality either.


I think an important distinction here is to say that currently, you perceive them to be real. They aren't factually real things, at least not yet.

Judging someone's personality is a subjective process, not an objective one.


I do not. What I say is that I perceive them. Their realness is a non-issue (to me). "Factual", you mean by "authorities"? I do get your point, but I think you overthink the issue. If you see something, it is there. It can be illusory, sure, but think about why that matters.


Do you have any resources you'd recommend to form a better understanding of how NNs tick? I'd like to get a better intuitive grasp on what's going on - I've mostly just been responding to that with "Well, if stochastic parrotism can do all this..."


In case you haven't seen it yet, the term "stochastic parrot" was introduced by this paper [1], titled "On the Dangers of Stochastic Parrots". A related paper [2], titled "Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data", was awarded Best Paper by the Association for Computational Linguistics, and it's also easier to read.

Those two papers are critical of LLMs and discuss what the researchers believe they can and cannot do. I'm not saying you need to agree with them, but I think reading them should give you a good primer on why some researchers are not as excited as HN users are.

[1] https://dl.acm.org/doi/10.1145/3442188.3445922

[2] https://aclanthology.org/2020.acl-main.463/


Thank you!


The recent post from Stephen Wolfram [1] is pretty good as an introduction, but I haven't seen any super comprehensive material that tries to dissect all the interesting behaviour we see in the really big LLMs. For that, just reading the relevant papers themselves has been pretty fruitful for me. Some of them are actually very well written, even if you aren't used to reading scientific papers. I can recommend the Sparks of AGI paper [2] and the Toolformer paper [3].

Obviously there's much more out there, but those three things are a pretty good read.

[1]: https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-...

[2]: https://arxiv.org/abs/2303.12712

[3]: https://arxiv.org/abs/2302.04761


At this point, anyone using the term "stochastic parrot" is just giving credence to my personal belief that humans are also stochastic parrots.


A more apt term would be delusional parrot, applies both to the LLM and to everyone else who thinks GPT is the second coming of Jesus.


I mean, it's probably a component (or an approximation of a component) of what we do, at some level. Christ knows I've felt like a stochastic parrot when I'm zoning out 3 hours into a meeting and someone asks me a question out of the blue. I probably have a smaller context window at those points than GPT-3 does...


The author is a college freshman; I stopped reading after the about-me.


I am totally against GitHub using public code that isn't under a permissive licence. I too want to move off of it.


nudge


I recently rolled out my first full software product. Some of my code is OK to be open-sourced, and some isn't. I considered GitHub, but it's just too Microsoft-y now. I ended up rolling out a Gitea instance. It doesn't have feature parity, but at least my code is on my server and there aren't any AIs / MS employees reading it (I hope!)
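
For anyone wondering what the actual move looks like, here's a rough Python sketch of pulling an existing GitHub repo into a Gitea instance through Gitea's repo migration API. The host, owner and token are placeholders, and field names can vary slightly between Gitea versions, so check the API docs (/api/swagger) on your own instance:

    import requests

    # Placeholder self-hosted Gitea instance and API token.
    GITEA = "https://git.example.com"
    TOKEN = "your-gitea-api-token"

    # Ask Gitea to migrate (and optionally keep mirroring) a repo
    # from GitHub into the given owner's account.
    resp = requests.post(
        f"{GITEA}/api/v1/repos/migrate",
        headers={"Authorization": f"token {TOKEN}"},
        json={
            "clone_addr": "https://github.com/example/some-project.git",
            "repo_name": "some-project",
            "repo_owner": "me",   # target user or org on the Gitea side
            "mirror": True,       # keep pulling updates from the source
            "private": True,
        },
        timeout=60,
    )
    resp.raise_for_status()
    print("Migrated:", resp.json()["full_name"])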


No sense getting to the front page…


Hacker News is like a book club: the articles are catalysts for discussion—and clearly a lot of folks have things to say


Feeling betrayed by Microsoft? Well, you should have learned your lesson the first time.


You have your points, I have to admit. Made me think about it too.


They took down Male Poon for no actual legal reason.


Anyone who refuses to embrace these AI tools will simply not be able to compete. As new tools and tech appear in any trade you more or less have no choice but to embrace them.


Does Copilot train on private projects too?


It's been proven that yes, it does. I suspect they've been retraining Copilot in the past months to undo this initial training mistake.


My git client can talk to any remote origin; who cares? Good on this kid for the clout chase, though. Being upset with AI is just comedy at this point...


AI tools like GitHub copilot are able to help “give back” to open source because open source developers can use them to be more productive.


The greater good here is what tools like GPT in its various flavors and interfaces are offering. Learning is good, not bad. Get a grip.


A bit pompous without real content.


Perhaps a variant of Eternal September...



