
You will never see this from a single human being.

Probably, within a few months, one (or more, possibly collaborating) LLMs just might.

Always fun to see "the difficult ones" proven.




You'd still have to describe to an LLM what to do, which strikes me as about equivalent in complexity to simply writing the code in the first place (code is, after all, a formal description of how a program should behave, even if we perceive it as more complex than strictly necessary). The big wins so far have come from leveraging LLMs to apply common patterns across multiple codebases (albeit in a buggy and haphazard fashion), but it's still up to humans to compose those patterns into meaningful programs and to validate that the result actually behaves as expected or desired. Humans still have a far superior understanding of what software is, how it functions, what our intentions are in using it, and how to derive good software from bad: we know what a bug is intuitively, in a way that LLMs have not been able to demonstrate at all.

However, being able to rewrite a program with formally well-defined behavior (i.e. code) should be within an LLM's capability. In practice, LLMs are still a long way from demonstrating semantically coherent coding skills; mostly they regurgitate common patterns, often filled with bugs and/or incoherent semantics.


An LLM is only an Internet search engine with a fancier interface; it doesn't actually reason about anything. There is no "semantic" anything about an LLM's output.


Non-RAG LLMs don't search the internet, nor do they even have the capability to do so.


I don't personally detect sentience yet, but about 5% of my inputs result in some sort of interpretation and/or reasoning. Sometimes I have to think for days about why LLM.xyz made such a strange connection, only to realize the schizotypisms of the machine aren't often wrong, just different.


People ascribe sentience and emotion to a smiley face picture, and that is just two dots and a curved line.

That's just what people do; we are hard-wired to see social cues even where there are none.


Absolutely false: at the core of every LLM is a highly compressed text corpus from an Internet search engine.

(The wonder here isn't that an LLM succeeds at text retrieval tasks, the wonder is how highly compressed the index turns out to be. But maybe we just severely overestimate our own information complexity.)
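Back of the envelope, with every number here being a rough assumption rather than any specific model's published figures, the "index" really does come out a few hundred times smaller than the text behind it:

    # all figures are assumed for illustration, not real specs
    train_tokens = 15e12          # assume ~15T training tokens
    bytes_per_token = 4           # assume ~4 bytes of raw text per token
    corpus_bytes = train_tokens * bytes_per_token

    params = 70e9                 # assume a 70B-parameter model
    bytes_per_param = 2           # fp16/bf16 weights
    model_bytes = params * bytes_per_param

    print(f"corpus ~{corpus_bytes/1e12:.0f} TB, model ~{model_bytes/1e9:.0f} GB")
    print(f"ratio ~{corpus_bytes/model_bytes:.0f}:1")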


So, you're saying an LLM is just a database that does text retrieval?


Yes, using a statistical model which is in effect a very lossy compressor.


So, what you're telling me is that everything they say has already been said before, completely verbatim? Like, if I asked it to write a story about a dog named Jebediah surfing to planet Xbajahabvash, it would basically just find a link to someone else's story about the same dog surfing to the same planet? That sounds like an infinitely large number of combinations. Perhaps the internet is just infinitely large, squared (or even circled).
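Back of the envelope, assuming a 50,000-token vocabulary and a 500-token story (both numbers purely illustrative):

    from math import log10

    vocab = 50_000      # assumed vocabulary size
    length = 500        # assumed story length in tokens
    digits = length * log10(vocab)
    print(f"~10^{digits:.0f} possible token sequences")

That's vastly more sequences than could ever have appeared verbatim anywhere.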


So, like a human, then?


Reasoning is not necessary, but semantic coherence is.


It's only as semantically coherent as its training database. An LLM is, in effect, just a lossy compression of its training database. The compression is based on statistical maximum likelihood estimation; there are no mental (or any other kind of) models involved in compressing the training database.
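For a sense of what "maximum likelihood as lossy compression" means, here's a toy bigram sketch (purely illustrative; a real transformer is vastly more complicated):

    from collections import Counter, defaultdict

    corpus = "the cat sat on the mat the cat ate".split()

    # maximum-likelihood bigram model: P(next | current) = count(current, next) / count(current)
    counts = defaultdict(Counter)
    for cur, nxt in zip(corpus, corpus[1:]):
        counts[cur][nxt] += 1

    def p(nxt, cur):
        total = sum(counts[cur].values())
        return counts[cur][nxt] / total if total else 0.0

    print(p("cat", "the"))   # 2/3: "the" is followed by "cat" twice, "mat" once
    print(p("sat", "cat"))   # 1/2: the model has lost which "cat" did what -- lossy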

You can claim that mental models don't actually exist and everything in the universe is just maximum likelihood, but that would be a religious/spiritual statement, outside the realm of science.


We'll have to see what Google's most advanced model, with its ridiculously larger context window, can do once it's fully released into the wild. Refactoring an entire codebase is presumably asking too much, but the released model can already handle small refactorings, so the question is where its upper limits are.

Of course, to get it better at refactoring, everyone has to write blog posts on refactoring to feed the machine.


Why blog posts? Wouldn't the codebase "self-evolve" based on the proficiencies of other published code?



