> We work very closely with Google DeepMind to adapt Gemini models for Google-scale coding and other Software Engineering usecases.
Considering how terrible and frequently broken the code the public-facing Gemini produces is, I have to be honest: that kind of scares me.
Gemini frequently fails at some fairly basic stuff, even in popular languages where it would have had a lot of source material to work from and where other public models (even free ones) sail through.
To give a fun, fairly recent example, here's a prime factorisation algorithm it produced for Python (can you spot all the problems?):
# Find the prime factorization of n
prime_factors = []
while n > 1:
    p = 2
    while n % p == 0:
        prime_factors.append(p)
        n //= p
    p += 1
prime_factors.append(n)
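For comparison, a correct trial-division version would look more like this (a minimal sketch for reference):
# Correct trial division: p is initialised once, before the loop, and each
# prime factor is divided out completely before moving on.
prime_factors = []
p = 2
while p * p <= n:
    while n % p == 0:
        prime_factors.append(p)
        n //= p
    p += 1
if n > 1:
    prime_factors.append(n)  # whatever remains at the end is itself prime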
They probably use AI for writing tests, small internal tools/scripts, building generic frontends and quick prototypes/demos/proofs of concept. That could easily be that 25% of the code. And modern LLMs are pretty okayish with that.
I believe most people use AI to help them quickly figure out how to use a library or an API without having to read all of its (often outdated) documentation, rather than to help them solve some mathematical challenge.
If the documentation is so out of date that it doesn't help, that doesn't bode well for the AI's training data helping it get things right either, does it?
Unfortunately, it often hallucinates wrong parameters (or gets their order wrong) when there are multiple different APIs for similar packages. For example, there are plenty of ML model inference packages, and the code suggestions for NVIDIA Triton Inference Server Python code are pretty much always wrong: it generates code that's probably correct for other Python ML inference packages with slightly different APIs.
I often find the opposite. Documentation can be up to date, but AI suggests deprecated or removed functions because there’s more old code than new code. Pgx v5 is a particularly consistent example.
We are sorely lacking a "Make Computer Science a Science" movement. The tech lead's blurb is par for the course: it talks about "SWE productivity" with no reference to scientific inquiry or to a foundational understanding of safety, correctness, verification, and validation of these new LLM technologies.
Did you know that Software Engineering is a university level degree? That it is a field of scientific study, with professors who dedicate their lives to it? What happens when companies ignore science and worse yet cause harm like pollution or medical malpractice, or in this case, spread Silicon Valley lies and bullshit???
Did you know? How WEIRD.
How about you not harass other commenters with such arrogantly ignorant sarcastic questions?? Or is that part of corporate "for-profit" culture too????
> Did you know that Software Engineering is a university level degree? That it is a field of scientific study, with professors who dedicate their lives to it?
So is marketing? So is finance? So is petroleum engineering?
I didn't say it's hard, but it's most definitely leetcode, as in "pointless algorithmic exercise that will only show you if the candidate recently worked on a similar question".
Curious, I would expect a programmer of your age to remember Knuth's "Beware of bugs in the above code; I have only proved it correct, not tried it."
I'm happy you know math, but my point before this thread got derailed was that we're holding (coding) AI to a higher standard than actual humans, namely expecting it to write bug-free code.
> my point before this thread got derailed was that we're holding (coding) AI to a higher standard than actual humans, namely expecting it to write bug-free code
This seems like a very layman attitude, and I would be surprised to find many devs adhering to it. Comments in this thread alone suggest that many devs on HN do not agree.
I hold myself to a higher standard than AI tools are capable of, from my experience. (Maybe some people don't, and that's where the disconnect is between the apologists and the naysayers?)
Humans can actually run the code and know what it should output. The LLM can't, and putting it in a loop against the code's output doesn't work well either, since the LLM can't navigate that well.
A senior programmer like me knows that primality-based problems like the one posed in your link are easily gamed.
Testing for small prime factors is easy - brute force is your friend. Testing for large prime factors requires more effort. So the first trick is to figure out the bounds to the problem. Is it int32? Then brute-force it. Is it int64, where you might have a value like the Mersenne prime 2^61-1? Perhaps it's time to pull out a math reference. Is it longer, like an unbounded Python int? Definitely switch to something like the GNU Multiple Precision Arithmetic Library.
In this case, the maximum value is 1,000, which means we can enumerate all distinct primes in that range and test each one for presence in each input value, one by one:
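Something along these lines fits that description and passes the tests below (a sketch, assuming a module-level _primes list holding every prime up to 1,000, computed further down):
def distinctPrimeFactors(nums: list[int]) -> int:
    # Constraints from the problem statement, enforced only in debug mode.
    assert 1 <= len(nums) <= 10_000, "size out of range"
    distinct_factors = set()
    for num in nums:
        assert 2 <= num <= 1_000, "num out of range"
        # Check each precomputed prime for presence in this value.
        for p in _primes:
            if p > num:
                break
            if num % p == 0:
                distinct_factors.add(p)
    return len(distinct_factors)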
That worked without testing, though I felt better after I ran the test suite, which found no errors. Here's the test suite:
import unittest

class TestExamples(unittest.TestCase):
    def test_example_1(self):
        self.assertEqual(distinctPrimeFactors([2,4,3,7,10,6]), 4)

    def test_example_2(self):
        self.assertEqual(distinctPrimeFactors([2,4,8,16]), 1)

    def test_2_is_valid(self):
        self.assertEqual(distinctPrimeFactors([2]), 1)

    def test_1000_is_valid(self):
        self.assertEqual(distinctPrimeFactors([1_000]), 2)  # (2*5)**3

    def test_10_000_values_is_valid(self):
        values = _primes[:20] * (10_000 // 20)
        assert len(values) == 10_000
        self.assertEqual(distinctPrimeFactors(values), 20)

@unittest.skipUnless(__debug__, "can only test in debug mode")
class TestConstraints(unittest.TestCase):
    def test_too_few(self):
        with self.assertRaisesRegex(AssertionError, "size out of range"):
            distinctPrimeFactors([])

    def test_too_many(self):
        with self.assertRaisesRegex(AssertionError, "size out of range"):
            distinctPrimeFactors([2]*10_001)

    def test_num_too_small(self):
        with self.assertRaisesRegex(AssertionError, "num out of range"):
            distinctPrimeFactors([1])

    def test_num_too_large(self):
        with self.assertRaisesRegex(AssertionError, "num out of range"):
            distinctPrimeFactors([1_001])

if __name__ == "__main__":
    unittest.main()
I had two typos in my test suite (an "=" for "==", and a ", 20))" instead of "), 20)"), and my original test_num_too_large() tested 10_001 instead of the boundary case of 1_001, so three mistakes in total.
If I had no internet access, I would compute that table thusly:
_primes = [2]
for value in range(3, 1000):
    if all(value % p > 0 for p in _primes):
        _primes.append(value)
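As a quick sanity check on that table: there are 168 primes below 1,000, and the largest is 997.
assert len(_primes) == 168   # count of primes below 1,000
assert _primes[-1] == 997    # largest prime below 1,000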
Do let me know of any remaining mistakes.
What kind of senior programmers do you work with who can't handle something like this?
EDIT: For fun I wrote an implementation based on sympy's integer factorization:
from sympy.ntheory import factorint

def distinctPrimeFactors(nums: list[int]) -> int:
    distinct_factors = set()
    for num in nums:
        distinct_factors.update(factorint(num))
    return len(distinct_factors)
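This works because factorint returns a dict mapping each prime factor to its exponent, and set.update() over a dict adds only the keys. For example:
from sympy.ntheory import factorint

print(factorint(1_000))   # {2: 3, 5: 3} -> the distinct primes are 2 and 5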
Here's a new test case, which takes about 17 seconds to run:
Empirical testing (for example: https://news.ycombinator.com/item?id=33293522) has established that the people on Hacker News tend to be junior in their skills. Understanding this fact can help you understand why certain opinions and reactions are more likely here. Surprisingly, the more skilled individuals tend to be found on Reddit (same testing performed there).
I’m not sure that’s evidence; I looked at that and saw it was written in Go and just didn’t bother. As someone with 40 years of coding experience and a fundamental dislike of Go, I didn’t feel the need to even try. So the numbers can easily be skewed, surely.
Only individuals who submitted multiple bad solutions before giving up were counted as failing. If you look but don't bother, or submit a single bad solution, you aren't counted. Thousands of individuals were tested on Hacker News and Reddit, and surprisingly, it's not even close: Reddit is where the hackers are. I mean, at the time of the testing, years ago.
That doesn't change my point. It didn't test every dev on all platforms; it tested a subset, and that subset may well have different attributes from the ones that didn't engage. So it says nothing about the audience for the forums as a whole, just the few thousand who engaged.
It could even be that there are fewer Go programmers here and some just took a stab at it even though they don't know the language, so it could just be selecting for which forum has the most Go programmers. Hardly rigorous.
Agreed. But remember, this isn't the only time the population has been tested. This is just the test (from two years ago, in 2022) that I happen to have a link to.
It's also fine to be an outlier. I've been programming for 24 years and have been hanging out on HackerNews on and off for 11. HN was way more relevant to me 11 years ago than it is now, and I don't think that's necessarily only because the subject matter changed, but probably also because I have.
The way the site works is explained in the first puzzle, "Hack This Site". TLDR, it builds and runs your code against a test suite. If your solutions weren't accepted, it's because they're wrong.