
I've found that writing randomized unit tests for each small part of a system finds this sort of stuff immediately.

In this case, a test that just generated 1,000,000 random strings and passed them to punycode would probably have found the problem (maybe not in the first run, but after a week or two in continuous integration).

I try to structure the tests so they can run with dumb random input or with coverage-guided input from a fuzzer. The former usually finds >90% of the bugs the fuzzer would, and does so 100x faster, so it beats fuzzing during development; the fuzzer still wins for nightly testing.

One other advantage of dumb random input is that it works with distributed systems and with things written in multiple languages (where coverage information isn't readily available).
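A minimal sketch of what that structure could look like in Go, not the commenter's actual setup: golang.org/x/net/idna stands in for the punycode implementation from the article, the property lives in one shared function, a plain test drives it with dumb random input, and a native Go fuzz target drives it with coverage-guided input. Package name, helpers, and iteration counts are all illustrative.

    package punyfuzz

    import (
        "math/rand"
        "testing"
        "time"

        "golang.org/x/net/idna"
    )

    // checkDecode is the shared property: the decoder must never panic.
    // Returning an error for garbage input is fine; crashing is not.
    func checkDecode(t *testing.T, input string) {
        t.Helper()
        _, _ = idna.ToUnicode(input)
    }

    // randomLabel builds a short random ASCII label prefixed with "xn--".
    func randomLabel(rng *rand.Rand) string {
        b := make([]byte, rng.Intn(64))
        for i := range b {
            b[i] = byte(rng.Intn(128))
        }
        return "xn--" + string(b)
    }

    // Dumb random input: cheap enough to run on every CI job.
    func TestDecodeRandom(t *testing.T) {
        seed := time.Now().UnixNano()
        t.Logf("seed=%d", seed) // log the seed so a failure can be replayed
        rng := rand.New(rand.NewSource(seed))
        for i := 0; i < 1_000_000; i++ {
            checkDecode(t, randomLabel(rng))
        }
    }

    // Coverage-guided input: run nightly with `go test -fuzz=FuzzDecode`.
    func FuzzDecode(f *testing.F) {
        f.Add("xn--nxasmq6b") // one seed-corpus entry
        f.Fuzz(func(t *testing.T, s string) {
            checkDecode(t, s)
        })
    }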




I really like this idea because it avoids the issue of fuzzers needing to burn tons of CPU just to get down to the actual domain logic, which can have really thorny bugs. Per usual, the idea of "unit" gets contentious quickly, but with appropriate tooling I could foresee adding annotations to code that leverage random input, property-based testing, and a user-defined dictionary of known weird inputs.
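Not an existing annotation system, just a sketch of the "user-defined dictionary of known weird inputs" part, continuing the hypothetical punyfuzz package above (it reuses that sketch's checkDecode helper): the dictionary is replayed on every run regardless of the random seed, and the same entries could also be registered as the fuzz target's seed corpus via f.Add.

    package punyfuzz

    import (
        "strings"
        "testing"
    )

    // weirdInputs is a hand-maintained dictionary of inputs that have caused
    // trouble before: empty labels, bare prefixes, embedded NULs, oversized runs.
    var weirdInputs = []string{
        "",
        "xn--",
        "xn--\x00",
        "xn--" + strings.Repeat("9", 1024),
        strings.Repeat("a", 64), // one byte over the 63-byte DNS label limit
    }

    // Every run replays the whole dictionary, independent of the random seed.
    func TestDecodeWeirdInputs(t *testing.T) {
        for _, s := range weirdInputs {
            checkDecode(t, s) // shared property from the earlier sketch
        }
    }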


> maybe not in the first run, but after a week or two in continuous integration

You'd use a different seed for each CI run??

That sounds like a nightmare of non-determinism, and a loss of trust in the CI system in general.


Not if you log the seed of the failing runs


Yep; I definitely log the seed and re-seed every few seconds.

Most software I work on these days is non-deterministic anyway (it involves the network, etc.), so CI is fundamentally going to fail some runs and not others.

Even stuff like deterministic simulation has this property: Those test suites rely on having a large number of randomly generated schedules, so there's always a chance that running the test one more time (with a new seed) will find a new bug.
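A sketch of that log-and-re-seed pattern, continuing the hypothetical punyfuzz package from above (checkDecode and randomLabel are the earlier helpers; the TEST_SEED variable is made up for illustration): each batch gets a fresh wallclock-derived seed that gets logged, and a failing batch can be replayed by pinning that seed.

    package punyfuzz

    import (
        "math/rand"
        "os"
        "strconv"
        "testing"
        "time"
    )

    const perSeedIterations = 500_000 // a few seconds per batch; tune to taste

    func TestDecodeLongRandom(t *testing.T) {
        deadline := time.Now().Add(30 * time.Second)
        for time.Now().Before(deadline) {
            seed := time.Now().UnixNano()
            if s := os.Getenv("TEST_SEED"); s != "" {
                if v, err := strconv.ParseInt(s, 10, 64); err == nil {
                    seed = v // replay a seed logged by a failing run
                }
            }
            t.Logf("seed=%d", seed) // the line to grep for when CI fails
            rng := rand.New(rand.NewSource(seed))

            // A fixed iteration count per seed keeps replays deterministic.
            for i := 0; i < perSeedIterations; i++ {
                checkDecode(t, randomLabel(rng))
            }
            if os.Getenv("TEST_SEED") != "" {
                return // a pinned seed runs exactly one batch
            }
        }
    }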


If you’re using Git, I’d strongly recommend the hash of the current tree (not the commit). That way your tests are deterministic based on the contents of your tree. For example, if you add a commit and then revert, you’ll end up with the same test seed as if you hadn’t committed.
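One way to get that in Go (a sketch; the package and function names are made up): shell out to git rev-parse HEAD^{tree} and fold the hash into a seed. Note that HEAD^{tree} only reflects committed content; covering uncommitted edits would need something like git write-tree on a temporary index.

    package seed

    import (
        "encoding/binary"
        "encoding/hex"
        "fmt"
        "os/exec"
        "strings"
    )

    // TreeSeed derives a deterministic RNG seed from the hash of the current
    // Git tree, so the same tree contents always produce the same test seed.
    func TreeSeed() (int64, error) {
        out, err := exec.Command("git", "rev-parse", "HEAD^{tree}").Output()
        if err != nil {
            return 0, fmt.Errorf("git rev-parse: %w", err)
        }
        raw, err := hex.DecodeString(strings.TrimSpace(string(out)))
        if err != nil || len(raw) < 8 {
            return 0, fmt.Errorf("unexpected tree hash %q: %v", out, err)
        }
        // Fold the first 8 bytes of the tree hash into an int64 seed.
        return int64(binary.BigEndian.Uint64(raw[:8])), nil
    }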


The important thing to log is the seed passed to the rng. (It’s usually wallclock time measured at nanosecond granularity.)

In a typical night for a production-quality system, 100-1000+ hours of such tests would run, all with different seeds, in diverse configurations, etc., so the seed isn't derived from the git SHA or the source code.


Or someone just punches retry


Ignoring test failures is a different issue. It can range from known bugs and rational tradeoffs to a hiring/HR issue.

Establishing a healthy dev/engineering culture is hard.


You don't have auto retries anywhere in your tests?


aka property testing.
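For the Go sketches above, the standard library's testing/quick package is one off-the-shelf way to get that property-testing flavor (again with golang.org/x/net/idna standing in for the decoder under test):

    package punyfuzz

    import (
        "testing"
        "testing/quick"

        "golang.org/x/net/idna"
    )

    func TestDecodeQuick(t *testing.T) {
        // Property: decoding any string must not panic; errors are acceptable.
        property := func(s string) bool {
            _, _ = idna.ToUnicode(s)
            return true
        }
        if err := quick.Check(property, &quick.Config{MaxCount: 100_000}); err != nil {
            t.Error(err)
        }
    }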



