> "Json" and "Go" seem antithetical in the same sentence as "high performance" t...

3cats-in-a-coat · on Nov 5, 2023

I think people say that as they give disproportional weight to the fact it's text-based, while ignoring how astoundingly simple and linear it is to write and read.

The only way to nudge the needle is to start exchanging direct memory dumps, which is what ProtoBuff and the like do. But this is clearly only for very specific use.

riku_iki · on Nov 5, 2023

> while ignoring how astoundingly simple and linear it is to write and read.

code maybe simple, but you have lots of performance penalties: resolving field keys, you need to construct some complicated data structures through memory allocations, which is expensive.

> to start exchanging direct memory dumps, which is what ProtoBuff and the like do

Protobuff actually is doing parsing, it is just binary format. What you describing is more like Flatbuffer.

> But this is clearly only for very specific use.

yes, specific use is high performance computations )

3cats-in-a-coat · on Nov 6, 2023

Resolving what keys? JSON has keyval sequences ("objects") but what you do with them is entirely up to you. In a streaming reader you can do your job without ever creating maps or dehydrating objects.

Plus no one makes people use objects in JSON. If you can send a tuple of fields as an array... then send an array.

riku_iki · on Nov 6, 2023

> In a streaming reader you can do your job without ever creating maps or dehydrating objects.

but you need some logic which will check that key is what you want to process, and also do various type transformation (e.g. json string to long).

> Plus no one makes people use objects in JSON. If you can send a tuple of fields as an array... then send an array.

then this shouldn't be called JSON parsing anymore.

3cats-in-a-coat · on Nov 6, 2023

The "logic" to match a key is not slow. Your CPU can parse hundreds of keys while waiting for the next keyval sequence to be loaded from RAM.

As for whether you must stuff everything in objects for it to be JSON, I mean that makes no sense. JSON arrays are also JSON. And JSON scalars are also JSON.

If this argument requires we deliberately go out of our way to be dumb, then it's a bad argument.

I'm writing a platform which has a JSON-like format for message exchange and I realized early on that in serialized data, maps are at best nominal. You process everything serially (hence the word serialization). It's a stream of tokens. The fact some tokens are marked as keys and some as values is something that can be useful to communicate intent, but it doesn't mandate how you utilize it.

Everything else is just prejudices and biases, such as "maps take resources to allocate". JSON doesn't force you to build maps when you have an object. Point in the spec where JSON mandates how you must store the parsed results of a JSON object. Should it be a hashmap? A b-tree? A linked list? A tuple? Irrelevant.

riku_iki · on Nov 6, 2023

> The "logic" to match a key is not slow. Your CPU can parse hundreds of keys while waiting for the next keyval sequence to be loaded from RAM.

RAM on my computer works with pages loaded into CPU cache.

Rants, personal attacks, handwavings are ignored.

3cats-in-a-coat · on Nov 6, 2023

No, pages are not loaded in cache. Cache lines are. RAM pages are typically 4kb, and cache lines are most commonly 64 bytes. This means you have 64 cache lines per RAM page. And this entire detour has no relevance to what I said in the first place, which still stands. But you know, someone was wrong on the Internet.

riku_iki · on Nov 6, 2023

> RAM pages

I didn't say "RAM pages", well vectorrized code can dump MBs of data preemptively into cache, instead of reading "next keyval sequence" each time.

> And this entire detour has no relevance to what I said in the first place,

or you just don't understand such relevance.

3cats-in-a-coat · on Nov 6, 2023

I never said the CPU reads one key at a time. I said it can decode hundreds of keys at the time it loads one from memory. This is completely irrelevant of how memory reads are batched. It's about a ratio, like 100:1 get it? Seems like you felt your ego attacked, and you just had to respond in a patronizing way about something, but didn't know about what.

Hashing a string as you read it from memory and jumping to a hash bucket is not an expensive operation. This entire argument sounds like some kindergarten understanding of compute efficiency. This is not a 6052.

riku_iki · on Nov 6, 2023

Will ignore you this time )

3cats-in-a-coat · on Nov 6, 2023

Excellent work. Don't drop to my level where I made a simple factual statement.

riku_iki · on Nov 7, 2023

lets sync on numbers maybe? My engine processes 600MB/s (mebabytes, not megabits) of data per core (I have very many cores) for my wire format, current bottleneck is that linux vpages system can't allocate/deallocate pages fast enough when reading from NVME raid.

What are your numbers for your cool json serializer?

3cats-in-a-coat · on Nov 7, 2023

I think maybe you possibly forgot what our argument was. I said the bottleneck is memory, and not processing/hashing the keys to match them to the symbol you want to populate.

And you're currently telling me the bottleneck is memory.

I also said you don't need to parse a JSON object to a hashmap or a b-tree. The format suggests nothing of the sort. You can hash the key and fill it into a symbol slot in a tuple which... literally only takes the amount of RAM you need for the value, while the key is "free", because it just resolves to a pointer address.

Additionally, if you have a fixed tuple format, you can encode it as a JSON array, thus skipping the keys entirely. None of that is against JSON. You decide what you need and what you don't need. The keyvals are there when you need keyvals. No one is forcing you to use them where you don't need them at gunpoint.

I have a message format for a platform I'm working on, it has a JSON option, just for compatibility. It doesn't use objects at all, yet (but it DOES transfer object states). Nested arrays are astonishingly powerful on their own with the right mindset.

riku_iki · on Nov 7, 2023

> And you're currently telling me the bottleneck is memory.

Not memory, but virtual pages implementation in linux, which is apparently single threaded and doesn't scale to high throughput. There was a patch to fix this, but it didn't make to mainline: https://lore.kernel.org/lkml/20180403133115.GA5501@dhcp22.su...

3cats-in-a-coat · on Nov 8, 2023

"High throughput" seems like an odd problem to have. You don't have to throw away pages and allocate new ones all the time. You can reuse a page.

riku_iki · on Nov 8, 2023

That's not up to me. It is Linux virtual fs implemented that way