Oh come on... how can one assume a binary, TLV-encoded format is not faster than parsing strings (JSON is generally schemaless btw, and that dynamicity adds overhead on top, while yes, protobuf also has variable-sized containers)? It is like claiming that parsing a string into an int has no overhead over reading a straight int (yes, I know protobufs still require varint decoding, but that is still a huge difference).

It is also not only speed: the size is usually an order of magnitude off as well (and no, compression doesn't cut it, and it trades size for computation again).

Sure, if size and speed do not matter, it is strange that you considered protobuf at all... but claiming they are never needed just means you have never worked on resource-constrained systems?

As for what you cite there: I assume most of that 400ms has nothing to do with the message encoding at all, btw.




(a) You're making assumptions based on rules of thumb; I'm talking about real-world usage: your points make sense in theory but don't necessarily reflect reality.

(b) I'm talking about a narrow & specific case. PB may outperform JSON in most cases, but I'm very specifically referring to browsers, where JSON is native (& highly optimised) whereas PB is provided by a selection of open-source libraries written in JavaScript. So that domain heavily favours JSON perf-wise.


> You're making assumptions based

No, not at all... coming from embedded, where speed, memory size and also bandwidth did count, JSON was actually not just worse, it simply wouldn't have been feasible (because our protobufs already barely fit within memory and MTU constraints).


One important thing to consider with JSON is that a lot of people really, really care about JSON performance -- optimising parsing in assembler and rewriting internal data structures just to make serialising + deserialising JSON faster.

I'm sure that, given two implementations of equal quality, protobuf would easily outperform JSON, but I can also believe the JSON implementation in (for example) V8 is very, very hard to beat.



I just benchmarked it on my computer -- the protobuf is twice as fast (well, 1.8x), which is good, but I don't think I'd use that as a basis for choosing the technology I use.
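
Roughly the shape of what I ran, as a sketch (protobuf.js decode vs native JSON.parse; the message definition, payload and iteration count here are made up for illustration, not my exact setup):

    import * as protobuf from "protobufjs";

    // Illustrative message only; proto3 packs repeated scalars by default.
    const { root } = protobuf.parse(`
      syntax = "proto3";
      message Points {
        repeated float values = 1;
      }
    `);
    const Points = root.lookupType("Points");

    // The same 100 floats in both representations.
    const values = Array.from({ length: 100 }, (_, i) => i * 0.5);
    const pbBuffer = Points.encode(Points.create({ values })).finish(); // Uint8Array
    const jsonText = JSON.stringify({ values });

    // performance.now() exists in browsers and modern Node.
    function time(label: string, iterations: number, fn: () => void): void {
      const start = performance.now();
      for (let i = 0; i < iterations; i++) fn();
      console.log(`${label}: ${(performance.now() - start).toFixed(1)} ms`);
    }

    time("protobuf.js decode", 100_000, () => Points.decode(pbBuffer));
    time("JSON.parse", 100_000, () => JSON.parse(jsonText));

The exact ratio will of course move around with payload shape, engine and library versions.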

Of course, I might use protobuf because I prefer it in my code to JSON, and it certainly is faster (even if only by 2x).


Have you stepped through protobuf processing code? There are a lot of special cases, ifs, and branches here and there. Protobufs within protobufs. It's not like it's just a size and then 100 floats packed together; there's more overhead than you'd think. (Not to mention the client-side allocations, etc.) I use protoc compiled to wasm for protobufs and it is fast, but there's a lot of wasm overhead to execute that code.

JSON parsing also has a lot of special cases and error checking, but the V8 team has spent a huge amount of time optimising JSON parsing (there are a few blog posts on it). I'm not assuming either way, but it's definitely not as cut and dried as one would assume.


Stepped through? Yes... as I hinted, coming from an embedded environment, I measured and compared highly optimized JSON parsing code (which even had severe limitations, like very limited nesting and no lists) vs nanopb => clear winner on all points (memory requirements, performance, encoded size) - which is really not that surprising?


There are two ways to encode a repeated field (100 floats, but it could also be any size up to the limits of repeated fields): "Ordinary (not packed) repeated fields emit one record for every element of the field." That means tag, value, tag, value, etc.

However, "packed" fields are exactly a length followed by a byte array of the typed data. Having unpacked as the default was an oversight in the original proto2 that is unlikely to be corrected there, but packed is the default in proto3.


100 (or any N) floats prefixed by a size is exactly what you would get from `repeated float f = 1 [packed=true];`
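
To make that layout concrete, here's a rough hand-rolled sketch (illustration only, not any particular library's code) of what a decoder sees for that declaration: one tag byte, a varint length, then the raw little-endian floats back to back:

    // Decodes a single packed `repeated float f = 1;` field from a buffer.
    function decodePackedFloats(buf: Uint8Array): number[] {
      let pos = 0;

      // Tag for field 1, wire type 2 (length-delimited): (1 << 3) | 2 = 0x0A.
      if (buf[pos++] !== 0x0a) throw new Error("expected packed field 1");

      // The length prefix is a varint (assumed here to fit in 32 bits).
      let len = 0, shift = 0, byte = 0;
      do {
        byte = buf[pos++];
        len |= (byte & 0x7f) << shift;
        shift += 7;
      } while (byte & 0x80);

      // Payload: len bytes of raw 32-bit floats, with no per-element tags.
      const view = new DataView(buf.buffer, buf.byteOffset + pos, len);
      const out: number[] = [];
      for (let off = 0; off < len; off += 4) out.push(view.getFloat32(off, true));
      return out;
    }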


They didn't assume, you did. They showed some real data and you reacted emotionally.


If there's a JSON parser faster than a PB parser (for the same underlying data content) it just means the JSON parser was optimized more. By every rule in computing, PB parsing is far faster than JSON for every use case for a simple reason: the messages use less RAM, and therefore, moving the data into the processor and decoding it takes less time.
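
As a rough, self-contained illustration of the size half of that argument (my own back-of-the-envelope numbers, assuming a packed proto3 float field; the JSON size depends on how many digits each value needs):

    // 100 floats as a packed protobuf field vs the same values as JSON text.
    const values = Array.from({ length: 100 }, (_, i) => Math.fround(Math.sin(i) * 1000));

    // Packed `repeated float`: 1 tag byte + 2-byte varint length + 4 bytes per float.
    const protoBytes = 1 + 2 + values.length * 4; // 403 bytes

    // JSON spells every value out as decimal text, plus punctuation and the key.
    const jsonBytes = JSON.stringify({ values }).length;

    console.log({ protoBytes, jsonBytes }); // jsonBytes comes out several times larger here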


Theoretical performance doesn't matter for UX, only real-world performance. Yes, conceptually it's possible to make protobufs faster than JSON, but someone still has to build that. Fast native JSON parsers already exist; that's the benchmark protobufs have to beat significantly to make the worse DX worth it.


I believe the answer is „it depends”: https://medium.com/aspecto/protobuf-js-vs-json-stringify-per....


Yes, sure, it depends on the implementation, as the poster above said. You need to compare similarly optimized implementations... but really: no surprise?!?


How can JavaScript code (PB decoder) be faster than native code (JSON parser)?


Much, much less processing to do. Most of PB decoding is just reading bytes until you fill your data structure.
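
To illustrate (a simplified sketch, not protobuf.js or protoc output, and it assumes values fit in 32 bits): the core loop is "read a tag, switch on the wire type, copy or skip some bytes", with none of the character-level scanning, quoting and number parsing JSON needs:

    // Reads one base-128 varint; simplified to values that fit in 32 bits.
    function readVarint(buf: Uint8Array, pos: number): { value: number; pos: number } {
      let value = 0, shift = 0, byte = 0;
      do {
        byte = buf[pos++];
        value |= (byte & 0x7f) << shift;
        shift += 7;
      } while (byte & 0x80);
      return { value: value >>> 0, pos };
    }

    // Walks the top-level fields of an encoded message, printing or skipping them.
    function walkMessage(buf: Uint8Array): void {
      let pos = 0;
      while (pos < buf.length) {
        const tag = readVarint(buf, pos);
        pos = tag.pos;
        const fieldNumber = tag.value >>> 3;
        const wireType = tag.value & 0x07;

        switch (wireType) {
          case 0: { // varint (int32, int64, bool, enum, ...)
            const v = readVarint(buf, pos);
            pos = v.pos;
            console.log(`field ${fieldNumber}: varint ${v.value}`);
            break;
          }
          case 1: // 64-bit fixed (double, fixed64)
            pos += 8;
            break;
          case 2: { // length-delimited (string, bytes, packed fields, sub-messages)
            const len = readVarint(buf, pos);
            pos = len.pos + len.value;
            console.log(`field ${fieldNumber}: ${len.value} bytes`);
            break;
          }
          case 5: // 32-bit fixed (float, fixed32)
            pos += 4;
            break;
          default:
            throw new Error(`unsupported wire type ${wireType}`);
        }
      }
    }

A real decoder does the same walk but writes each value straight into a known slot of the target object, which is why there is so little work per field.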



