I feel like people are pushing SQLite beyond its limits with this stuff. If you're going to do something like this, wouldn't a normal client/server option like PostgreSQL just be way more performant?

I think SQLite is fantastic, but this just smacks of scope creep.




There are some real benefits to using SQLite as a "serialization format" over something like JSON or Protobuf.

To elaborate on a specific case, think about a 'package-lock.json' file in a Node project. These files are frequently many megabytes in size, even for simple projects, and if you wanted to do something like inventory _every package_ used on GitHub, it would be a ton of data.

You might argue that you could compress these files because they're text, and that's true, but what about when you only need to read a subset of the data? For example, what if you needed to answer, "Does this project depend on React?"

If you have it as JSON or Protobuf, you have to deserialize the entire data structure before you can query it (or write some nasty regex). With SQLite you have a format you can efficiently search through.
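To make that concrete, here's a minimal Python sketch of the difference. The lockfile layout, the 'package-lock.sqlite' file, and the 'dependencies' table are all assumptions for illustration, not anyone's actual schema:

    import json
    import sqlite3

    # JSON route: answering "does this project depend on React?" means
    # parsing the whole multi-megabyte file into memory first.
    with open("package-lock.json") as f:   # hypothetical lockfile path
        lock = json.load(f)                # full deserialization, every byte
    print("react" in lock.get("dependencies", {}))

    # SQLite route: the same question reads only the pages it needs.
    conn = sqlite3.connect("package-lock.sqlite")
    row = conn.execute(
        "SELECT 1 FROM dependencies WHERE name = ? LIMIT 1", ("react",)
    ).fetchone()
    print(row is not None)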

There are other formats in this space, like ORC and Parquet, but they're optimized for columnar reads (scan only _this_ column). They don't give you indexed point lookups over a large dataset the way SQLite's B-tree indexes do.
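Continuing the hypothetical 'dependencies' table from the sketch above, you can see the index doing that work with EXPLAIN QUERY PLAN:

    import sqlite3

    conn = sqlite3.connect("package-lock.sqlite")  # same hypothetical file
    conn.execute(
        "CREATE TABLE IF NOT EXISTS dependencies"
        " (project TEXT, name TEXT, version TEXT)"
    )
    conn.execute("CREATE INDEX IF NOT EXISTS idx_dep_name ON dependencies (name)")

    # The plan reports a B-tree SEARCH on the index rather than a
    # full-table SCAN, which is what makes selective queries cheap.
    for row in conn.execute(
        "EXPLAIN QUERY PLAN SELECT project FROM dependencies WHERE name = ?",
        ("react",),
    ):
        print(row)  # ...SEARCH dependencies USING INDEX idx_dep_name (name=?)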

This is my understanding, at least. If anybody has an opposing perspective, I'd appreciate hearing it!


I would call this hacking, actually. Is it useful in a financially productive sense? Maybe; I don't care. It's pushing a program we like past its limits. That's fun.



