I feel like people are pushing SQLite beyond its limits with this stuff. If you're going to do something like this, wouldn't a normal client/server option like PostgreSQL just be way more performant?

I think SQLite is fantastic, but this just smacks of scope creep.




There are some real benefits to using SQLite as a "serialization format" over something like JSON or Protobuf.

To elaborate on a specific case, think about a 'package-lock.json' file in a Node project. These files are frequently many megabytes in size, even for simple projects, and if you wanted to do something like inventory _every package_ used on GitHub, it would be a ton of data.

You might argue that you could compress these files because they're text, and that's true, but what about when you only need to read a subset of the data? For example, what if you needed to answer, "Does this project depend on React?"

If you have it as JSON or Protobuf, you have to deserialize the entire data structure before you can query it (or write some nasty regex). With SQLite you have a format you can efficiently search through.
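To make that concrete, here's a minimal Python sketch of the difference. The lockfile layout, the 'package-lock.sqlite' file, and the 'dependencies' table are all assumptions for illustration, not anyone's actual schema:

    import json
    import sqlite3

    # JSON route: answering "does this project depend on React?" means
    # parsing the whole multi-megabyte file into memory first.
    with open("package-lock.json") as f:   # hypothetical lockfile path
        lock = json.load(f)                # full deserialization, every byte
    print("react" in lock.get("dependencies", {}))

    # SQLite route: the same question reads only the pages it needs.
    conn = sqlite3.connect("package-lock.sqlite")
    row = conn.execute(
        "SELECT 1 FROM dependencies WHERE name = ? LIMIT 1", ("react",)
    ).fetchone()
    print(row is not None)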

There are other formats in this space, like ORC and Parquet, but they're optimized for columnar reads (scan only _this_ column). They don't give you indexed point lookups over a large dataset the way SQLite's B-tree indexes do.
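Continuing the hypothetical 'dependencies' table from the sketch above, you can see the index doing that work with EXPLAIN QUERY PLAN:

    import sqlite3

    conn = sqlite3.connect("package-lock.sqlite")  # same hypothetical file
    conn.execute(
        "CREATE TABLE IF NOT EXISTS dependencies"
        " (project TEXT, name TEXT, version TEXT)"
    )
    conn.execute("CREATE INDEX IF NOT EXISTS idx_dep_name ON dependencies (name)")

    # The plan reports a B-tree SEARCH on the index rather than a
    # full-table SCAN, which is what makes selective queries cheap.
    for row in conn.execute(
        "EXPLAIN QUERY PLAN SELECT project FROM dependencies WHERE name = ?",
        ("react",),
    ):
        print(row)  # ...SEARCH dependencies USING INDEX idx_dep_name (name=?)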

This is my understanding, at least. If anybody has an opposing perspective, I'd appreciate hearing it!


I would call this hacking, actually. Is it useful in a financially productive sense? Maybe; I don't care. It's pushing a program we like past its limits. That's fun.



