"I trust Redis-on-disk every day less." --Salvatore

antirez · on June 1, 2011

A message from the same thread that possibly better explains my point:

For Redis on disk to work well, you need:

1) Very biased data access.

2) Mostly reads.

3) Dataset consisting of key->value data where values are small.

4) A dataset that is big enough to really pose memory/cost problems on the ever growing RAM you find in a entry level server.

What is left of Redis semantics and advantages here? The intersection of 1+2+3+4 is small and fits exactly in the case where Redis for metadata, or as a cache, plus another datastore designed to work on disk is the right pick. So why should not we focus, instead, into doing what we already do (the in-memory but persistent data structure server) better? It would be already an huge success to enhance what we already have (not to say that Redis is so important, just that Redis can do his small part in the big picture providing something simple that works well, instead of trying to do everything and save the world).

mahmud · on June 1, 2011

It's an honest assessment by a conscientious engineer. And because of it, I trust Redis even more.

See ma, no ass-covering, no excuses, no cooking benchmarks.

ericd · on June 1, 2011

Agreed, it's really heartwarming when you see someone so public owning up to what some might view as a flaw in their project.

rbranson · on June 1, 2011

coughmongodbiswebscalecough but seriously, some of the commercial vendors in the space are having a difficult time admitting their faults. I feel as if the best way to warm up to engineers is to announce your limitations first. Any engineer worth his or her salt looks for the limitations of any product before attempting use.

anaisbetts · on June 1, 2011

So if Redis is purely in-memory, then (and I'm asking in a positive way), how can it be used in any production system effectively? If something like a spontaneous reboot or Redis crash occurs, or some other event, don't you lose everything?

antirez · on June 1, 2011

Redis is an in memory but persistent on disk data store. So memory is used to serve data, but disk is used in order to persist. This is the Redis model since the start. What I was questioning in the linked group message is to go for disk not just for persistence but also to take ready-to-serve organized data.

Actually in the persistence side we plan to do more work to make Redis better, for instance Redis 2.4 that is entering release candidate can save/load most databases on disk ten times faster.

In the future we plan to explore a new AOF format that is more compact and faster to process, and the ability to rewrite the log without a background process (BGREWRITEAOF).

antihero · on June 1, 2011

Is there anything we have to do to make sure redis stores our data persistently? cron for BGSAVE? Or is it generally handled fairly well already?

antirez · on June 1, 2011

BGSAVE in cron is not needed, you can configure Redis (and this is the default config) to automatically save every N seconds if there are at least M changes in the dataset. It is possible to configure multiple save points. More info: http://redis.io/topics/persistence

anaisbetts · on June 3, 2011

That makes way more sense, thanks!

latch · on June 1, 2011

Even though this has nothing to do with persistence (though I can see how you'd think it does from the OP title), I just want to give you a different perspective (again, this doesn't really have anything to do with what Antirez is actually saying)"

What's more reliable? Data persisted to disk on a single server in a RAID 1 array, or data stored in memory of 10 machines which are completely isolated from each other? What about 100 machines?

It's wrong to think of disks as reliable/persistent and memory as not. You should think of both as stores where neither is 100% reliable. Memory is far less reliable than disk, much in the same way that a single disk is less reliable than RAID-6. However, just like you can make disks more reliable by replicating across other disks (and then even more reliable by replicating across multiple servers...in multiple locations), so too can you achieve reliability with memory.

EDIT (clarify what they are talking about):

They aren't talking about persistence, though I can see why you think that from the OP title. They are talking about loading the data set into virtual memory when it doesn't fit into memory.

This has nothing to do with persistence. Antirez isn't saying he doesn't trust Redis' persistence implementation. He's specifically talking about how the Redis VM handles more data than available memory.

He's saying: always have enough memory for all your data.

theli0nheart · on June 1, 2011

The OP's title is a bit sensationalist, so I can understand why you'd ask this. Redis stores the in memory database to disk asycronously. If a crash occurs, Redis will read the database from disk on startup and copy it into memory.

So the short answer is "no", you don't lose everything. You just lose the data that was written to memory since the last sync to disk.

zwischenzug · on June 1, 2011

And if you're thinking "this is amateur" let me tell you that at least 2 Very Big E-Commerce Companies I know of run their (trad) dbs in the same mode.

jamesaguilar · on June 1, 2011

I.e. a commit can complete successfully before the data has actually been persisted? Or just the part about all data being kept in memory as well? If the former, seems like a bad idea, big e-commerce company or no.

zwischenzug · on June 1, 2011

The former. I think most sane serious dbs cache and manipulate recently-used data in memory (in addition to the logical log).

Bad idea or no, it's what they do - performance gains are large. There is an ersatz logical log in the form of application logs, and these have been used to piece together transactional information before.

tftfmacedo · on June 1, 2011

Yes. As the GP stated, all data added/updated/deleted since last snapshot will be lost. If your data is that critical you can use AOF with "appendfsync always" which will fsync every write operation into a log that will replayed on startup (at a speed cost). Setting it to fsync every second (instead of always) is a good compromise.

aaronblohowiak · on June 1, 2011

innodb_flush_log_at_trx_commit=0 or 2 are very common performance tweaks that trade some level (about a second) of data vulnerability for performance.

jamesaguilar · on June 1, 2011

Hmm, the thing I'm not really clear on is how this helps performance. The database has to do the same amount of work whether you commit before or after the sync, so why not just make all the commits wait until the sync has happened?

aaronblohowiak · on June 1, 2011

fsync() is a blocking call, which requires confirmation from the disk before your process continues. roughly, this means that redis can't do anything while the fsync() happens. while this isn't any more work (cpu) it is significantly slower because of all of the time spent waiting.

jamesaguilar · on June 1, 2011

Hmm . . . as long as the requests are not being handled on the same thread as the one doing the fsync, this shouldn't stop work in the process or make anything appreciably slower.

mikeyk · on June 1, 2011

You can run a slave Redis replica very easily, and save db snapshots from that without impacting the master. When Salvatore says "redis on disk", he's referring to attempts to mix in-memory & on-disk portions of the data, rather than the background saving feature of Redis.

thibaut_barrere · on June 1, 2011

I've been using the AOF (append-only file mode) described in the doc [1], with an older version of Redis (1.2.x) in production for a year and a couple of months, specifically to allow a reboot or a crash without loosing a job. It works well for us.

My use-case is processing XML files that are pushed by a third-party (see [2] for a blog post describing the setup).

One drawback is that the AOF file will keep growing (at least on my version) and could reach the maximum filesize of your system, if any - there's the BGREWRITEAOF command available to work-around that issue (not tested).

[1] http://redis.io/topics/persistence

[2] http://blog.logeek.fr/2010/8/2/on-jruby-resque-and-windows

_mattb · on June 1, 2011

Redis can save to disk after X number of keys have changed or after Z number of seconds. http://redis.io/topics/persistence

tomjen3 · on June 1, 2011

Redis is not meant to be run on a single node, but on multiple computers - and you can decide how much to trade of in terms of stability versus redundancy.

In addition you can make Redis asynchronously save to the disk every so often.

cdavid · on June 1, 2011

It can be used for caching, etc... It is only a problem in production if you expect data there to be durable (i.e. resistant to mild crashes).

tptacek · on June 1, 2011

I'm thrilled to see this, because the work Salvatore was putting into Redis-on-disk functionality had so little perceived payoff for us. This portends Redis not trying to create an idiosyncratic replacement for SQL, and focusing instead on its unique strengths.

If you want the cost/performance characteristics of disk storage, use MySQL or Postgres. If you want them and the speed and flexibility of the Redis data structure server, use Redis + SQL as a one-two punch.

We use Redis for "live" data, and SQL for archival/report/versioned data.

moe · on June 1, 2011

Glad to see antirez is keeping focus on the cluster features, as opposed to sinking time into a questionable direction.

We have plenty disk-based k/v stores already. If that's what I need then tokyo, riak, etc. are my friends.

Redis plays in a slightly different and equally important game. The cluster-/sharding-features seem like a more natural path to scale out the use-cases where it shines.

yid · on June 1, 2011

Isn't this MongoDB's approach to persistence?

latch · on June 1, 2011

They aren't talking about persistence, though I can see why you think that from the OP title. They are talking about loading the data set into virtual memory when it doesn't fit into memory.

This has nothing to do with persistence. Antirez isn't saying he doesn't trust Redis' persistence implementation. He's specifically talking about how the Redis VM handles more data than available memory.

He's saying: always have enough memory for all your data.

mariusmg · on June 1, 2011

Eventually persistent. Thats all there is to say.

antirez · on June 1, 2011

Redis is in the AOF configuration as durable as your MySQL InnoDB store. This has nothing to do with durability :)

aaronblohowiak · on June 1, 2011

*only if you use the appropriate fsync options. By default, AOF is set with the equivalent of innodb_flush_log_at_trx_commit=0 (fsync every second.) Note that the equivalnt of sync_binlog is always 0. Usually, this is just fine, but it is important to note when comparing MySQL to InnoDB.