Nice write-up. I suspect the poor state of async I/O implementations reflects how rarely it's actually used in practice. Signal handling definitely feels like the wrong design here, especially for a library author.
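For anyone who hasn't run into it, signal-driven completion with POSIX AIO looks roughly like this (untested sketch; the choice of SIGUSR1 and the file path are mine). The sigaction() call is the library-author problem in a nutshell: signal dispositions are process-global, so a library installing this handler can silently stomp on whatever the application registered:

```c
/* Untested sketch of POSIX AIO with signal-based completion
 * (may need -lrt on older glibc). */
#include <aio.h>
#include <signal.h>
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

static volatile sig_atomic_t done = 0;

static void on_aio_done(int sig, siginfo_t *si, void *ctx) {
    (void)sig; (void)ctx;
    if (si->si_code == SI_ASYNCIO)
        done = 1;   /* only async-signal-safe work is allowed here */
}

int main(void) {
    struct sigaction sa = {0};
    sa.sa_flags = SA_SIGINFO;
    sa.sa_sigaction = on_aio_done;
    sigemptyset(&sa.sa_mask);
    sigaction(SIGUSR1, &sa, NULL);   /* process-wide side effect */

    int fd = open("/etc/hostname", O_RDONLY);
    char buf[256];

    struct aiocb cb = {0};
    cb.aio_fildes = fd;
    cb.aio_buf = buf;
    cb.aio_nbytes = sizeof buf;
    cb.aio_sigevent.sigev_notify = SIGEV_SIGNAL;
    cb.aio_sigevent.sigev_signo = SIGUSR1;

    aio_read(&cb);                   /* kicks off the async read */
    while (!done)
        pause();                     /* wait for the completion signal */

    printf("read %zd bytes\n", aio_return(&cb));
    close(fd);
    return 0;
}
```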
I'm also not surprised that Windows fared better here; with IOCP they had a chance to redo async I/O completely.
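The shape of the API is the big difference: completions get pulled from a queue you own instead of pushed at you asynchronously. A rough, untested sketch of the pattern (file name is arbitrary):

```c
/* Untested IOCP sketch: issue an overlapped read, then pull the
 * completion from a queue. A real server would have a pool of
 * threads all blocking in GetQueuedCompletionStatus(). */
#include <windows.h>
#include <stdio.h>

int main(void) {
    HANDLE file = CreateFileA("test.txt", GENERIC_READ, FILE_SHARE_READ,
                              NULL, OPEN_EXISTING,
                              FILE_FLAG_OVERLAPPED, NULL);

    /* Create a completion port and associate the file handle with it. */
    HANDLE iocp = CreateIoCompletionPort(file, NULL, /*key=*/1, 0);

    char buf[4096];
    OVERLAPPED ov = {0};                          /* read at offset 0 */
    ReadFile(file, buf, sizeof buf, NULL, &ov);   /* returns immediately,
                                                     pending I/O expected */

    /* ... do other work while the read is in flight ... */

    DWORD bytes; ULONG_PTR key; OVERLAPPED *pov;
    if (GetQueuedCompletionStatus(iocp, &bytes, &key, &pov, INFINITE))
        printf("completion on key %llu: %lu bytes\n",
               (unsigned long long)key, (unsigned long)bytes);

    CloseHandle(file);
    CloseHandle(iocp);
    return 0;
}
```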
There's also the extremely limited use case: there aren't that many I/O requests you can have in flight to a disk at a time while still getting a speed-up. Having to fall back on a thread pool ends up not being much of a problem -- it doesn't really hurt your benchmarks. On the other hand, a system that talks on a network to thousands or millions of clients benefits greatly from avoiding 4-8KB of stack overhead per connection.
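Roughly what the event-loop alternative looks like on Linux (untested echo-server sketch; the port number is arbitrary) -- the per-connection cost is just a registered fd plus whatever state you keep, no dedicated stack:

```c
/* Untested sketch: one thread multiplexing many sockets with epoll. */
#include <sys/epoll.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <unistd.h>

int main(void) {
    int lsock = socket(AF_INET, SOCK_STREAM, 0);
    struct sockaddr_in addr = {0};
    addr.sin_family = AF_INET;
    addr.sin_port = htons(8080);
    bind(lsock, (struct sockaddr *)&addr, sizeof addr);
    listen(lsock, SOMAXCONN);

    int ep = epoll_create1(0);
    struct epoll_event ev = { .events = EPOLLIN, .data.fd = lsock };
    epoll_ctl(ep, EPOLL_CTL_ADD, lsock, &ev);

    for (;;) {
        struct epoll_event events[64];
        int n = epoll_wait(ep, events, 64, -1);
        for (int i = 0; i < n; i++) {
            int fd = events[i].data.fd;
            if (fd == lsock) {            /* new connection: just register it */
                int c = accept(lsock, NULL, NULL);
                struct epoll_event cev = { .events = EPOLLIN, .data.fd = c };
                epoll_ctl(ep, EPOLL_CTL_ADD, c, &cev);
            } else {                      /* readable client: echo back */
                char buf[512];
                ssize_t r = read(fd, buf, sizeof buf);
                if (r <= 0) close(fd);    /* closed fds leave epoll automatically */
                else write(fd, buf, r);
            }
        }
    }
}
```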
4-8kB? Maybe physical memory overhead, if your code isn't too deep. But userspace thread stacks are anywhere from 128kB (FreeBSD) to 8MB (Linux) of virtual memory overhead.
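Easy to check for yourself (untested sketch; behavior varies by libc -- some report 0 from a fresh attr, meaning "use the process default"):

```c
/* Untested sketch: print the default pthread stack reservation and
 * start a thread with a deliberately small one. On glibc the default
 * usually tracks RLIMIT_STACK (often 8MB of virtual address space);
 * only pages actually touched become physical memory. */
#include <pthread.h>
#include <stdio.h>

static void *worker(void *arg) { (void)arg; return NULL; }

int main(void) {
    pthread_attr_t attr;
    size_t size;

    pthread_attr_init(&attr);
    pthread_attr_getstacksize(&attr, &size);
    printf("default stack reservation: %zu bytes\n", size);

    /* Ask for a much smaller stack; PTHREAD_STACK_MIN is the floor. */
    pthread_attr_setstacksize(&attr, 128 * 1024);

    pthread_t t;
    pthread_create(&t, &attr, worker, NULL);
    pthread_join(t, NULL);
    pthread_attr_destroy(&attr);
    return 0;
}
```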
Depends on what libc (or other) routines you call... going over the end of the stack is no fun. Lots of code seems to be written to rely on deep stacks in userspace.