Cache coherency primer (2014)

bluetomcat · on July 15, 2016

The moral of the story for software developers is to be aware of false sharing and cache line bouncing when writing multithreaded code.

Seemingly innocent code like this will most likely cause completely unnecessary inter-core traffic (assuming that the compiler has laid out a and b adjacently, and they both fall within the boundaries of a single cache line):

    unsigned a, b;

    void thread_a(void) {
        for (;;) a++;
    }

    void thread_b(void) {
        for (;;) b++;
    }

bluecalm · on July 14, 2016

I don't know almost anything about hardware but that was very well written and easy to follow. A pleasure to read.

camelspade · on July 14, 2016

If you are more interested, Appendix C of "Is Parallel Programming Hard, And, If So, What Can You Do About It?" by Paul McKenney (http://kernel.org/pub/linux/kernel/people/paulmck/perfbook/p...) provides a very detailed description as well. It really helped improve my understanding of how memory barriers and atomics work

stplsd · on July 15, 2016

This is also a very good read: https://lagunita.stanford.edu/c4x/Engineering/CS316/asset/A_...

camelspade · on July 16, 2016

Thanks! :) Gonna add this to the list of books I need to read.

swah · on July 15, 2016

It was very helpful for me as well - I saw cache optimization mentioned in these C++ gamedev talks but wasn't quite grasping the issues.