I'm working through this course (and took the earlier sister course Computation for Data Analysis with R). It's quite good so far. They stay away from the "big" part and focus on the core of data analysis: how to find data, clean it up, explore it, find relationships and present your findings. We are using R, which is suitable for most data sizes. It's offered by Johns Hopkins, and has more of an academic bent than an industry one. Great general purpose knowledge that I think you would want before you start messing around with Hadoop.