Aside: are there any good open/online resources for learning about bayesian reasoning, NLP, machine learning or similar topics?
I've read "Probabilistic Programming & Bayesian Methods for Hackers" but it didn't feel like it explained enough of the concepts. I'd love something like an undergraduate curriculum/text if someone could recommend one.
If you want a thorough explanation of concepts (as opposed to the 'black box' 'use this library' approach taken by many tutorials I've seen) then I highly recommend John Foreman's (Data Scientist at MailChimp) book "Data Smart".
He implements algorithms on practical problems in Excel. I know that may sound off-putting but it's so much easier to visualise what's going on when you can see vectors of data change in a spreadsheet compared to trying to debug arrays in a more general programming language.
What led me to John - something I heard about here actually - was his blog called 'Analytics Made Skeezy' where he creates fictional 'drug dealer' GTA style examples such as "Forecasting Made Skeezy: Projecting Meth Demand using Exponential Smoothing"
I've often seen "An introduction to statistical learning" [1] and "Elements of statistical learning" [2] cited as good resources for statistical inference and machine learning. The former is more of an undergrad text, while the latter seems to be aimed at graduate students. Both books are available free online.
I've read "Probabilistic Programming & Bayesian Methods for Hackers" but it didn't feel like it explained enough of the concepts. I'd love something like an undergraduate curriculum/text if someone could recommend one.