I absolutely second the suggestion of working through the math of a shallow neur...

I absolutely second the suggestion of working through the math of a shallow neural network to start.

I would also suggest that once one thinks they have a sufficient grasp of the steps to try to implement them at a low level without frameworks, just math libraries. Personally, I found Octave (highly similar to MATLAB) to be a nice granularity tier for learning the process programmatically.