The AM335x isn't designed for data acquisition like this. I should know: I'm doing a project right now with 16 Msamples running through a beaglebone. The PRUs don't really like to write directly to memory very much, and hacky methods are required to prevent segfaults. There are other chips with officially supported DSPs that work much, much better for this sort of thing.