Wow, this brings back memories from more than 3 decades ago. I created a routine on the C64 to copy memory and calculated the performance then around 25KB/sec.
The first version contained a memory corrupting bug that took some time to figure out. Depending on the locations of the source and destination you have to start copying forwards from the beginning of the source, or backwards from the end. If there's an overlap you risk overwriting the source before it is copied to the destination.
The first version contained a memory corrupting bug that took some time to figure out. Depending on the locations of the source and destination you have to start copying forwards from the beginning of the source, or backwards from the end. If there's an overlap you risk overwriting the source before it is copied to the destination.