I used to work for a company which used S3 as its data backend - copying lots of files in and out of S3 is how the product worked. It was also how we enabled QA/STG and the CI/CD pipelines to interact with some of that data - sanitizing it was just a matter of removing or rewriting metadata.
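For reference, that kind of metadata rewrite is a single `copy_object` call with `metadata_directive: 'REPLACE'` - roughly like the sketch below (the bucket names and the "safe" metadata keys here are made up, not our real rules):

```ruby
require 'aws-sdk-s3'

# Sketch only: bucket names and the "safe" metadata keys are invented.
# S3 metadata can't be edited in place, so sanitizing means copying the
# object with metadata_directive: 'REPLACE' and a filtered metadata hash.
def sanitize_copy(s3, src_bucket, dst_bucket, key)
  head = s3.head_object(bucket: src_bucket, key: key)
  safe = head.metadata.slice('content-origin', 'schema-version')

  s3.copy_object(
    bucket: dst_bucket,
    key: key,
    copy_source: "#{src_bucket}/#{key}",  # keys with special characters need URI-encoding
    metadata: safe,
    metadata_directive: 'REPLACE'
  )
end

s3 = Aws::S3::Client.new
sanitize_copy(s3, 'prod-data-bucket', 'stg-data-bucket', 'some/object.parquet')
```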
We never solved this problem well.
Our first go was a naive `aws s3 cp --recursive` between the source and destination buckets - it works through the copy list in a single process with very limited parallelism. Not good for performance when you have hundreds of thousands of files.
Our next go was to use some custom Ruby scripts to do it, executed under JRuby so that we could take advantage of Java threads to multithread the process. This also didn't work well - the Ruby SDK's `CopyObject` implementation seems to hang some threads if you transfer multiple files concurrently. Our team was primarily Ruby-based, so going pure Java was an option that we didn't pursue.
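For the curious, the shape of what we were attempting was roughly this - a shared queue of keys and a pool of JVM threads each issuing `copy_object` calls (bucket names and the thread count here are illustrative, not our real setup):

```ruby
require 'aws-sdk-s3'

# Minimal sketch of the threaded-copy attempt; bucket names and the
# thread count are illustrative. Under JRuby these are real JVM threads.
SRC = 'source-bucket'
DST = 'dest-bucket'

s3    = Aws::S3::Client.new
queue = Queue.new

# Producer: list every key in the source bucket onto the queue.
s3.list_objects_v2(bucket: SRC).each do |page|
  page.contents.each { |obj| queue << obj.key }
end
queue.close

# Workers: each thread pops keys and issues CopyObject calls.
workers = Array.new(16) do
  Thread.new do
    while (key = queue.pop)  # pop returns nil once the closed queue drains
      s3.copy_object(bucket: DST, key: key, copy_source: "#{SRC}/#{key}")
    end
  end
end

workers.each(&:join)
```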
We then heard about S3 Batch Operations and launched a POC, which failed quickly: the Batch Operations copy doesn't work on objects larger than 5GB. That's an immediate non-starter for our use case.
We ended up with a horrible hack after many person-weeks of work on this problem:
1) Get a list of all of the files in the source bucket, and put them on a queue.
2a) If a file is small, copy it using a native Java/Ruby threaded worker.
2b) If the file is large, fork an `aws` CLI process and have it `s3 cp ...` the file (the dispatch is sketched below).
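The size-based dispatch in 2a/2b boils down to something like this (the 500 MB cutoff and the error handling are invented; the listing/queue/thread plumbing is the same shape as the sketch above):

```ruby
# Sketch of the dispatch only; the cutoff and error handling are invented.
# Assumes the worker queue holds [key, size] pairs from the bucket listing.
LARGE_OBJECT_BYTES = 500 * 1024 * 1024

def copy_one(s3, key, size, src:, dst:)
  if size < LARGE_OBJECT_BYTES
    # Small object: one CopyObject call from the SDK does the job.
    s3.copy_object(bucket: dst, key: key, copy_source: "#{src}/#{key}")
  else
    # Large object: a single CopyObject call caps out at 5 GB, so hand the
    # file to the CLI, which does multipart copies on its own.
    ok = system('aws', 's3', 'cp', "s3://#{src}/#{key}", "s3://#{dst}/#{key}", '--quiet')
    raise "aws s3 cp failed for #{key}" unless ok
  end
end
```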
I'm not surprised at all that copying data took these seven engineers two full days.
I had to transfer 50TB+ to a new bucket, applying several rules based on file metadata. I scanned all the file names, put them in a queue, then ran a custom C# program on several of the largest instances to process the queue. It maxed out CPU and bandwidth and worked great.
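The per-object work was nothing fancy - conceptually something like this (shown in Ruby rather than C# to match the rest of the thread; the rule, names, and routing are invented):

```ruby
require 'aws-sdk-s3'

# Sketch only: the metadata rule, bucket names, and key prefix are invented.
# Each worker thread pulls a key off the queue and runs this.
def process_key(s3, key, src:, dst:)
  meta = s3.head_object(bucket: src, key: key).metadata
  # Example rule: route objects tagged as raw data under an archive prefix.
  dest_key = meta['data-class'] == 'raw' ? "archive/#{key}" : key
  s3.copy_object(bucket: dst, key: dest_key, copy_source: "#{src}/#{key}")
end
```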
I don't recommend Ruby for anything beyond the simplest websites. Don't try to make it perform; just use a better language that can handle the performance you need.
(Sarcasm) You should “buck up” and learn Rust or C. With a real language you won’t have to deal with the overhead of a runtime and will really be able to saturate the resources.