A comparison of 7 desktop data wrangling tools with benchmarks (easydatatransform.com)
2 points by hermitcrab on Nov 29, 2022 | 3 comments



Benchmarks are included on Windows and Mac for: Base R, R + dplyr, R + data.table, Python + Pandas, Knime, Power Query and Easy Data Transform. Alteryx and Tableau Prep are also discussed, but benchmarks are forbidden by their licensing agreements.

While other benchmarks for ETL/data wrangling tools are available (https://h2oai.github.io/db-benchmark/), they tend not to include GUI-based tools such as Knime and Easy Data Transform.


Wow - benchmarks forbidden due to licensing arrangements - I didn't realise that's a common practice. IMO an informed buyer should avoid buying any tool that prohibits benchmarking, on this criterion alone!

It'd be interesting to include DuckDB in these comparisons. Its performance is similar to data.table, but it has bindings for multiple languages.


Apparently DuckDB + R + dplyr is even faster than data.table (as mentioned in the article). See also:

https://www.reddit.com/r/Rlanguage/comments/z6txsl/comment/i...



