It’s not clear to me why this would be faster than polars, duckdb, vaex or click...

maleldil · 2024-11-20T16:36:47 1732120607

None of those are drop-in replacements for Pandas. The main draw is "faster without changing your code".

faizshah · 2024-11-20T20:42:46 1732135366

I’m asking more about what techniques they used to get the performance improvements shown in the slides.

They are showing a 20-30% improvement over Polars, Clickhouse and Duckdb. But those 3 tools are SOTA in this area and generally rank near each other in every benchmark.

So a 20-30% improvement over that cluster makes me want to know what techniques they are using to beat their peers.

mettamage · 2024-11-20T13:53:23 1732110803

Maybe it isn’t? Maybe they just want a fast pandas api?

geysersam · 2024-11-20T16:48:18 1732121298

According to their benchmarks they are faster. Not by a lot, but still significantly.