Having ported a decent chunk of code to Numba, I've found the key is to minimize where you interface with it.
You need to be careful not to inadvertently introduce new types. In particular, Numba doesn't (last I checked) recognize homogeneous tuples (Tuple[Foo, ...] in typing), so each new tuple length requires recompilation.
Similarly, every call does type inference on its arguments, including jitclass constructors. If you're making many calls with a large number of arguments, you may be killing your performance gains even absent compilation.
If you're trying to write code that can run with or without Numba (e.g. the same logic may or may not run in a jitted loop), definitely avoid jitclasses.
All in all, just_temp's remark that you have to "write it like Fortran" is pretty close. The reason it worked for me was that I had a lot of business-logic-style code segregated into an early section that spat out very regular, primitive structures. That meant the code that had to be fast was already very Fortran-like.
I had considered Numba in the past, but it just seemed not worth the overhead. A few talks from this year show that they have really expanded the library, to the point where much of the scientific Python stack uses it instead of Cython. It can target things like ARM devices and is more flexible in the types it can take (dicts!).
For reference
https://www.youtube.com/watch?v=cR8E70GTO8c
and
https://www.youtube.com/watch?v=6oXedk2tGfk
I think it's rather premature to say that the scientific Python stack is adopting Numba. None of the core projects like SciPy, pandas, and scikit-learn have been willing to swap out Cython for Numba. Cython is still dominant and I don't see that changing anytime soon.
Cognitive. Things like having to strip down abstractions and "write it like Fortran". The fact that it can deal with numpy arrays no problem and can actually handle more common Python objects like dicts means there's less overhead.
There's also CuPy, which is NumPy with CUDA acceleration: a drop-in replacement for most of NumPy. You can also easily use CUDA kernels inside Python, and even run Numba functions generated with @numba.cuda.jit.
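The "drop-in" part is the key selling point: because CuPy mirrors the NumPy API, the same code can run on GPU or CPU depending on what's installed. A hedged sketch (falling back to NumPy when CuPy isn't available):

```python
try:
    import cupy as xp   # GPU arrays, if CUDA is available
except ImportError:
    import numpy as xp  # CPU fallback with the same API

a = xp.arange(6, dtype=xp.float64).reshape(2, 3)
result = (a * 2).sum(axis=1)  # identical call on GPU or CPU -> [ 6. 24.]
```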
Really wish they would implement texture memory for CUDA. I used Numba initially but switched to PyCUDA for that feature alone; I gained a 2-3x runtime speedup for a raytracing-based simulation.