I find it hard to believe that all the heavy-weight data processing and GPU computation really make a constant factor reduction in search steps worth it.
It's also not clear to me how one would determine that an ML model-generated plan is indeed optimal, or how far from optimal it is. A*-based approaches give you these things.