Do you have access to the weights? If so you probably have better ML hardware. W...

Do you have access to the weights? If so you probably have better ML hardware. Wish this model was actually "open".

The perf of a model of that size on the M1 will not be good. That is big enough it won't quite fit on a 3090 (24GB) without quantization.