This thing works great. It's running on my local 5-node (2x3) k8s cluster, with a Coral TPU plugged into a single node. I have six 4K cameras running, and it does object analysis on all of them with under 10% CPU on that node.
I run this on k8s too, but I don't see much benefit, because the app can't take advantage of multiple pods or clustering. Unless I'm missing something, there's no real advantage to running it on k8s compared to plain Docker. It would be cool if each camera stream could be spawned off into its own pod so that it could actually leverage the cluster.
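To make the per-camera idea concrete, here is a rough sketch of what "one pod per stream" could look like. As far as I know the app does not support this today, so everything here is an assumption: the image name, the `CAMERA_URL` environment variable, the camera names, and the RTSP addresses are all hypothetical placeholders. It only generates standard Kubernetes Pod manifests, one per camera, with an optional `nodeSelector` for pinning a pod to a specific node.

```python
import json

# Hypothetical placeholders: this image and the CAMERA_URL variable are
# illustrative only and are not something the app actually provides.
WORKER_IMAGE = "example/camera-worker:latest"


def pod_for_camera(name, stream_url, node_name=None):
    """Build a Kubernetes Pod manifest (as a dict) for a single camera stream."""
    pod = {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {
            "name": f"camera-{name}",
            "labels": {"app": "camera-worker", "camera": name},
        },
        "spec": {
            "containers": [
                {
                    "name": "worker",
                    "image": WORKER_IMAGE,
                    "env": [{"name": "CAMERA_URL", "value": stream_url}],
                }
            ],
        },
    }
    if node_name is not None:
        # Pin the pod to one node, e.g. the node with the accelerator attached.
        pod["spec"]["nodeSelector"] = {"kubernetes.io/hostname": node_name}
    return pod


# Made-up cameras using documentation-range IP addresses.
cameras = {
    "front-door": "rtsp://192.0.2.10/stream",
    "garage": "rtsp://192.0.2.11/stream",
}

manifests = [pod_for_camera(name, url) for name, url in cameras.items()]

if __name__ == "__main__":
    # kubectl accepts JSON, so the output can be piped to `kubectl apply -f -`.
    print(json.dumps({"apiVersion": "v1", "kind": "List", "items": manifests}, indent=2))
```

One caveat with this design: any pod that needs a Coral TPU would still have to be pinned to the node that has one, so the scheduler could only spread out the work that doesn't touch the accelerator.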