
I'm sure others will comment, but in the meantime, some people have written up their experiences using Cortex in production. I'd point you to AI Dungeon: https://medium.com/@aidungeon/how-we-scaled-ai-dungeon-2-to-...

We also have a pretty active Gitter channel: https://gitter.im/cortexlabs/cortex

As for your second question, Cortex uses Docker to containerize models. The rest of Cortex's features (deploying models as microservices, orchestrating an inference cluster, autoscaling, prediction monitoring, etc.) are outside Docker's scope.
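
To make that concrete: a Cortex deployment wraps your model in a Python predictor class, which Cortex containerizes and serves. Here's a simplified sketch (the model loading, framework, and config keys are just illustrative; the actual interface supports more options):

    import torch  # illustrative; any framework works

    class PythonPredictor:
        def __init__(self, config):
            # Called once per replica when the container starts;
            # load the model into memory here. "model_path" is a
            # placeholder config key.
            self.model = torch.load(config["model_path"])
            self.model.eval()

        def predict(self, payload):
            # Called per request; payload is the parsed request body
            # (keys are illustrative).
            with torch.no_grad():
                return self.model(torch.tensor(payload["input"])).tolist()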




>orchestrating an inference cluster, autoscaling, prediction monitoring,

Does this approach preclude the need for queuing (à la RabbitMQ) and/or a load balancer?


Yep! Cortex deploys load balancers on AWS and manages queueing.
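
So from the client's side there's nothing extra to set up: you just send HTTP requests to the endpoint Cortex gives you when you deploy, and the cluster handles the load. Roughly (the endpoint URL is a placeholder):

    import requests

    # Cortex prints the API's endpoint after deploying;
    # the URL below is a placeholder.
    endpoint = "http://a0b1c2-123456789.us-west-2.elb.amazonaws.com/my-api"

    response = requests.post(endpoint, json={"input": [1.0, 2.0, 3.0]})
    print(response.json())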


This is super-exciting! I didn't know it could be this easy!

How do you handle API authentication? Is there a module that interfaces with AWS API Gateway, or with an external API authentication service?


Right now, users handle API auth by putting AWS API Gateway in front of Cortex, but incorporating API Gateway into Cortex to automate this is on our short-term roadmap.
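
To illustrate the current workaround: with API Gateway (and an API key) proxying requests to your Cortex endpoint, a client call looks roughly like this (the URL and key are placeholders):

    import requests

    # Both values are placeholders; the API Gateway stage proxies
    # requests through to the Cortex load balancer.
    gateway_url = "https://abc123.execute-api.us-west-2.amazonaws.com/prod/my-api"
    api_key = "YOUR_API_KEY"

    response = requests.post(
        gateway_url,
        headers={"x-api-key": api_key},  # API Gateway's standard API key header
        json={"input": [1.0, 2.0, 3.0]},
    )
    print(response.json())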



