How does this work under the hood? Is the model loaded every time it receives a request? Is it run in Docker or in a Lambda? How does it work after "uploading it" to Amazon?
Each model is loaded into a Docker container, along with any Python packages and request handling code. The cluster runs on EKS in your AWS account. Cortex reads the declarative configuration in 'cortex.yaml' and creates the deployment each time you run 'cortex deploy', so the containers don't change unless you run 'cortex deploy' again with an updated configuration. This post goes into more detail about some of our design decisions: https://towardsdatascience.com/inference-at-scale-49bc222b3a...
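To make the request lifecycle concrete, here's a rough sketch of the pattern (not Cortex's actual request handling code, just a generic Flask app showing that the model is loaded once at container startup and reused for every request, rather than being loaded per request):

    # A rough sketch of the serving pattern -- not Cortex's actual request
    # handling code. The model loads once when the container starts.
    from flask import Flask, jsonify, request
    from tensorflow import keras

    app = Flask(__name__)

    # Runs once at container startup, not per request.
    model = keras.models.load_model("/model")  # hypothetical path baked into the image

    @app.route("/predict", methods=["POST"])
    def predict():
        features = request.get_json()["features"]
        prediction = model.predict([features]).tolist()
        return jsonify({"prediction": prediction})

    if __name__ == "__main__":
        app.run(host="0.0.0.0", port=8080)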
From the MLflow Models docs: "An MLflow Model is a standard format for packaging machine learning models that can be used in a variety of downstream tools—for example, real-time serving through a REST API or batch inference on Apache Spark. The format defines a convention that lets you save a model in different “flavors” that can be understood by different downstream tools."
Cortex is what they are referring to as a downstream tool for real-time serving through a REST API. In other words, MLflow helps with model management and packaging, whereas Cortex is a platform for running real-time inference at scale. We are working on supporting more model packaging formats, and I think it's a good idea to support the MLflow format as well.
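To illustrate the MLflow side of this, here's a small standalone example using the standard mlflow APIs (the model and paths are just placeholders). The saved directory is the "MLflow Model" that a downstream serving tool would consume:

    # MLflow's packaging format in action, independent of Cortex. The model
    # and path names are placeholders.
    import mlflow.pyfunc
    import mlflow.sklearn
    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression

    X, y = load_iris(return_X_y=True)
    model = LogisticRegression(max_iter=1000).fit(X, y)

    # Saves the model with both an sklearn flavor and a generic pyfunc flavor.
    mlflow.sklearn.save_model(model, "iris_model")

    # Downstream tools can reload it through the generic pyfunc interface.
    loaded = mlflow.pyfunc.load_model("iris_model")
    print(loaded.predict(X[:5]))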
Contributor here - Cortex supports TensorFlow SavedModels in addition to ONNX. PyTorch support is on the roadmap. Do you have specific frameworks in mind that you would like Cortex to support?
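For anyone wondering what exporting to those formats looks like, here's a quick sketch using the standard framework export calls (the models are trivial placeholders; exporting a PyTorch model to ONNX is also a common workaround until native PyTorch support lands):

    # Exporting to the two formats Cortex currently accepts; the models are
    # trivial placeholders, only the export calls matter.
    import tensorflow as tf
    import torch

    # TensorFlow SavedModel: writes a directory the serving layer can point at.
    tf_model = tf.keras.Sequential([tf.keras.layers.Dense(1, input_shape=(4,))])
    tf_model.save("tf_saved_model")

    # ONNX: a common route for PyTorch models in the meantime.
    torch_model = torch.nn.Linear(4, 1)
    dummy_input = torch.randn(1, 4)
    torch.onnx.export(torch_model, dummy_input, "model.onnx")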
Unfortunately, a somewhat popular Clojure machine learning library on GitHub is also called Cortex, which is going to make discussing machine learning APIs in the context of Clojure that much more confusing.
And I imagine many more machine learning tools will take the same name in the years to come, since it's about the most obvious one you could think of other than "brain".
Calling it an alternative to SageMaker might be a bit misleading, as SageMaker is also a platform for training models on automatically allocated EC2 resources, even spot instances.
Cortex contributor here - you're right; it's more accurate to compare Cortex to SageMaker's model deployment functionality. We are currently working on spot instance support for serving, and training is on our roadmap.