Ask HN: How to deploy an AI App?
1 point by gyeonggi on Nov 21, 2018 | 5 comments
My mate and I are thinking of deploying a website (maybe running Node & Express) where users can upload an image. That image will be passed to a Python script in the same environment, which will contain the AI logic / neural network. Then we want the processed image to be sent back to the front end (a rough sketch of that flow is included after this post).

We are looking for the lowest-cost solution as this is just a hobby project, maybe even a solution where the server starts up on demand when someone tries to reach the website.

How would you deploy an application of this sort, using what cloud service?
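For concreteness, a minimal sketch of what the Python side of such a setup might look like; the script name, paths, and the model call are placeholders, and the Express server would spawn it (e.g. via child_process.execFile) with the uploaded file's path and an output path:

    # infer.py -- hypothetical inference script the Express server spawns
    # with the uploaded image path and an output path as arguments.
    import sys
    from PIL import Image  # pip install pillow

    def run_model(image):
        # Placeholder for the actual neural-network inference;
        # here it just returns the image unchanged.
        return image

    def main():
        in_path, out_path = sys.argv[1], sys.argv[2]
        image = Image.open(in_path)
        result = run_model(image)
        result.save(out_path)  # Express reads this file and returns it to the browser

    if __name__ == "__main__":
        main()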




If you haven't used AWS yet, they provide a one-year free tier; check if it's available to you.

Then deploy your server-side API code using Lambda and serve users from that layer (read about serverless architecture; you don't need an EC2 instance). It's relatively cheap: pricing is based on the number of invocations, and if I remember correctly the free tier includes 1 million requests per month. Even if that's not exact, the pricing is still relatively cheap.

And store the images in S3.
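For reference, a minimal sketch of what such a Lambda handler might look like (the event fields, bucket layout, and the inference call are placeholders, and the model has to fit within Lambda's memory and package-size limits):

    # Hypothetical Lambda handler: fetch the uploaded image from S3,
    # run inference, and write the result back to S3.
    import boto3

    s3 = boto3.client("s3")

    def run_inference(image_bytes):
        # Placeholder for the actual model; returns the input unchanged.
        return image_bytes

    def handler(event, context):
        bucket = event["bucket"]  # assumed to be set by the caller / API Gateway
        key = event["key"]
        obj = s3.get_object(Bucket=bucket, Key=key)
        result = run_inference(obj["Body"].read())
        out_key = "results/" + key
        s3.put_object(Bucket=bucket, Key=out_key, Body=result)
        return {"result_key": out_key}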


Something along the lines of this article? https://aws.amazon.com/fr/blogs/machine-learning/how-to-depl...


How big is your model (neural network)? How much RAM do you need to load it? How big are the images?

Do you need a website running (some pages with HTML), or just an API to the model?


The size of the model is unclear at the moment, but it will likely need 4 GB of RAM or so.

We will only be doing inference with this particular app; the training happens on another machine.

Images will vary based on what users upload, but larger ones could be a couple of GB.

It would be ideal to have a website running, to be user-friendly and present all of this.


Looks like AWS Lambda may be too small for the job.

Have you checked https://www.tensorflow.org/serving/ ?
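If TensorFlow Serving fits, the website would only need to forward the image data to its REST endpoint. A minimal sketch, assuming a model named "my_model" served on the default REST port 8501 (the input shape depends on the actual model):

    # Hypothetical client call to a TensorFlow Serving REST endpoint.
    import json
    import requests

    def predict(pixels):
        # pixels: nested list matching the model's expected input shape
        payload = {"instances": [pixels]}
        resp = requests.post(
            "http://localhost:8501/v1/models/my_model:predict",
            data=json.dumps(payload),
        )
        resp.raise_for_status()
        return resp.json()["predictions"]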




