"All problems in computer science can be solved by another level of indirection" - David Wheeler
That's what "containers" are, of course. There's so much state in OS file namespaces that running any complex program requires "installation" first. That's such a mess that virtual machines were created to allow a custom OS environment for a program. Then that turned into a mess, with, for example, a large number of canned AWS instances to choose from. So now we have another level of indirection, "containers".
Next I expect we'll have container logistics management startups. These will store your container in a cloud-based "warehouse", and will continuously take bids for container execution resources. Containers will be automatically moved around from Amazon to Google to Rackspace, etc. depending on who's offering the lowest bid right now.
It's more like ping-pong. Things start off simply, but over time, as the layers of abstraction pile up, they become brittle and unworkable.
I view containers as more of a reworking of a key computational abstraction (VMs) than an evolution of them. We finally have operating systems with enough inter-process isolation, sufficiently capable filesystems (layering), etc. that we can throw out 80% of the other unnecessary junk of VMs like second kernels, duplicate schedulers, endless duplication of standard system libraries, etc.
So it's more like we've hacked/refactored virtualization into a more usable state, and gotten rid of a lot of useless garbage that it turns out we didn't actually need. It's a lot like how a big software system evolves, now that I think about it.
I'm genuinely curious, although a bit naive WRT containers. Outside of an aesthetic preference (for being able to remove 80% unnecessary cruft), what is the advantage of containers? I was under the impression that VM overhead was marginal in terms of today's computing.
I ask because I'm familiar with VMs, having worked with them extensively for a number of years. VMs work quite well for any application I've needed, so what would be the benefit of switching to containers? I've got lots to do, and lots to learn, but I can't see learning containers (and being out of sync with the rest of my coworkers) being a priority.
But I'm willing to change my mind if there's a concrete benefit. Right now, VMs work just fine, but maybe there's something I'm missing...
VM overhead isn't trivial. It still remains a pretty big factor in terms of cost bloat for CPU-bound stuff. Also, VMs take a godawful long time to start up; if you care about, say, responding to load within ten seconds, VMs aren't a great choice.
They're fine for a lot of things, of course. I use them all the time. But I use containers for other things.
I recall several reliable testers confirming that the CPU overhead of virtualisation was negligible, somewhere around 2%. Unfortunately I could not quickly find those papers now, but I did find an old VMware whitepaper[1] showing they had ~7% overhead 5+ years ago, which sounds about right considering what kind of advancements they would have made in half a decade.
Sounds feasible, but CPU usage isn't really talked about as an advantage of containers.
I expect startup time and memory usage would be lower, but to my mind the advantages are mainly around flexibility... e.g. How long it takes to create or upload an image file. How long it takes to set up a minimal infrastructure with several components to it on a single EC2 instance. Decoupling the operating system patch cycle from the app deployment image generation cycle. etc.
It's just MUCH more memory-efficient to run containers, and VMs typically have worse I/O throughput.
CPU scores are fine though.
As an example, I am running around 20 containerized servers on my laptop in a 4GB VM, a workload that would typically be run on 20 distinct VMs on one or more hypervisors. It's not very fast, but the density of servers you can put on your hardware is MUCH higher.
Ah, sorry! I didn't think you meant literally "10 seconds"; I was assuming you just meant quickly (a few minutes).
I can't really think of a use case, though, where someone would need more capacity in under 10 seconds. Maybe if you only intend to scale horizontally with a bunch of 500MB instances and have little to no room to set an appropriate scaling threshold? What would be a couple of examples? With the apps I've seen over the past several years, they generally have scaling thresholds at 'X' resource, and 3 minutes is more than enough to provision extra capacity for their needs.
Containers are just a way to launch processes without polluting your local namespace or system. It's a way to say "hey, this stuff shouldn't interfere with anything else".
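If it helps to see how thin that layer really is, here's a minimal Go sketch (Linux-only, needs root or a user namespace; nothing here is Docker-specific) that launches a shell in its own UTS, PID, and mount namespaces, which is roughly the primitive every container runtime builds on:

    package main

    import (
        "os"
        "os/exec"
        "syscall"
    )

    func main() {
        // Start a shell in fresh UTS, PID, and mount namespaces. Hostname
        // changes made inside stay inside, and the shell's children get
        // their own PID numbering.
        cmd := exec.Command("/bin/sh")
        cmd.Stdin, cmd.Stdout, cmd.Stderr = os.Stdin, os.Stdout, os.Stderr
        cmd.SysProcAttr = &syscall.SysProcAttr{
            Cloneflags: syscall.CLONE_NEWUTS | syscall.CLONE_NEWPID | syscall.CLONE_NEWNS,
        }
        if err := cmd.Run(); err != nil {
            panic(err)
        }
    }

Real runtimes layer cgroups, a network namespace, and an image-based root filesystem on top of that, but the isolation itself is just kernel bookkeeping.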
Well, we've had various containers, such as BSD jails, for decades. The useless garbage wasn't necessary. Seems like the ping-pong happens whenever "kids these days" don't know why the status is quo and then have to relearn the old lessons.
IMO, the problem is that your standard OS has way too much stuff running.
A SaaS app running in production should be about the size of your binary and the libraries it uses. Instead, we have X, SMTP, terminals, and a full filesystem running. Home directories and UIDs make no sense in an app that uses no Unix users except for the one you're forced to use.
I'd really like to see a much smaller, simpler, non-POSIX OS for running server apps.
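As a rough sketch of the "binary plus the libraries it uses" idea, a Go service like the one below compiles (with CGO_ENABLED=0) into a single static binary that can run on a bare kernel and an otherwise empty filesystem: no shell, no SMTP, no home directories.

    package main

    import (
        "fmt"
        "log"
        "net/http"
    )

    func main() {
        // A whole "server app": one statically linked binary, no distro underneath.
        http.HandleFunc("/", func(w http.ResponseWriter, r *http.Request) {
            fmt.Fprintln(w, "hello from a single static binary")
        })
        log.Fatal(http.ListenAndServe(":8080", nil))
    }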
"OSv was designed from the ground up to execute a single application on top of a hypervisor... OSv... runs unmodified Linux applications (most of Linux's ABI is supported) and in particular can run an unmodified JVM, and applications built on top of one."
> I'd really like to see a much smaller, simpler, non-POSIX OS for running server apps.
The POSIX system interfaces (read, write, open, close, etc.) are OK. It's the Commands and Utilities that are the problem. Do you really need Bash available? How much of the 50,000,000 lines of Linux needs to be inside the VM running your one web application? How much attack surface is provided by the presence of all that stuff?
There's a project which has taken the C runtime library and made it run on a bare VM, so you don't need an OS instance at all. If you're just running one program, that makes a lot of sense.
This doesn't really pencil out... "your binary, and the libraries it uses" can easily get into the gigabytes when you include components like the .NET Framework or the Java base class library. I don't know exactly how large a fully loaded npm repo with a warm cache or a warmed-up rvm installation directory is, but it isn't tiny.
Second, POSIX is a standard for how the operating system API works that has nothing to do with what packages are installed -- and it's a pretty low-level API, for doing stuff like read, write, fork, exec, etc. This isn't what's adding bloat.
In this case, the problem isn't being solved -- solving the problem would mean moving away from dependencies on the global OS namespace by relearning how to write self-contained applications (some people never forgot).
Containers are just a big wad of duct tape holding together the ball of mud that comprises most web applications' server-side components.
Add containers, and you haven't solved the problem, you've just made two problems.
It's great for those nasty legacy apps that only work on old, unmaintained versions of Rails, or on old OS versions, etc.
Take all the nastiness and throw it into a box, without needing to contact Ops to reserve memory and provision a VM.
IMO, it's one of the major reasons why Enterprises get so excited about Docker. Legacy app dependency issues are horrible once you get past a certain scale.
VMs are expensive and non-self-service at most orgs, since they tie up RAM and licenses.
Functions are just a big wad of duct tape holding together the ball of mud that comprises most applications' lines of code. Add functions, and you haven't solved the problem, you've just made two problems.
Does this sound true to you? What makes containers any different from the organization that the abstraction of "functions" bring to ordinary sequential programs?
Functions (should) abstract over irreducible complexity.
As long as we're asking hypotheticals — why do applications need to control the global OS namespace and the dependencies between elements in that namespace to a degree that the applications themselves can't be easily deployed without containers?
It's not really adding another level of indirection, it's taking one away. The pain of change remains in that you have to internalize yet another new layer, BUT at least this way you get to leave VMs behind. It's trading one layer for another slightly more granular one instead of piling another one on top.
Docker in general is just another swing of the granularity pendulum. Since the rise of distributed environments in the late 1980s, the pendulum has swung back and forth between microservices (which become a version-control tangle as they move independently) and monolithic applications (which become a bloatware problem as they have whole kitchen sinks to move around). The core problem is that software is complex, and at a certain level you can't take complexity away, just push it around here and there. A large number of small pieces, or a small number of large pieces. Which kneecap do you want to be shot in?
After a few years of trending toward monoliths via chef/puppet/ansible DevOps automation, Docker is going in a different direction, toward fragmented SOA. It'll go that way for a while until it becomes too painful, and then new tech will come to push us back to the monolithic approach, until that hurts too much...
The good thing is, these cycles come in response to improvements in technology and performance. Our tools get better all the time, and configuration management struggles to keep up. It's awesome! Docker will rule for a while and then be passed by in favor of something new, but it'll leave a permanent mark, just as Chef did, and Maven, and Subversion, and Ant, and Make, and CVS, and every other game-changer.
Security-wise, if I understand correctly, this is a very interesting offering.
1. The containers live on "your" VMs, so you get the isolation of a virtual machine and don't have to worry about other tenants' containers.
2. The VMs are part of a "private cloud", i.e., the internal network is not accessible by other tenants' VMs and containers.
#2 is what worried me the most about other container service offerings. It's easy to overlook protecting your internal IPs when you manage VMs, and it's even easier (and expected) when you deploy containers.
I'm here at AWS re:Invent and just saw the EC2 Container Service presentation. They specifically targeted security as part of their design.
Basically, you launch a cluster of EC2 instances that are "available" for containers to launch into. So these are your instances, running in your VPCs. It's really the same security profile as the standard VPCs plus any other security issues your particular docker containers expose.
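If the preview works the way the presentation described, driving it should look roughly like this hedged Go sketch (the cluster name, task family, region, and an aws-sdk-go-style ECS client are my assumptions, not anything from the preview docs): you create a cluster out of your own instances, then ask the service to place tasks onto it.

    package main

    import (
        "fmt"

        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/ecs"
    )

    func main() {
        sess := session.Must(session.NewSession(&aws.Config{Region: aws.String("us-east-1")}))
        svc := ecs.New(sess)

        // The cluster is just a logical grouping; the EC2 instances registered
        // into it are your own, running inside your own VPC.
        if _, err := svc.CreateCluster(&ecs.CreateClusterInput{ClusterName: aws.String("demo")}); err != nil {
            panic(err)
        }

        // Ask the service to place one copy of a registered task definition
        // ("web" is a hypothetical family name) somewhere in the cluster.
        out, err := svc.RunTask(&ecs.RunTaskInput{
            Cluster:        aws.String("demo"),
            TaskDefinition: aws.String("web"),
            Count:          aws.Int64(1),
        })
        if err != nil {
            panic(err)
        }
        fmt.Println("started tasks:", out.Tasks)
    }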
Digital Ocean has something called "Private Networking" that's internal to the data center but shared with all other customers. It's not obvious from reading the website that this is the case.
I'm disappointed that this requires an invite, particularly so soon after Container Engine, which I was able to try out immediately while still watching Cloud Platform Live the other day.
Is this typical for new AWS offerings?
It makes me wonder if it's something that truly isn't ready for prime time, but is being rushed out by the mounting Docker hype and the GKE announcement.
Considering they've been tweeting about it [1] since before their competitors announced things, I'd say it's unlikely to be a "response". It's far more likely that Docker has now been out long enough for the various providers to build services around it. AWS already had some Docker support built in back in April [2]. It's also pretty common to release services as previews; Google lists theirs as an alpha-quality product.
Given that Kubernetes (the project behind Container Engine) was open sourced in early June, I hardly think a tweet from a week and a half ago shows it's not a response to Google.
He also mentions the Elastic Beanstalk support for Docker from April. It's quite obvious that everyone has been working on Docker support for a while now anyway.
According to one of the AWS devs, they plan to start honoring invite requests in about 2 - 4 weeks. It appears to be in preview right now mostly b/c the loose ends aren't tied up yet. For example, in their demo today, they launched EC2 instances in a cluster using an AMI that's specially enabled for the EC2 Container Service but which is not yet publicly available.
Anyone have any insight into whether this handles service discovery? It claims "cluster management", which usually means discovery, but there is no mention of it. Maybe Amazon is expecting you to handle that?
I was wondering this as well. It seems that they will provide for constraints around co-located containers (similar to pods in Kubernetes) but I'm not sure how discovery for containers scheduled across hosts is meant to take place.
> ...including the Docker repository and image, memory and CPU requirements, and how the containers are linked to each other. You can launch as many tasks as you want from a single task definition file that you can register with the service.
Very few details, but it looks like container linking across hosts. If so, this is great news.
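Presumably registering one of those task definitions ends up looking something like this hedged Go sketch (the family name, images, resource numbers, ports, and an aws-sdk-go-style client are all my assumptions):

    package main

    import (
        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/ecs"
    )

    func main() {
        sess := session.Must(session.NewSession(&aws.Config{Region: aws.String("us-east-1")}))
        svc := ecs.New(sess)

        // One task = one or more containers, each with its image, its
        // CPU/memory requirements, and its links to the other containers.
        _, err := svc.RegisterTaskDefinition(&ecs.RegisterTaskDefinitionInput{
            Family: aws.String("web"), // hypothetical family name
            ContainerDefinitions: []*ecs.ContainerDefinition{
                {
                    Name:   aws.String("app"),
                    Image:  aws.String("example/app:latest"), // Docker repository + image
                    Cpu:    aws.Int64(256),                   // CPU units
                    Memory: aws.Int64(512),                   // MB
                    Links:  []*string{aws.String("db")},      // how the containers are linked
                    PortMappings: []*ecs.PortMapping{
                        {ContainerPort: aws.Int64(80), HostPort: aws.Int64(8080)},
                    },
                },
                {
                    Name:   aws.String("db"),
                    Image:  aws.String("postgres:9.3"),
                    Cpu:    aws.Int64(256),
                    Memory: aws.Int64(512),
                },
            },
        })
        if err != nil {
            panic(err)
        }
    }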
Yes, but there are a lot of ways that "the containers are linked together" could be implemented, and some of them (e.g. a key-value store) require modifying application code quite a bit, whereas others (e.g. DNS) do not.
Wasn't there just an AWS announcement yesterday about the ability to register VPC-private DNS records in Route 53? It screamed "SkyDNS competitor" to me but I couldn't figure out what Amazon wanted such a thing for. Makes sense now.
Route 53 launched private (VPC) DNS last week. It's actually a common pattern to manage EC2 instances via DNS records. Many people had built this on top of the public Route 53 offering; see zonify from Airbnb as an example. Private DNS improves on that model, as the VPC instances never have to communicate with the public internet now.
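In that model, registering an instance or container endpoint is just an UPSERT into the private hosted zone; a hedged Go sketch (the zone ID, record name, and IP are placeholders) might look like this:

    package main

    import (
        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/route53"
    )

    func main() {
        sess := session.Must(session.NewSession())
        svc := route53.New(sess)

        // UPSERT an A record in a VPC-private hosted zone so other instances
        // in the VPC can find this service by name, without touching public DNS.
        _, err := svc.ChangeResourceRecordSets(&route53.ChangeResourceRecordSetsInput{
            HostedZoneId: aws.String("ZPRIVATEZONEID"), // placeholder zone ID
            ChangeBatch: &route53.ChangeBatch{
                Changes: []*route53.Change{{
                    Action: aws.String("UPSERT"),
                    ResourceRecordSet: &route53.ResourceRecordSet{
                        Name: aws.String("app.internal.example."),
                        Type: aws.String("A"),
                        TTL:  aws.Int64(60),
                        ResourceRecords: []*route53.ResourceRecord{
                            {Value: aws.String("10.0.1.25")},
                        },
                    },
                }},
            },
        })
        if err != nil {
            panic(err)
        }
    }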
You install an ECS agent on each host; it runs alongside dockerd and reports state back to the central ECS API. You can query that API for service discovery.
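So a crude form of discovery is simply asking that API what is running where; here's a hedged Go sketch (the cluster name and an aws-sdk-go-style client are my assumptions):

    package main

    import (
        "fmt"

        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/ecs"
    )

    func main() {
        sess := session.Must(session.NewSession(&aws.Config{Region: aws.String("us-east-1")}))
        svc := ecs.New(sess)

        // Ask the central API which tasks are currently running in the cluster...
        tasks, err := svc.ListTasks(&ecs.ListTasksInput{Cluster: aws.String("demo")})
        if err != nil {
            panic(err)
        }

        // ...then describe them to learn which container instance each one
        // landed on. That pair of calls is a crude service registry.
        detail, err := svc.DescribeTasks(&ecs.DescribeTasksInput{
            Cluster: aws.String("demo"),
            Tasks:   tasks.TaskArns,
        })
        if err != nil {
            panic(err)
        }
        for _, t := range detail.Tasks {
            fmt.Println(aws.StringValue(t.TaskArn), "on", aws.StringValue(t.ContainerInstanceArn))
        }
    }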
No mention of Elastic Load Balancing integration or even EBS integration. Thus avoiding the 2 hardest problems in container management.
To make this not suck, you will still need a proxy layer that maps ELB listeners to your containers, and if you intend to run containers with persistent storage, you are going to be in for a fun ride.
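The proxy layer doesn't have to be fancy; something like the Go sketch below (port 49153 is a placeholder for whatever host port Docker published) is the kind of shim that ends up sitting between an ELB listener and the container:

    package main

    import (
        "log"
        "net/http"
        "net/http/httputil"
        "net/url"
    )

    func main() {
        // The ELB listener forwards traffic to :8080 on the instance; we relay
        // it to whatever host port Docker happened to publish for the container.
        target, err := url.Parse("http://127.0.0.1:49153")
        if err != nil {
            log.Fatal(err)
        }
        proxy := httputil.NewSingleHostReverseProxy(target)
        log.Fatal(http.ListenAndServe(":8080", proxy))
    }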
It's probably best to integrate functionality for interacting with storage systems into Docker itself, perhaps as a script-hook interface similar to the way Xen works.
So Azure, GCE, and now EC2 all support Docker natively. Sorry, Canonical and LXD, but Docker has basically won at this point. There simply isn't a good reason to "compete" when you can just add features to Docker instead.
Well, LXD doesn't actually exist yet, so Docker couldn't possibly be built on it.
You are likely confusing LXD with present-day LXC, which is understandable. Thinking that Docker is built on LXC is also understandable, but not quite right.
Docker used LXC as the default container implementation for much of its lifespan. However, it has since dropped LXC as the default and uses libcontainer instead.
The Linux kernel does not actually have a specific container implementation. Userland tools such as LXC ( https://linuxcontainers.org/ ) and libcontainer ( https://github.com/docker/libcontainer ) use kernel namespaces, cgroups, and a variety of other features to deliver the container experience.
Incorrect; Docker is built on top of Linux kernel namespaces and cgroups. The first backend was LXC (notice the C, not D). The current backend is libcontainer, which is a native Go re-implementation (more or less) of LXC. There is a Docker backend for LXC, but it is not the default and is likely not used very heavily.
Note that there isn't really a Linux kernel feature called LXC. The LXC userspace just ties all of the namespacing and cgroup functionality into one coherent super-chroot-style environment, which is the same thing Docker does with libcontainer.
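For the cgroup half of that, the kernel interface is literally just files; here's a minimal Linux-only Go sketch (cgroup v1 layout, needs root, and "demo" is a made-up group name) that caps a child command at 64 MB of memory:

    package main

    import (
        "os"
        "os/exec"
        "path/filepath"
        "strconv"
    )

    func main() {
        // Create a memory cgroup and cap it at 64 MB.
        cg := "/sys/fs/cgroup/memory/demo"
        if err := os.MkdirAll(cg, 0755); err != nil {
            panic(err)
        }
        must(os.WriteFile(filepath.Join(cg, "memory.limit_in_bytes"), []byte("67108864"), 0644))

        // Move this process into the group; children inherit the membership,
        // so the command below runs under the 64 MB cap.
        must(os.WriteFile(filepath.Join(cg, "tasks"), []byte(strconv.Itoa(os.Getpid())), 0644))
        cmd := exec.Command("/bin/sh", "-c", "cat /proc/self/cgroup")
        cmd.Stdout, cmd.Stderr = os.Stdout, os.Stderr
        must(cmd.Run())
    }

    func must(err error) {
        if err != nil {
            panic(err)
        }
    }

Container runtimes do essentially this, plus the namespace setup, plus a prepared root filesystem, all wrapped in one tool.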
Is there a point-by-point (and detailed) comparison somewhere between FreeBSD jails and LXC (or libcontainer)? It would be very helpful to see that comparison.
This is actually a pretty reasonable comparison. It is somewhat obvious the guy knows more about BSD Jails and elaborates a bit more on it, but overall, this is pretty accurate:
I just asked that on Twitter, because I also am not getting it. It seems to me that ZFS and Jails provide identical functionality, but without Docker's networking headache.
That said, even if my simplistic synopsis is correct, bringing a Jails-like experience to Linux would still be a really solid step forward. Besides which, Jails have been underutilized at shops I've worked at. If Docker popularizes the concept, that's still a huge win.
No, Docker is not built on LXD. It used to be built on LXC, which is now an optional backend.
The grandparent is correct that LXD is a newly introduced competitor to Docker.
I won't comment on whether it's good or bad, but it's an objective fact that Canonical decided to reimplement what Docker does rather than contribute to it.
Don't worry Solomon, we still love Docker and will continue to use it :)
That being said, if they do somehow manage to get hardware-assisted containment for containers, I see it as a no-brainer for Docker to adopt it as soon as reasonably possible. Are there any plans (that you can speak of) regarding something like this, or are you waiting for LXD to be more than vaporware at this point?
I can say that there are employees of major silicon companies already working on contributing all of this to upstream Docker. I was shown a real proof of concept already, it's very promising and not at all vaporware :)
> it's an objective fact that Canonical decided to reimplement what Docker does rather than contribute to it.
What was the reason that Docker reimplemented much of LXC rather than contribute patches upstream?
Latest LXC supports features that Docker's reimplementation doesn't, and it seems likely to get further ahead feature-wise now that Ubuntu is pouring more resources into LXC/LXD.
You forgot to quote the part where I said "I won't comment on whether it's good or bad".
Canonical doesn't need to justify itself to me, no more than the Docker maintainers need to justify themselves to you. It's just how open-source works: you weigh the pros and cons of re-using vs re-implementing, make a decision, and see if the community follows you. In the case of Docker, the community followed. In the case of lxd, I guess we'll see. Either way, more choice and competition means the user wins.
Happened to find the pull request discussing the lxc-driver issue. It sounds like there was some interest in contributing upstream, but it didn't really go anywhere.
https://github.com/docker/docker/pull/5797
From the thread, it seems like the concern was just that lxc-exec wasn't well maintained and the LXC interfaces weren't stable, since LXC was undergoing heavy development. I think that's changed recently, with both LXC and Docker now past their 1.0 releases and 'production-ready'.
Docker still uses LXC if you want it to, via the lxc-exec driver and the --lxc-conf option. It's just not the default, which probably makes sense, since the LXC options mainly apply to advanced users.
So by default, Docker uses libcontainer for a simpler installation experience. But for advanced users, using the lxc driver is an option to look into.
IMO, Docker wins if it continues to play nicely with the other open-source projects people use alongside it, and also gives credit where it's due.
That's what "containers" are, of course. There's so much state in OS file namespaces that running any complex program requires "installation" first. That's such a mess that virtual machines were created to allow a custom OS environment for a program. Then that turned into a mess, with, for example, a large number of canned AWS instances to choose from. So now we have another level of indirection, "containers".
Next I expect we'll have container logistics management startups. These will store your container in a cloud-based "warehouse", and will continuously take bids for container execution resources. Containers will be automatically moved around from Amazon to Google to Rackspace, etc. depending on who's offering the lowest bid right now.