Hacker News
Ask HN: Have You Left Kubernetes?
317 points by strzibny on Aug 1, 2022 | 313 comments
If so, what did you replace it with?



We started with a full-stack k8s approach (on GKE); left (switching to plain GCE VMs); then came back much more conservatively, just using GKE for the stateless business-layer while keeping stateful components on dedicated VMs. Much lower total maintenance burden.

(Hard-won bit of experience: k8s + Redis really don't like each other if Redis (1) is configured to load from disk, and (2) the memory limit for the Redis container is somewhat tightly bounded. At least from the k8s controller's perspective, Redis apparently uses ~400% of its steady-state memory while reading the AOF tail of an RDB file — getting the container stuck in an OOM-kill loop until you come along and temporarily de-bound its memory.)

However, we're considering switching back to k8s for stateful components, with a different approach: allocating single-node node-pools with taints that map 1:1 to each stateful component, effectively making these more like "k8s-managed VMs" than "k8s-managed containers." The point would be to get away from the need to manage the VMs ourselves, giving them over to GKE, while still retaining the assumptions of VM isolation (e.g. not having/needing memory limits, because the single pod is the only tenant of the VM anyway.)
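
To make that concrete, a rough sketch of what the pod side of this could look like (the pool, taint, and image names here are made up for illustration; the matching taint would be set on the GKE node pool itself via gcloud/Terraform):

  # Hypothetical pod pinned to a dedicated single-node pool named "redis-pool".
  apiVersion: v1
  kind: Pod
  metadata:
    name: redis
  spec:
    nodeSelector:
      cloud.google.com/gke-nodepool: redis-pool   # GKE labels each node with its pool name
    tolerations:
    - key: dedicated
      operator: Equal
      value: redis
      effect: NoSchedule      # matches a taint like dedicated=redis:NoSchedule on the pool
    containers:
    - name: redis
      image: redis:7
      # deliberately no resources.limits: the pod is the sole tenant of the node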


This makes so much sense to me that I don't understand why people use Kubernetes any other way. Why must one use Kubernetes for EVERYTHING unless there is some higher-order reason?

I use Kubernetes to host something like 6 of our own services, and it excels at that and is fairly simple. Databases and other things use different services.


We did the reverse. We started with purely stateless k8s and got really comfortable with the platform before moving to support stateful loads. For databases, many vendors have dedicated operators that introduce best-practice deployments with less operational fuss when provisioning the instances. Strimzi for Kafka is an example here.
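
For anyone who hasn't seen it, a minimal Strimzi Kafka resource looks roughly like this (replica counts and sizes are purely illustrative; check the Strimzi docs for the current schema):

  apiVersion: kafka.strimzi.io/v1beta2
  kind: Kafka
  metadata:
    name: my-cluster
  spec:
    kafka:
      replicas: 3
      listeners:
        - name: plain
          port: 9092
          type: internal
          tls: false
      storage:
        type: persistent-claim       # the operator provisions PVCs for the brokers
        size: 100Gi
    zookeeper:
      replicas: 3
      storage:
        type: persistent-claim
        size: 10Gi
    entityOperator:
      topicOperator: {}              # manage topics/users as k8s resources too
      userOperator: {}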


This may be the issue with stateful operators:

Do you really think they have ALL the operations coded properly for all operational conditions?

Stateless is so much easier for operations. It's either running or it's not; A/B upgrades, yada yada.

Stateful has backups, restores, outages, patching, migrations, corruptions, fixes. For distributed systems it gets even hairier. Do they support your distribution? What if you have a multicloud or blended internal/cloud? Does the operator provide a turnkey restore from backup, and are you testing it?

Operators shouldn't be viewed as replacements for stateful operations knowledge, but it's probably what they'll be used for.

If you're using a large scale distributed stateful/database system, you need an operations team to support it, or pay for that.


Stateful has all of those problems anytime, anywhere. Even if you run a single stateful app in a single VM, you will still have all those issues.


You have a lot fewer problems in a single VM: there's (hopefully) a clear path for how to install software etc., and it is easy to understand. If your distributed PostgreSQL operator, written by someone else, fails, you're out of luck and it will be really, really hard to restore your system. With non-Kubernetes, non-automated solutions you may have some docs saying "Update the replication address and restart".

Getting back to a running system is much easier with old, traditional VM approaches.


In K8 the paths are clearer: every component that goes into a deployment is versioned. It deploys the same way in every build. It's 100% certain.

The problem with stateful app scaling and HA comes from two things: the need to make files and databases multi-read-write, and the need to make them highly available. Other than that, app scaling is pretty easy in K8 even for stateful apps.


I am wondering: did you ever look at tuning MALLOC_ARENA_MAX? This sort of constant consumption of memory aligns fairly well with the default tuning of MALLOC_ARENA_MAX, which is 8 * nproc.

We've just tuned it for a java-based app which was also stuck in OOMkill hell, and this has completely resolved the situation (MALLOC_ARENA_MAX=2).
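
For anyone wanting to try the same thing on k8s, it's just an environment variable on the container. A fragment of a hypothetical pod spec:

  containers:
  - name: app
    image: registry.example.com/java-app:1.0   # placeholder image
    env:
    - name: MALLOC_ARENA_MAX
      value: "2"          # glibc default is 8 * cores on 64-bit; fewer arenas = less fragmentation
    resources:
      limits:
        memory: 2Gi       # illustrative limit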


At Coherence (withcoherence.com - I'm a cofounder) we generally agree that GKE and other managed k8s offerings are best used for stateless workloads. Rather than move stateful workloads back to VMs, leveraging managed services will yield the best results in the long term. In the case of Redis on GCP, something like Memorystore is going to be a better fit than managing a nest of node pools over time (think about version upgrades, resource differences across environments, etc.). However, the complexity of managing the different kinds of configuration across GKE and managed services can be a nightmare.

That's a problem we're hoping to help solve, where you define your application and its dependencies, and we help run it in the right way to leverage managed cloud services across environments without passing that headache on!


Crossplane [1] is a great way to create and manage resources across cloud providers and MSPs via Kubernetes objects.

[1] https://crossplane.io


Config Connector [1] is also an option in this space for GCP, it supports many GCP resources and thus far our experience with it has been largely positive.

[1] https://cloud.google.com/config-connector/docs/overview


Have you checked out Managed Instance Groups? I used them a while back, and they worked as advertised :)

https://cloud.google.com/compute/docs/instance-groups#manage...


We do something similar with Elasticsearch. We use ECK (a k8s operator) but give each ES node a full k8s node using pod anti-affinities and taints. That way we can just select a sensible disk and instance size on our node pool and not worry about resource requests/limits. It's been working very well so far.

ES handles node restarts or upgrades pretty gracefully though. I'd imagine for databases or "non-clustered" things you'd have to consider GKE's aggressive upgrade schedule. We use CloudSQL for some databases, but our larger ones are still on GCE because we get more control over replication and CDC, and can use tools like proxysql to reduce downtime.
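
For reference, the "one ES pod per k8s node" part of that is roughly this kind of anti-affinity plus toleration on the pod template (the label and taint names here are illustrative, not ECK's actual defaults):

  affinity:
    podAntiAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: elasticsearch               # assumes the ES pods carry this label
        topologyKey: kubernetes.io/hostname  # at most one matching pod per node
  tolerations:
  - key: dedicated
    operator: Equal
    value: elasticsearch
    effect: NoSchedule                       # matches the taint reserving the node pool for ES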


> Much lower total maintenance burden.

Devil's advocate but isn't having to maintain VMs (and then software deployed to those VMs) and k8s YAML/charts/whatever more "maintenance burden" than just one or the other?


I guess it would depend on how you manage your deployments/infrastructure, but in general I would say no. In my experience stateless services are all managed differently because they typically need fewer resources and can be scaled more easily. Services that require state tend to have a more hands-on approach since they are usually in the critical path for many other services. Deploying a cookie-cutter service is something where k8s excels, so it makes sense to use it for those types of workloads.


How do you define a stateless server? As... not needing to talk to a filesystem or a cache server or a database?

Just something that takes in API requests (or a cron-like scheduled job) and makes other API calls/does plumbing?


At least in my experience, no.

We used managed services for our stateful stuff which significantly eased the operational burden there. Might be a different story if we looked at doing the absolute minimum cost optimization. However, at least for us, the extra cost of managed services is worth the price.

The yaml tends to be a "one and done" sort of thing. We touch it MAYBE once every 2 months if that.


Why did you run Redis on K8 in the first place? (One of the reasons we did not move to K8 was the default recommendation to not run Redis, SQL etc on the clusters.)


Our Redis use-case started off as ephemeral per-deployment storage for things like rate-limit counters, before evolving into durable per-deployment storage for things like service-metadata discovery.

Ephemeral Redis is well-suited to k8s — you can treat it as just a sidecar to your app layer deployment. Durable Redis is not. But sometimes the transition can sneak up on you.
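
The ephemeral flavour really is just another container in the app's pod template. A sketch (names and image tags are placeholders):

  containers:
  - name: app
    image: registry.example.com/api:1.0    # placeholder app image
    env:
    - name: REDIS_URL
      value: redis://localhost:6379        # sidecar shares the pod's network namespace
  - name: redis
    image: redis:7-alpine
    args: ["redis-server", "--save", "", "--appendonly", "no"]   # no RDB, no AOF: data dies with the pod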


> The point would be to get away from the need to manage the VMs ourselves, giving them over to GKE, while still retaining the assumptions of VM isolation (e.g. not having/needing memory limits, because the single pod is the only tenant of the VM anyway.)

Isn't this just moving the problem from per-pod resource constraints to per-VM resource constraints?


Yes. They could have just set the memory limit high enough to handle that workload. There's really no difference.


With the pod, you still have the per-VM/node resource constraints, so it's an additional layer.


You can set node affinity on PersistentVolumes.

  nodeAffinity:
    required:
      nodeSelectorTerms:
      - matchExpressions:
        - key: kubernetes.io/hostname
          operator: In
          values:
          - <hostname>
this ensures that your workload will be rescheduled on the matching node


Thanks for your insights! One question regarding the approach to hosting stateful components you describe last:

What's the difference between doing this as single-node node pools vs pod constraints like anti-affinity?


It would make node pool operations like version upgrades more predictable, since you'd know for sure which apps are running on a given node pool.

It can also make monitoring resource usage a little easier since you can just monitor at the node level


While I like Kubernetes generally, I agree that the OOM handling is not ideal; it feels quite a bit like an "exercise left to the reader".


For stateful stuff, give Google's Filestore a try. It has pretty good performance.

As for K8-managed VMs - that's a good idea, resource-management-wise, security-wise, etc.


I've yet to encounter a non-smelly k8s deployment that was started before everyone knew how it works or why it works.

On the other hand, once everyone on the team has experience building such a system from scratch, then deploying k8s and using it somehow becomes straightforward.

It's almost as if we need to learn how a tool works before being able to use it effectively.

Anyways, what we (actually didn't) replace it with:

  - Don't let your devs learn about k8s on the job.
  - Let them run side-projects on your internal cluster.
  - Give them a small allowance to run their stuff on your network and learn how to do that safely.
  - Give your devs time to code review each other's internally-hosted side-projects-that-use-k8s.
  - Reap the benefits of a team that has learnt the ins and outs of k8s without messing up your products.


Maybe it's just my team, but devs don't need to know k8s. It certainly doesn't hurt, but they should be able to write code and get their jobs done without knowing much about k8s at all. Basic shit like how to get logs, sure, but that's a given for all platforms.


> but devs don't need to know k8s

If somebody else on the team already knows k8s. The problem is a lot of places just give devs admin access and let them go hog wild. If devs don't know k8s they can't make significant changes without waiting for the one guy who knows k8s to do it.

Give a man a k8s, he will deploy for a day. Teach a man to k8s, he will deploy for at least 5 years while the hype cycle continues.


But can your team succeed without having people who:

1. Are motivated by the sort of curiosity that would frustrate them if they were blocked from knowing about k8s

2. Are motivated by the sense of responsibility that would unnerve them if they didn't understand 1 abstraction layer beneath their work.

?


You don't need to solve this problem if k8s already works for you.


What do you mean by side projects? Are they paid?

If you want your Devs to learn kubernetes you should pay them for doing it.

If you can't, hire a contractor with the expertise you need.


I would interpret 'side project' here as a work project that is not your primary and has low stakes for delivery into production timelines or expectations.


I don't really understand this. Work is usually prioritized. If it has low stakes for delivery into production then it should have a lower priority than other activities, but that doesn't make it a side project.


If you don't let developers prioritize some time to play around with new concepts and ideas and to learn how they work, they'll do their playing and learning in your products.


I feel like you're conflating multiple unrelated topics together. This isn't advice on how to use another team's experience, or cut costs, or maintain your team morale.

It is difficult to tell at a glance whether an engineer is qualified to effectively use a tool. Letting them self-train by working on side projects in isolation compounds this effect.

The goal of this exercise is to give time and space to your devs to practice in a safe environment, while allowing them to push, deploy and review projects internally as if they were core products, so that other SMEs are allowed to spend some time every week reviewing those projects for smells and issues before those ideas make it into a core product.


Running a side project on company infrastructure seems like a disaster waiting to happen for both parties.


It's true that running side projects on a company cluster in an environment where no one is quite sure how to use the tool properly is a disaster waiting to happen.

Fortunately, k8s can act as a very secure sandbox when it's configured properly, so you'll know how to mitigate such a disaster once your company has trained its engineers on how to use the tool effectively.


I took it to mean "sandbox"


I took it to mean that infamous mythical Google 20% project.


It's not mythical. Gmail and Google Calendar were both 20% projects. They were built to solve for bad internal web mail (mirapoint, I think... ?) and Oracle Calendar (which was absolute trash on Linux).

It's not as common as some would have you believe but it is real. A teammate spent his 20% on underwater topography on Google Earth and another spent his on the glider thing.


I'm not surprised 20% is not more common, but I am surprised among 20%-enabled companies that the norm isn't to have the company host side projects for all employees. Insurance, snacks, gym memberships, mobile plans and laptops, but not the one thing all hackers need?

Having an employee turn up a popular side project while being vendor-locked onto your platform sounds like it should be more popular among rich people.


Yes I was being deliberately hyperbolic.

The way it should actually be sold to the average entry-level fresh-out-of-college Google hire is probably closer to the way I framed it, though, than the examples of Gmail and Google Calendar, which are pretty much two unicorns.


It likely means internal tools that stay within the company, like the company's internal wiki.


You can provision vclusters to give each dev (or even each team) a space to play with the environment without it being a problem.

Cattle not pets after all.


Not yet. We are still deluding ourselves that the 3x cost increment and insane complexity increase we can barely manage to keep spinning is actually a business benefit.

Note: this isn't everyone's end game but I suspect it's realistic for a lot of people.

I would like to go back to cleanly divided, well-architected IaaS and Ansible. It was fast, extremely reliable, cheaper to run, had a much lower cognitive load and a million fewer footguns. Possibly more important: not everything can be wedged into containers cleanly, despite the promises.


Also a big fan of sticking to Ansible and plain VMs, at least for most cases I've encountered. To me, a VM in the cloud already feels like a container, and you can use the cloud provider's APIs to scale virtual instances up and down as needed.


> To me, a VM in the cloud already feels like a container

This is the mental abstraction I've been operating with for over a decade now.

All of our products are monolithic binaries that can be installed on bare-ass windows or linux machines. For all intents & purposes, basic AWS/Azure/et. al. VM hosting is our containerization strategy. We just pushed the tricky bits down into our software.

95% of our pain is resolved by using a modern .NET stack and leaning hard on their Self-Contained Deployment model. Our software has zero external dependencies at deploy time, so there isn't much to orchestrate. Anything that talks to a 3rd party system is managed purely via configuration in our software.


Agreed. But I do think there are places for containers. I will often package single binaries in containers for built-in distribution and rolling-upgrade capabilities, especially for tooling that relies on a lot of externalities that can taint the system. Python applications, as an example, are much easier to deploy and manage this way than by dealing with Terraform / Ansible to provision correctly. Even if you're just using host networking and good ol' Docker, there is a ton of operational upside with very low maintenance overhead (mental and otherwise).

I'm working with a product now that's made their k8s deployment the standard and all it's done is create bigger issues. Ops got behind on Strimzi and so we got stuck on 1.21 because we couldn't upgrade due to being locked to the Strimzi version. This caused issues because of log4j and we ran into a wall quickly with customers on GCP as soon as 1.22 ended up as GA. Honestly I'm not sure we're getting much, if any, overhead advantage since I feel like the app has become bloated due to container creep.

That and supporting 4 different ways to provision storage across customers on every cloud / on-prem is a nightmare. Customer environment installed applications on k8s is a nightmare today.


> To me, a VM in the cloud already feels like a container

A VM provides better isolation than a container as it has a separate kernelspace. Hence the DevOps mantra "containers don't contain".


Unless you have massive scale, VMs are your best option. If you need VM configuration on startup (elastic scaling), you may need to maintain your own image. SaltStack and/or Fabric are good alternatives to Ansible.

You could look at containerization without K8S (podman or docker) especially if you use python and don’t want to mess with the Linux native python installation.


Unless you have money to burn, K8s excels compared to VMs in my experience.

Its original purpose wasn't to do elastic scaling or anything like that - it was to binpack workloads onto a set of nodes, and not everyone has Silly Valley money to pay Silly Valley prices (especially when one's currency is weak against the dollar).


> Its original purpose wasn't to do elastic scaling or anything like that - it was to binpack workloads onto a set of nodes

This is arguably still its primary purpose, and all the rest of its features are ancillary and only exist for the sake of operational convenience.


A considerable portion of the internet, even people who supposedly know k8s, has this weird notion that it's for "scaling up" ... except they never talk about scaling what a single engineer can do, just less useful things like dynamically adding lots of servers ;)


You might consider migrating to systemd-controlled, rootless, dockerless Podman. Helm even has a plugin for podman.


I wouldn't bother. I'd just consolidate our product out of microservices and run more small clusters of monoliths all started from systemd.


Do you know of some writeups for this? I am like halfway there but just mess with podman+systemd on the weekend.


We did. Our use case is spinning up containers on demand in response to user actions, giving them ephemeral, internet-routable hostnames, and shutting them down when all inbound connections have dropped. Because users are waiting to interact with these containers, we found the start times with Kubernetes too slow and its architecture to be a bad fit.

We ended up writing our own control plane that uses NATS as a message bus. We are in the process of open sourcing it here: https://github.com/drifting-in-space/spawner


> we found the start times with Kubernetes too slow

Just curious if you could elaborate here? I work with k8s on docker, and we're also going to be spinning up ephemeral containers (and most of the other things you say) with jupyter notebooks. We're all in on k8s, but since you might be ahead of me, just wondering what hurdles you have faced?

Our big problem was fetching containers took too long since we have kitchen sink containers that are like 10 GB (!) each. They seem to spin up pretty fast though if the image is already pulled. I've worked on a service that lives in the k8s cluster to pull images to make sure they are fresh (https://github.com/lsst-sqre/cachemachine) but curious if you are talking about that or the networking?

From what it looks like in your repo it might be that you need to do session timing (like ms) response time from a browser?


Jupyter notebooks are actually a use case we think about a lot, you can try a live demo with a Jupyter notebook here: https://jamsocket.com/tmpenv/

It wasn't really one thing with Kubernetes that was slow, but that the more we tried to optimize it the less of core Kubernetes we were using and so the less value we were getting for the complexity tax we were paying. The image pulling you mention is a good example of that; having pre-pulled images is a big factor, but we have too many images to push every image to every node, instead we'd like the scheduler to be aware of which node has which image. We could do that with node affinity, but what we'd end up building would be more work than if we wrote our own scheduler to support it from day one.

> From what it looks like in your repo it might be that you need to do session timing (like ms) response time from a browser?

Our goal is subsecond container starts. We're not there yet, and might not get there with Docker, but we have a POC that is there with WebAssembly-based workloads. Too bad those are rare :)

(By the way, I'm always happy to chat about this stuff, my email is in my profile)


> we'd like the scheduler to be aware of which node has which image

The kubernetes scheduler should be aware of which node has which image, that is why the Node object has the status.images field: https://kubernetes.io/docs/reference/generated/kubernetes-ap....

It turned out to be somewhat tricky, because it increased the size of the Node object, and colocating node heartbeats onto the same object meant that a bigger object was changing relatively often. But that was addressed by moving heartbeats to a different object: https://github.com/kubernetes/enhancements/issues/589


TIL, thanks. Looks like there's a corresponding ImageLocality score used by the scheduler: https://kubernetes.io/docs/reference/scheduling/config/#sche...

It doesn't get all the way to what we want, but it could be used to build a piece of it.
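
If someone did want to lean on it harder, the plugin's weight can be raised in a KubeSchedulerConfiguration (a sketch; this assumes you can pass a custom scheduler config, which managed offerings may not allow):

  apiVersion: kubescheduler.config.k8s.io/v1
  kind: KubeSchedulerConfiguration
  profiles:
  - schedulerName: default-scheduler
    plugins:
      score:
        enabled:
        - name: ImageLocality
          weight: 100     # score plugins default to weight 1; higher strongly favors nodes that already have the image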


Very cool, I didn't know about this either. I feel like so many of these features are coming in which is great, but also part of the drag of k8s is the kind of constant upgrade churn and having to keep your yaml fresh.


AWS has put work into fast-starting containers [1] using tricks like lazy loading container storage, profiling container startup, non-lazily priming critical blocks, and caching shared blocks. IIRC parts of it are open source. I don't know if enough of it is open source to be helpful, but it's cool stuff!

[1] Gigabytes in milliseconds: Bringing container support to AWS Lambda without adding latency. https://www.youtube.com/watch?v=A-7j0QlGwFk


On the Google side, Artifact Registry supports image streaming

https://cloud.google.com/kubernetes-engine/docs/how-to/image...


Doesn’t the latest version of k8s let you use your own custom scheduler?


You can, but that falls into this bucket:

> the more we tried to optimize it the less of core Kubernetes we were using and so the less value we were getting for the complexity tax we were paying

Since we were headed down that path, we took a step back and asked what we were really getting out of Kubernetes, and most of it was things that were orthogonal to our intended use case. The way Kubernetes is architected around control loops works great for its intended use case, but we wanted a more event-driven system.


Event driven ... like a streaming data pipeline? Given your comment about Jupyter notebooks, that makes sense. It might be the Mesos project is better architected for your use-case. Then again, I think Mesos ported some of their schedulers to Kubernetes.


If you're pulling big images you could try kube-fledged (it's the simplest option, a CRD that works like a pre-puller for your images), or if you have a big cluster you can try a p2p distributor, like kraken or dragonfly2.

Also there's that project called Nydus that allows starting up big containers way faster. IIRC, starts the container before pulling the whole image, and begins to pull data as needed from the registry.

https://github.com/senthilrch/kube-fledged

https://github.com/dragonflyoss/Dragonfly2

https://github.com/uber/kraken

https://nydus.dev/


Lazy pulling is already supported by a lot of container runtimes, most notably containerd with estargz

https://github.com/containerd/stargz-snapshotter/blob/main/d...


Ah thanks! "Lazy pulling" is what I was looking for. I was trying to find estargz (didn't remember the name) and I couldn't find a proper keyword to do it :P :D


Yeah I think we considered this, but we want the container to actually run as the user and have all the permissions set up so they can have all the right access on the cluster (kind of like a PaaS), although I think we are doing some of the stuff with the starting the container while the data is still streaming down. Black magic.


Wow, this is excellent! At a previous job, we had been using k8s + knative to spin up containers on demand, and likewise were unhappy with the delays. Spawner seems excellent.

One question: have you had to do any custom container builds on demand, and if so, have you had to deal with large "kitchen sink" containers (e.g. a Python base image with a few larger packages installed from PyPI, plus some system packages like Postgres client)? We would run up against extremely long build image times using tools like kaniko, and caching would typically have only a limited benefit.

I was experimenting using Nix to maybe solve some of these problems, but never got far enough to run a speed test, and then left the job before finishing. But it seems to me some sort of algorithm like Nixery uses (https://nixery.dev) to generate cacheable layers with completely repeatable builds and nothing extraneous would help.

Maybe that's not a problem you had to solve, but if it is, I'd love your thoughts.


It's always been my understanding that with things like k8s and other orchestration stuff, you're supposed to spin up before you need the capacity? You set a threshold, like 75% capacity, and if you're over that for a bit, you spin up a new container(s) to get you back to under effectively 75% capacity.

Is that not how this works?


Yes, that's the scaling model that works best for Kubernetes if the use case supports it. Our use case precludes it, because we are focused on uses where containers need to be spun up on a per-user (or per-group-of-users) basis as they use an application.


Precludes feels like the wrong word here. Nothing prevents you from satisfying your use case and spinning up VMs before they are needed.

I wrote the student vm system for udacity, and I spun up student vms before they needed them, with some last mile loading to finalize the files they need. The student VMs were not using k8s, although a small piece of the infrastructure did.

I worried most about untrusted users working in a complex environment with the ability to harm the experience of other users, and just used GCE.

For me, boot time was < 5 minutes, so if you can predict the next five minutes of demand, you can boot those machines early. If you are wrong, you will pay extra or someone will wait extra time, but still less than the full boot time. Generally it takes less than 10 seconds to access a VM with your coursework on it.


Keep a pool of e.g. 5 pods that are warm and good to go. When a user needs one, the pod pulls in the necessary config data, which should be quick as it's already initialized.


Can you not have a queue of whatever type of containers the users are likely to be using already ready to go, like the GP suggests?


This is really awesome. Thank you for sharing this.

One of my ideas lately has been to upgrade FaaS to a full-on server after a set amount of traffic. Or said differently: spin up a dedicated server that serves the same app as callable functions (a la scalable RPC), upgrading to a dedicated instance composed of said functions. The best of both worlds.

Combine the scale-to-zero of serverless with the scalability and capacity of a dedicated server.


Kind of curious what made it too slow for your use case? I'm guessing you did not want users to wait for something like kube-dns to update or the workload scheduler? Of course things like spinning up a Pod can be slow. Or non-Kubernetes things like doing DNS ACME challenges could affect things.

But on other hand, I can't quite figure out why something would prevent, you, yourself, from running the service that hosts the VMs that hosts the containers on demand on Kubernetes.


Our goal is sub-second container starts (admittedly, we're not there yet), and with Kubernetes we'd have to create Pods, create Services, wait for the scheduler to update, etc. We didn't go down the rabbit hole of profiling where the slowness was, but it was clear that Kubernetes just wasn't built with the type of speed we wanted. We realized we'd have to contend with a lot of design decisions that were the right choice for the things Kubernetes optimizes for (replication, resiliency), but not the right choice for us (fast launches of ephemeral containers).

> But on other hand, I can't quite figure out why something would prevent, you, yourself, from running the service that hosts the VMs that hosts the containers on demand on Kubernetes.

I'm not sure I understand this part, I guess we could use Kubernetes operators to scale up the underlying compute resources and manage the containers ourselves? This adds a lot of complexity for our use case.


I just wrote a controller that does pretty much that – spawn containers on demand and report back status changes. While this solution does require some knowledge, it so far has been perfectly reliable and reasonably fast. I can fathom the need for processes to spawn and tear down faster in specific use cases than the Kubernetes scheduler would allow for, but for us a few seconds of wait time has been perfectly reasonable.


Is there a fundamental reason why Kubernetes cannot start pods and services fast (outside of pulling images of course!)?


There isn't. K8 does start pods and services fast. I'm able to launch an entire stateful WordPress pod (multi-container) in just ~10 seconds, including the provisioning and attaching times of PVs from scratch. This is at DigitalOcean. You can easily run stateful things like WP if you build your pods well and use PVCs - even without needing to make them StatefulSets. It ends up being a neatly constructed, integral VM living on virtualization. Everything is taken care of by K8.

When using K8, if you use the most basic K8 features and concepts, things generally work out pretty ok.


10s isn't terribly long, but honestly I think 1s should be achievable and would open up more use cases for K8s in general (cloud provider physical machine latency notwithstanding, of course).


1s is very achievable iff you have spare capacity and images already pulled. There are good reasons the latter is hard to achieve, and the former is a tradeoff few are willing to make as it increases costs.


No, in fact we've gone running towards it after some initial success, especially when combined with ArgoCD for CD and Istio as a service mesh. My company has a lot of experience with running applications on VMs and Amazon's ECS. Our VM automation ultimately became expensive to maintain and ECS had its own set of issues I could probably fill up a blog post with.

From the Operations side, Kubernetes is scary. It's easy to screw things up and you can definitely run into problems. I understand why folks who work mostly on that side of the house are put off by the complexity of Kubernetes.

However, from the application side of things, our developers have been THRILLED with Kubernetes. For most developers my company provides a nice paved-road experience with minimal customization required. For advanced use cases, we allow developers to use the Kubernetes API (along with ArgoCD + Gatekeeper policies) as a break-glass type of approach. Istio gives the infra team the ability to easily move services between clusters and make policy changes. It also allows us to make use of Knative, although I think the Istio requirement is no longer there.

That said, you should be using managed Kubernetes wherever possible and not running your own clusters. That's where trouble lurks.


ArgoCD was our missing linchpin for getting workloads migrated over and supported.

It makes it that much easier to actually use the cluster rather than mess with endless configuration tooling. Is it the best engineered tool? Probably not. But it's the one that works best for us.


Same story for us. We’ve been moving towards k8s and it’s been great for app devs. We ran in plain VMs for a decade and it was a good time to switch at 2k employees, maybe 500 devs?


I’m curious, do you use Vault, Datadog, or some Falco maybe? What is the rest of your Infra stack?


I migrated a company from k8s to ECS/Fargate in 2019. Kubernetes is very flexible, but I opted for simplicity.

The result of the migration was that there is little underlying infrastructure to maintain, and ongoing operational costs were lowered by 50% year over year. The CTO and I liked the setup so much, we started converting another large client of theirs. I followed up with them at the beginning of 2022 to see how things were going, and they still love it. There is so little maintenance, and now they have more time to focus on what they do best: software!

Other options on the horizon that I'm testing include utilizing AWS Copilot with ECS/Fargate, and/or Copilot with Amazon App Runner.


I have settled on the ECS camp as well. Took a run at Kubernetes and was blown away by the complexity. With ECS/Fargate I don't spend any time on it. It just works for our setup.

I still wonder from time to time if I am missing something not going Kubernetes.


Are you big enough to need Terraform? If the answer is yes, you may have a good justification to move to Kubernetes: migrating tf->k8s brings lots of benefits for the app teams (if they care). If you just YOLO set up your cloud in the AWS web console and you're fine with that, then you may not see much lift. A good reason to use a declarative (often infrastructure-as-code) approach to deployments is that it improves bus factor and the ability to hire people who can pick up and maintain the infrastructure.


AWS CDK exists and IMO is way better than terraform if you're on AWS. So much so that terraform is making their own variant to be more CDK like.


I didn't know they were trying to be like CDK. Now I have to look this up :)


The CDK for Terraform went GA today (https://www.terraform.io/cdktf and https://www.hashicorp.com/blog/cdk-for-terraform-now-general...). It's a framework that extends the capabilities of CDK so that you can use the whole Terraform ecosystem of providers and modules.

Under the hood it means that the `cdktf synth` command ultimately generates Terraform configuration that can be executed like any other Terraform config. It's definitely not a case of Terraform trying to be like CDK. Each has its strengths; choose whichever makes the most sense for your workflow.


We are big users of Terraform. I couldn't imagine running our setup without it or some other tooling like CDK.


What about Pulumi? I love it


I use AWS Copilot and find it to be really easy to use and helpful. It is still a pretty young project and as such doesn't really handle all the edge cases, but for the things it supports, it makes using ECS even easier than it already is.


Chose Fargate over K8 too. I made the call, so no need for migrations :)


We have had a few teams try, but as soon as you go beyond "I want to run some code for a bit", nobody really has anything for you. Instead of trying to re-invent the wheel (service discovery, mutual TLS, cross-provider capabilities) successfully, it went downhill quite fast and they moved back. (this was mostly due to cost as other services can get expensive really quickly, and because of the lack of broadly available knowledge for the custom stuff they had to build)

If a team were to start with no legacy and no complexity and there isn't going to be multi-team/multi-owner/shared-services I could see them using something else. But that applies to anything.


I've been a K8s user for some time, but it does drive me bat shit crazy. My main beef with it is I often cannot discern the logic of how things work. For the developer platforms and systems I enjoy working with, you are presented with primitive axioms that you can then bootstrap your knowledge upon to derive more complex ideas (e.g., any decent programming language, or OS). K8s does not work that way -- at least as far as I can tell. A priori knowledge gains you nothing. When I run into a problem on K8s, I copy/paste the error into a search engine and I am presented with a 200 message long GitHub issue with users presenting their various solutions (how does this command relate to my original problem, who knows?), some work, but most of the time, they don't and you are left in a bigger hole than when you started. I end up tearing the whole things down and starting over, most of the time. That last comment is the biggest "code smell" for me with K8s. When it is easier just to nuke the thing and begin again, there is a problem.


I'll put blame on bad documentation and tutorials becoming the norm for k8s versus what was common early on, because k8s is very much about building more complex ideas from primitive axioms. The whole resource model is built around simple ideas being used to build more complex ideas.

Wish there was some better docs out there, not sure if I could handle writing one from scratch :/


I've never gotten too deep with K8s. It always came across as incredibly complex to maintain, with limited managed service support. Whenever I spoke to engineers pushing it, the problems it solved didn't resonate with me as someone who's spent the last 10 years running hundreds of services across thousands of servers.

These days I'm a huge fan of CDK and Pipelines style deployments. I prefer to treat my compute layer as a swappable component which I'll change as and when I need to. I tend to lean towards serverless offerings which take care of the internal scaling details if I can while still giving me a traditional "instance", and if I can't then I'll go for the next best managed offering.

I've yet to see an example where internal tooling doesn't become a mess over time, and K8S requires a ton of work to keep things sensible.


Yep CDK and/or Pulumi. It’s very easy to map your own custom concepts and logic to your cloud provider, rather than making a cloud provider on top of the cloud provider you already pay for.


I've moved to a company that doesn't use Kubernetes at the moment (and that's a 100% calculated and rational decision). What I see is that a lot of effort is put into providing functionality that Kubernetes brings. In the case of running a bunch of services, when you wish to do that in a stable and secure way, Kubernetes cuts down running costs. It covers so many cross-cutting concerns that reimplementing those capabilities is not possible unless you have heavy $$$ to spend.


I think you're right to point out how much ground k8s covers and to replace every vertical that it integrates could be challenging/costly. But k8s is not a zero-cost abstraction, so I think the calculus here is often more nuanced.

In the case of my org, we optimized for the features we thought were valuable and amortized that effort over time. Notably this was early in k8s history (2014/2015), but the fruits of those efforts have aged well so far (8 years or so). Small code footprint to cover service discovery, cert provisioning, release orchestration, and configuration management. The whole devops stack is less than 3k SLOC. Service ecosystem is ~150 distributed systems, roughly about 5 million SLOC, running on just over 1k servers on AWS.

I think if the aim is not to completely replace what k8s does, but to cherry-pick the features that give you some Pareto distribution of value, sometimes it's worth it to build in-house. Nothing wrong of course with going with k8s for many orgs, but in our case we didn't have to reinvent the whole wheel to live without it.


3k SLOC of devops code to cover a system of that scale is super impressive. And I agree, there's no reason to invest in k8s when only a small fraction of its capabilities is necessary (or when you have a team that's already experienced). Otherwise we may end up bending our requirements to k8s abstractions (even though they are well designed).


At least for most hosted solutions, Kubernetes seems to be "cheap" (compared to other offerings at that provider) after you pass some reasonable threshold: something like 6-8 services each running 3-4 instances or so. This threshold seems to roughly end up being $500/m.


When I hear numbers like this I wonder what percentage of the compute and memory resources of that 18-32 node cluster — not to mention the engineering that went into making it work — goes into the “hard” problems of horizontal scaling, cramming stateful services into an architecture designed for stateless ones, etc.

You can actually get a couple of pretty beefy bare metal boxes for that budget. Or a couple of more modest ones for app servers plus a nice big RDS instance with all the trimmings. Based on past experience, that’ll get you to a few hundred rps for even a fairly complicated, poorly-tuned Rails or PHP app; your well-factored Go API server should handle 10x that pretty easily.

You might have to write some Bash or systemd unit files instead of a bunch of YAML, which may or may not bug you. I find shell easier to understand and debug than YAML-based scripting but YMMV.


Right, I don't think you can beat buying the machines, of course.


You don't have to buy - Hetzner, OVH, etc will happily rent you these machines dirt-cheap and that includes hardware maintenance & replacement.


Sure, that solves some problems. Try to get your average CTO on board, though!


OVH may be problematic if you don't speak French (their English support sucks), but Hetzner is pretty famous and well regarded. Their network is great. They provide a lot of automation. They provide little tooling compared to AWS, but what is there works, and it works great. It's an engineering-minded provider. Also, its VM pricing is the lowest. Best of all, their egress pricing cannot be matched by anyone else in the US or Europe.


Contabo is interesting too…


First time I hear of Contabo.


Same, we are on ecs and there is a lot of reinventing the wheel


Genuinely curious what things k8s solves that you are reinventing. I run ECS and find that using microservices and their managed offerings (i.e. RDS, SQS) we don't need any complicated topology to do complex work.


1. Creating confined development environments containing multiple services. With k8s we can spin up a cluster locally, install all the dependencies and develop on it.

2. Remote development. With k8s we can develop right out of the cluster; ECS has no equivalent.

3. Installing OSS software. K8s has loads of supported packages for OSS tooling.


Nope, I like k8s. What I don't like is people trying to be overly smart with it and leaving a configuration hell of templates, weird network configurations and broken certs behind them. For my personal workloads it's all basic containers with a reverse proxy, though.


Hell no,

I remember managing hundreds of virtual machines in datacenters & cloud, using Ansible and a myriad of other tooling.

It's nice when you're at a small scale and you don't have a lot of people making changes, but over time as it grows the pain grows with it unless you've enforced a consistent cattle model.

The longer VMs live with custom changes/code and updates over time the more brittle they can become. Part of the cattle model is so that you can recreate/rebuild when changing code so things stay consistent. The drift from infrastructure as code can be scary otherwise.

With the cattle model you need to have pipelines in place to build new VM images for infrastructure updates (packer etc), have multiple APIs to hit (easier in cloud) to upload images and serve them in a non damaging way. (HA deployments/rollouts/dealing with load balancers) It's certainly a non-trivial amount of work.

With Kubernetes, a lot of this tooling comes out of the box. You've got autoscaling, load balancing, health-checks, limits/requests, failure mitigation, service mesh options. On top of that it's served in a strict semi-consistent way. Good luck replicating that with virtual machines without a lot of tooling and effort.

If you can learn the Kubernetes tooling it can do a lot for you. However, I agree that not all setups need it; a lot of times small setups never grow and that's OK, a few virtual machines aren't that big of a deal.

We still use virtual machines for workloads that aren't container friendly, and to be honest these days I abhor it, even with pipelines in place.


> The longer VMs live with custom changes/code and updates over time the more brittle they can become.

Honestly, Kubernetes is not harder than dealing with this. It keeps you in the land of default, googleable problems for longer, since weird tweaks and unique configs aren't piling up to create esoteric issues.


Not only have we left Kubernetes, we left Docker.

Replaced with Linux servers and SSH.

Have done a lot of work with k8s in the past. Not the right tool for my startup.


Interesting! Feel free to elaborate. What does your CI/CD/deployment pipeline look like? Do you use something like Ansible, Puppet, Chef, Salt etc?


Our CI/CD pipeline is GitHub Actions, which runs commands over SSH on our servers that execute deployment scripts: https://github.com/bugout-dev/spire/blob/main/deploy/deploy....

We use systemd to manage services.

We use Ansible to set up servers.

Our infrastructure spans AWS, Google Cloud, and servers in a datacenter.
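
For anyone curious, the shape of such a workflow is roughly this (the secret name, host, and script path are placeholders, not the actual pipeline linked above):

  name: deploy
  on:
    push:
      branches: [main]
  jobs:
    deploy:
      runs-on: ubuntu-latest
      steps:
        - name: Deploy over SSH
          env:
            SSH_KEY: ${{ secrets.DEPLOY_SSH_KEY }}   # placeholder secret name
          run: |
            echo "$SSH_KEY" > key && chmod 600 key
            ssh -i key -o StrictHostKeyChecking=accept-new deploy@app.example.com \
              'sudo /opt/app/deploy.sh'              # placeholder host and script path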


Why did you come to this conclusion and how are Linux servers a better fit?


Came to this conclusion for teams of our current size based on years of experience and experimentation (never at the expense of the business).

They are a better fit because they are much easier to manage and it's much easier to debug issues when something goes wrong. We are a small team and we hope to stay that way. But our operational responsibilities are growing significantly. The extra cognitive overhead of working with technologies like kubernetes would prevent us from scaling up our effort the way that we want to.

It's hard to answer your question in detail outside of a very long essay.


Kubernetes is overkill, but one will have to pry containers from my cold, dead hands. I will not deal with installing dependencies from the OS package manager and editing /etc files any more.

Linux namespaces are a brilliant idea.


Yeah I am not hardline anti-Docker. And anti-k8s only for small teams/companies.

With larger teams, I have written and maintained custom k8s operators in production. It was a great fit for the problems we had at that scale of developers.

A terrible fit for the problems my current team has.


Are you using at least Nomad or something?


Nope, literally SSH and bash scripts. We are fully open source (except for our security/operational code): https://github.com/bugout-dev


Went to nomad, which is working better for my workloads.

There's still use-cases where k8s wins; but nomad handles state a bit better and is easier to reason about from scratch.


I really like the look of nomad and want to give it a go. The two things holding me back are:

1) I don't really want to manage the installation but there aren't any(?) cloud hosts for Nomad that I can see.

2) It doesn't seem as widely used so community support seems thin. There aren't many blog posts about good patterns with it etc, and I'd worry that we'd get stuck and end up reverting back to k8s.


There is no installation needed with Nomad; it's a standalone binary. Just fire it up on a small Debian (or Alma) instance on EC2 or GCE and you're done. That should solve point 1.

Point 2 is debatable. Lots of people nowadays put Kubernetes on their resume but that doesn't mean they are great architects or technicians, yet a good part of running production on Kubernetes is doing it right.

You'll see much fewer people with Nomad on their resume, but on the other hand you know they're not here for the buzz, they're usually more experienced and know what they're talking about.


HCP might be what you want, but it doesn't support Nomad yet, and unfortunately it's not clear when it will. https://discuss.hashicorp.com/t/status-of-hcp-nomad/33374


Koyeb also moved off Kubernetes and went with Nomad. We started with Kubernetes, thinking it was the right abstraction layer for us to build our platform, but then quickly ran into major limitations. The big ones: as others have mentioned in this thread, its complexity; security (we wanted to explore using Firecracker on Kubernetes, but it was very experimental at that time); we were not interested in keeping up with its release cycles; global and multi-zone deployments were not as straightforward as we needed; and the overhead (10-25% of RAM) was a cost we were not willing to take (we are around 100MB with our new architecture).

We wrote about our decision to switch here: https://www.koyeb.com/blog/the-koyeb-serverless-engine-from-...


Nomad replaces parts of K8; it is not a drop-in replacement. If one only wants the container orchestration, that is fine, but then you need Consul for service discovery and so on.


I've described this as "Nomad is a container orchestrator, while Kubernetes has a container orchestrator".

Nomad, Consul and Vault interoperate extremely well and are mostly pleasant to use, but I found myself missing the rest of the ecosystem pretty quickly, especially around ingresses, and I think they made the wrong decision on the networking model compared to Kubernetes.

That said, I haven't played with Consul Connect, the Consul+Envoy service mesh, yet. That might address a lot of the problems. But fundamentally I can't help but think that Nomad and Kubernetes both made a run of it and Kubernetes came out the winner of mindshare and ecosystem.


Traefik and Nomad play very nice together if ingress is your main concern.


This is not true anymore. They released service discovery a little while ago.


Thank you, I did not know that. It seems a bit limited in comparison to Consul but it would probably work in many cases.


They're also adding in a basic secrets k/v store in the next version. Their intention (I believe) is to target small use-cases and IOT use-cases; while allowing folks to scale to gigantic levels when mixing in Consul/Vault.


>"There's still use-cases where k8s wins; but nomad handles state a bit better and is easier to reason about from scratch."

Can you elaborate on how Nomad handles state differently than K8S and what makes it better?


Yes! My startup of 5 people did. We started out with a managed Kubernetes cluster on DigitalOcean, but there were a number of reasons that caused us to not be very comfortable with that setup.

   - Taking random .yml configs from The InternetTM to install an Nginx Ingress with automatic LetsEncrypt certs felt not-exactly-great. It's no better than piping curl to bash, except the potential impact is not that your computer is dead, but the entirety of prod goes down.
   - Because of this, upgrades of Kubernetes are a pain. The DigitalOcean admin panel will complain about problems in 'our' configs, that aren't actually OUR configs. We don't know how to fix that, or if ignoring the warnings and upgrading will break our production apps.
   - Upgrades of Kubernetes itself aren't actually zero downtime, and we couldn't figure out how to do that (even after investing a significant amount of research time).  
   - We were using only a tiny subset of the functionality in Kubernetes. Specifically we wanted high-availability application servers (2+ pods in parallel) with zero-downtime deployments, connecting to a DO managed PostgreSQL instance, with a webserver that does SSL-termination in front of it.  
   - Setting up deployments from a GitLab CI/CD pipeline was pretty hard, and it turned out the functionality for managing a Kubernetes cluster from GitLab was not really done with our use case in mind (I think?).  
   - It would be bad enough if DigitalOcean shit the bed, but the biggest problem was that we couldn't reliably recognize if something was a problem caused by us, or by DO. Try explaining that one to your customers.
Summarizing: it was just too complex and fragile, even once you wrap your head around what the hell a Pod, a Deployment, an Ingress and Ingress Controller, and all of the other Kubernetes lingo actually means. I suspect you need a dedicated infra person who knows their stuff to make this work, so it could very well make sense for larger companies, but for our situation it was overkill.

We were not intellectually in control of this setup, and I do not feel comfortable running production workloads (systems used by 20k high-school students, mission-critical applications used by logistical companies) on something we couldn't quite grasp.

We went to a much simpler setup on Fly.io, and have been happy since. It's a shame they seem to be too young of a company to really be super reliable, but I suspect this is only a matter of time. In terms of feature set, it's all we need.


For context, I ran a DevOps team for the last 4 years that managed two products on AWS - one on EKS and one on ECS. I also just finished building out more or less that exact infrastructure on DO.

I can pretty confidently say, that's not K8s, that's Digital Ocean. On AWS, we ran the EKS infrastructure (which was not simple) with basically half a dev's time for years. It was only when it started to scale to millions of users that we needed to build a team to support it. It was still a much smaller team than the one that supported the ECS product (two devops).

I was mostly managing and not coding by the time Kubernetes was in our stack, so while I'm very familiar with infrastructure in general (and I know ECS inside and out, unfortunately), I hadn't used Kubernetes directly much before I built this DO infrastructure. But I got it up in a week, and though DO is a nightmare, k8s is an absolute joy as a DevOps. Holy shit it's perfect. It does exactly what it needs to, with exactly the right abstractions, with perfectly reasonable defaults.

The reality is that infrastructure work is just that complicated.

You wouldn't try to have a team of front end engineers build your rest backend. It's not reasonable to expect javascript engineers to know how to build and operate an infrastructure - at least not without dedicating themselves to learning the tooling and space full time for a while. Think of it from the perspective of a frontend engineer learning Python and Django to build out a rest backend, and then multiply the complexity by 4. That's just infrastructure regardless of what you're using.

That said, if something like Fly.io can fit your needs, that's great! I haven't used them so I can't speak to them directly, but I know that with Heroku, the trade off was cost and, eventually, being limited in what you could build. Eventually you would need to build something that just couldn't be built with Heroku. A quick glance at Fly, the pricing looks reasonable, but I'm guessing the build limits will still apply.


That's fair enough. We took a look at 'native' AWS, but there are a multitude of reasons why just dealing with AWS at all is a huge upfront time investment too if you don't hire somebody already skilled at this (complicated billing, just figuring out the product names for their various services, to name a few).

> The reality is that infrastructure work is just that complicated.

Yes, if you need the flexibility of running anything in any setup. What we really wanted was 'yeet a docker image with a web server in it + env vars at some magic beast that'll run it for me, slap an SSL-cert on it, and make sure it's always online'. We tried to replicate this with Kubernetes, so we got the full complexity of k8s unloaded upon us.

Heroku was what we really wanted, but it was always too expensive. Fly.io strikes a good balance here, the defaults are sane, it's still flexible enough for other services, and it's relatively cheap (spend is similar to DO K8s).

> You wouldn't try to have a team of front end engineers build your rest backend.

Well, yes and no. I wouldn't expect frontend engineers to know the ins and outs of everything backend, but to build on your metaphor a bit further: Setting up a basic Node backend with express serving static files shouldn't take multiple weeks, even for a frontend engineer. I feel like I was trying to do the infra equivalent of that, and it did take me forever.

> A quick glance at Fly, the pricing looks reasonable, but I'm guessing the build limits will still apply

The build limits could be an issue but really isn't for us right now. It's fairly easy to build locally though (in our case: in our GitLab CI/CD runners)


> Well, yes and no. I wouldn't expect frontend engineers to know the ins and outs of everything backend, but to build on your metaphor a bit further: Setting up a basic Node backend with express serving static files shouldn't take multiple weeks, even for a frontend engineer. I feel like I was trying to do the infra equivalent of that, and it did take me forever.

Yeah, that's just what infrastructure work is. Like I said, take that analogy, multiply the complexity by 4 (at least... really maybe multiply it by an order of magnitude).

Let me put it in perspective. I've been coding since I was 12, I taught myself C to build a MUD in middle school and high school. I had about a decade of full stack professional experience in Java, PHP, javascript and I'd done infrastructure work with EC2 and chef before. When I moved into DevOps it was overwhelming.

I've been in DevOps for 4 years. I built that equivalent DO infrastructure with Kubernetes just last week (and in a week). I started the week going "Fuck, I don't know what I'm doing." The first 3 days were just spent reading documentation. Day four was spent writing the terraform and kubernetes manifests - with a distinct feeling that none of this was going to work because I was missing several key pieces. Day 5 was spent putting a few of those pieces in place and debugging. I finally got it working late Friday night. I took on a ton of tech debt and made a bunch of compromises just to get something working. I'm not the least bit happy with what I have working and intend to totally rebuild it on AWS when it comes time to build production.

And that's with 4 solid years of doing infrastructure work full time under my belt. For someone with no infrastructure experience? I would estimate 1 - 3 months. There's just way too much to learn to think you could do it quickly and simply.

With an express backend, if you have javascript experience, you really don't have much to learn. You need to learn how http interacts with the backend, how the backend interacts with the database, and databases (SQL). That's it. Learning databases is not nothing, there's a lot that comes with it, but that's still only 2 new tools really.

With infrastructure, you need to learn networking, databases, security, container orchestration (how does high availability work? Scaling?), bash, linux, provisioning, terraform, Docker, Kubernetes manifests, monitoring, secrets handling, and more. And for a lot of these things, the solutions are far from simple or perfect. Even when done as well as can be with modern tech it feels shaky and cobbled together at the end. You're tying a dozen different tool types together to solve a dozen different problems and you have dozens of choices for each tool type.

Like I said, infrastructure is just like that. And it's important to have the right expectations going in to it.

If you can't tell, I've had this conversation with my peers who stayed in full stack a lot.


Sounds like you never understood what you had deployed. Kubernetes is complex; you need somebody with the know-how on your team.

Meanwhile, going with fly.io sounds sensible to me.


Yep! And we had to make a decision whether we would focus on our core business of developing great applications for end-users, or spend more time running infra and try to wrap our heads around the mountain of complexity that is k8s. That choice at our size is a no-brainer, although that trade-off might be very different for larger teams.


Well all of those issues are fixable, but I think it is a totally valid reason not to use k8s if you don't have a dedicated infra person/team.


Yeah, definitely! This is why I am not that harsh on Kubernetes as a tool at all, I'm just saying that it's not suitable for us for these reasons. In our context of <4 FTE of dev power it just isn't worth the manpower we have to throw at it to make it work, I'd much rather invest that time into moving our core business forward. I might see ourselves moving back to it in the future, but in the meantime we really just need a Heroku / Fly.io / DO apps / AWS ELB or so.

At my previous employer (~50 FTE of devs, 2-ish FTE dedicated to infra) Kubernetes worked perfectly fine, and I think in that context it made a lot more sense.


Kinda? We use Cloud Run because for our workloads GKE was a lot more expensive. So far, it's been great. I wouldn't say I've "left" Kubernetes, since from what I understand Cloud Run implements the Knative standard, which is itself built on Kubernetes. But much like it was predicted early on, I think Kubernetes is best used as a means of building an infrastructure platform, not an infrastructure platform in and of itself. You certainly can cobble all this stuff together and build a nice system, but it takes a lot of work, and there's probably a hosting company out there which already does something similar enough that you can adopt.

With this approach to hosting and deployment, I think Kubernetes' main advantage is that it opens the door to new kinds of infrastructure businesses, not that it makes hosting a website any easier.


+1 for Cloud Run.

I've tried many of the serverless platforms and maybe it's the types of applications I work on, but I've found most of their limitations (short runtime, limited access to resources on your private network) basically make them useless. The more self-hosted types that don't have these limitations lose out on many of the benefits or are leaky abstractions on k8s.

Cloud Run has all the benefits I want: extremely easy deployment and scaling, as well as the ability to scale to zero if you need it (though generally you don't), while still being able to run basically whatever workload I want. My current employer is mostly a Python shop but we recently deployed a little .NET core service on Cloud Run and it's been awesome.


Note that Cloud Run is not built on Kubernetes, but on Borg. It implements the Knative Serving API spec, mainly for portability reasons with Knative and Kubernetes.

Source: I'm the Cloud Run PM and we have communicated about that publicly in the past.


TIL!

Do you have any Google docs or blog posts that talk about this?

I always wondered why you need a Serverless VPC connector for "vanilla" Cloud Run (or you have to use Cloud Run on GKE) to access VPC resources, but I suppose this answers that question.


Yep! Well, kinda... I still use it at work but for any of my personal stuff at home or for my side projects I use Fedora CoreOS [1] with Butane YAML [2] which I template with Jinja2. Being able to define a VM with Butane and launch it quickly is pretty great. Nothing I am running requires the benefits that Kubernetes can bring to my workloads and the reduced complexity is a breath of fresh air.

I am slowly moving towards using Hashicorp's Nomad running on Fedora CoreOS using the Podman and QEMU drivers. I rolled out Nomad at work for internal projects and it lets me get things done quickly without living in a total YAML hellscape.

1: https://docs.fedoraproject.org/en-US/fedora-coreos/getting-s...

2: https://coreos.github.io/butane/examples/


We use Nomad from Hashicorp, it's super simple. Never liked the complexity K8s brings along.


I never liked the cost Hashicorp products bring along.


What, zero? (GP didn't say anything about using Hashicorp-managed products, they're open source and free as in beer to use. Another comment says Hashicorp's platform doesn't even offer Nomad (yet?) anyway.)


If you need features offered by the self-managed Enterprise version of a Hashicorp product, I've heard the price tag is something like low six figures per product.


Hashicorp is very inflexible about support plans -- either you go all in on their Enterprise product, or you're self supported. By the time you've licensed Nomad, Consul and Vault -- because they interact and you will find Nomad support ends where Consul support begins, and so on -- it is a LOT of money.


Using terraform without TFE is something I would never recommend to any large org. Been there, done that.


Hmm. Please could you explain further? I'm genuinely curious what costs you associate with Hashicorp products.


It depends on your size. For a fairly minimal, close-to-best-practices setup you'll need the following for each DC, each on a separate physical host (I may be missing something):

  3 x Consul server
  3 x Nomad server
  2/3 x Vault server
It's been a while since I operated k8s, but IIRC you can get similar capabilities and redundancy with 3-5 machines?

That's before you start looking at actual runner nodes, load balancers, proxies, logging and monitoring infra, etc...

Unless you cheat (which I think many do) or you're big enough, that overhead can be meaningful.


(Disclosure: Nomad team lead)

FWIW we recognized this was too much overhead for many users. Nomad 1.3 supports service discovery so you can start without Consul, and 1.4 will support secure variables to get folks farther along without requiring Vault.

So 3 Nomad servers should give you a pretty featureful and highly available cluster these days.


Yeah or, like, spin up three medium servers in different zones and have each server run all three services. We did that for a production setup for years and it worked fantastically. There's no need to have nomad/consul/vault all on different servers unless they are significantly underpowered or the workloads are crazy.

If best practices say otherwise, then maybe they should be reconsidered.


Sure, but at this point there's so much else we get from Consul that, like, what's the point...

I guess the path is set but I'd personally much prefer having a recognized deployment scenario be hosting Consul server and Nomad server on the same physical machines, and accommodating (be it through code or just docs) for making that play well with security, certs, and resource usage without becoming a confounding mess.

Even Vault, if the operator accepts and/or mitigates the sidechannel aspects - from a security perspective that still shouldn't be a step down from anything Nomad-specific?

Seeing as HC already provides solutions for all of these, supposedly in service of Nomad, doesn't it make more sense to make them play together smoothly and nicely on the same machine rather than reinventing a lesser wheel for each of them?


Entirely true, but I also think that neither k8s nor Nomad is that useful if you're not at a scale where the above is negligible? Those 9 servers cost roughly 500 USD a month on AWS.


OT: I really do not like Hashicorp. Terraform has a terrible DSL, and terrible documentation. Also, I paid $70 for their VMWare Vagrant plugin ages ago, it was so buggy as to be unusable, they were unresponsive on the Github repo, and they completely ignored my request for a refund under their own 30-day guarantee. Not very professional.

I really don't get why people love that company so much.


I would love to. But what I hate about K8s is how you can't not use it. It's like Jenkins. A total piece of shit, slow, buggy, insecure, maintenance headache, expensive to maintain, never works the way you want without a ton of work, lots of footguns, bad practice is the default. But try explaining to management how you don't want to use Jenkins and they'll just come back with "but it's free" and "everyone uses it" and "no vendor lock-in". They don't understand that they're asking you to become a Ferrari mechanic when you really need a Ford F-350 pick-up.


No and we are happily using it within our overcommitted cluster (combination of shared and dedicated nodepools).

We are a small team of 5 infrastructure engineers and previously managed 200+ libvirt VMs running on bare-metal HA hypervisors in a GlusterFS storage pool (software agency, different customer application services). We started to migrate to GKE in 2017 and finished within a year or so.

I know many associate k8s with a yaml mess, but this is actually our most favourite part of it. We are able to describe a whole customer project in this format and it's not something we have to maintain in-house (Ansible). As long as you don't try to be smart (templating/helm, operator dependence), it works out pretty well; prefer plain manifests and extend that with your own validation scripts.

Nevertheless, if you have no 24/7 operations, stay the hell away from bare-metal - go managed.


I'm particularly interested in a variant of this question.

My company has clients who usually have very simple requirements. A Python/Django app server and a database. Sometimes there will be another background service or two (memcached or equivalent etc).

The most complex site we had was the above but with some Postgres replication clients.

We use docker and docker-compose. We've used ansible in the past as well as fabric and other simple solutions.

We've had a couple of devs try and convince us that we should be using Kubernetes and I counter with "it's overkill for what we need". Am I wrong?


Nope.

You can install all of those servers in different containers, and then combine them in the same pod. For all intents and purposes from the outside, it will be a singular VM. But from the inside, you will be able to separate all those servers/tasks into separate containers, running from inside the same virtual localhost machine. They can also use the same PVC, making running a stateful app much easier. You don't even need to make it a StatefulSet.

You get a lot of benefits with this - you will be able to easily manage each different server in the containers inside the pod. Easily manage their resource constraints. Security. You can make the pod's containers not accept connections from anything that does not belong to the particular app that they belong to. K8 will manage resources in the cluster, autoscaling up and down, everything. All of the stuff that you had to maintain scripts or Ansible for in non-k8 setups will be automated.

K8 is basically an abstraction of the non-business stuff a lot of infra approaches were doing. It's containers inside VMs without you needing to manage VMs.
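
A minimal sketch of what that can look like - one pod, two containers, one shared PVC (the names, image, and the pre-existing PVC claim are placeholders, not a recommendation for any particular app):

  apiVersion: v1
  kind: Pod
  metadata:
    name: myapp                       # placeholder name
  spec:
    volumes:
      - name: shared-data
        persistentVolumeClaim:
          claimName: myapp-data       # assumes this PVC already exists
    containers:
      - name: web
        image: registry.example.com/myapp:latest   # placeholder image
        ports:
          - containerPort: 8000
        volumeMounts:
          - name: shared-data
            mountPath: /data
      - name: memcached
        image: memcached:1.6          # reachable from the web container at localhost:11211

Both containers share the pod's network namespace, so the web container talks to memcached over localhost, just like on a single VM.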


> We've had a couple of devs try and convince us that we should be using Kubernetes and I counter with "it's overkill for what we need". Am I wrong?

You're not necessarily wrong, as long as Docker Compose isn't incompatible with what you're trying to do - e.g. if you'd need overlay networking across multiple nodes, or scheduling things across them in one go, then Docker Compose might not be the best fit and you might instead be better served by looking in the direction of Nomad or even Docker Swarm, though the future there is unclear - maintenance mode project, but very similar to Docker Compose and comes out of the box with any Docker install.

Either way, Kubernetes might indeed be overkill for simple setups, unless you're using just a subset of its functionality and are running lightweight clusters, like K3s or K0s. I guess some might be pushing it because it's basically become the industry standard, at least in some capacity, in some places, or maybe people just want to put it on their CVs.


Well, "it depends". This is similar to my first production k8s deployment, where we used k8s to host a lot of PHP, some node.js, some other stuff, and due to legacy code it meant a lot of apache2 containers with mod_php.

The reason we went for that setup is that it helped us cut cloud/hw costs significantly (at the start we pretty much had two workers and that was because we ran everything with replicas=2) - each individual site had small requirements, and with k8s we could guarantee enough resources while binpacking as many of them per server as possible.

The actual deployment story can possibly get simpler than docker-compose, but I'd say the real question is whether you'd get a financial win out of it, as it seems you have a pretty good steady state going.


Back in time, k8s was a glorified docker swarm and swarm was largely compose spread over multiple servers, so if you deploy everything on a single computer and don't have requirements to care about redundancy/failover and all that, then k8s is almost certainly overkill.


I would first do a calculation of what it would cost to host it on vanilla cloud services. They are often cheaper than what people think, if you include the work hours needed.


I'm not sure I understand the distinction you're making?

I would be hosting on vanilla cloud with or without Kubernetes.


Kind of?

For my new projects nowadays, I'm pushing mainly serverless approaches using AWS Lambdas (behind API Gateways for stuff that needs to be reachable by HTTP).

I think this shifts the complexity from managing Kubernetes and its accompanying ten-thousand-yaml-files to infrastructure-as-code and the complexities of dealing with AWS. And I happen to prefer the latter, even though it's not better by any great margin.
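
As a rough illustration of what that IaC side can look like - a hypothetical AWS SAM template for one Lambda behind an API Gateway HTTP API (the function name, handler, and path are made up, not my actual setup):

  AWSTemplateFormatVersion: '2010-09-09'
  Transform: AWS::Serverless-2016-10-31
  Resources:
    ApiFunction:                        # hypothetical function name
      Type: AWS::Serverless::Function
      Properties:
        Handler: app.handler            # assumed module/handler layout
        Runtime: python3.11
        CodeUri: ./src
        MemorySize: 256
        Timeout: 15
        Events:
          GetItems:
            Type: HttpApi               # provisions the API Gateway in front
            Properties:
              Path: /items
              Method: get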

For the few things that need to be always-online, or 3rd party self-hosted apps, I'm still on Kubernetes, or pure Docker if possible.


How do you get over vendor lock-in, and the issue where it is often difficult to debug these solutions?

Also how is the cost of Lambda? I know that on Azure, Logic Apps have a really high cost (but Function Apps seem to be reasonable)


Vendor lock-in is omnipresent, even if you are using Kubernetes. The only difference is that the vendor lock-in you get when you run Kubernetes is mainly with the infrastructure layer hosting the code, instead of the code entrypoints.

Not to mention any managed services you want to use, which also will lock you in. So I don't do anything extreme to avoid vendor lock-in, other than making my code general enough to only have a small surface area for the lambda entry point. As a practical example, all of my APIs hosted on Lambdas are ordinary ASP.Net apps that would work identically if hosted in Docker containers.

Pricing so far is one of the biggest benefits of doing serverless approaches. I'm down to paying a couple of dollars per month for something that I'd pay tenfold for if doing Kubernetes. Both the monetary sum of only paying for what you're using, and also not having to worry about cluster maintenance, scaling and management is a godsend.


> infrastructure-as-code and the complexities of dealing with AWS

So cloudformation YAML? or CDK?


Not the OP, but I have had success with CDK. The main advantages for me have been discoverability with respect to resource properties, along with proper, higher-level abstractions pertaining to AWS infrastructure. https://aws.amazon.com/blogs/devops/leverage-l2-constructs-t...


Or terraform/terragrunt, Pulumi or one of the other options out there


Terraform only for me, have not experienced anything better.


Sort of.....

I went from on-metal K8s clusters, which were a complete PITA and required a full team to manage, to using EKS which has been everything K8s should be... easy peasy.



Nomad + Consul(with Consul Connect) + Vault. With Terraform obv.

We don't really have a use-case for Boundary but it looks pretty neat as well if you do.

Was on k8s for years and I don't miss it one bit.

While there definitely is some complexity once you get serious and set everything up properly with raft, federation, Connect, CAs, proxies, ACLs, proper secrets lifecycles... I find it's worth it. With the current assumptions that HC will keep improving and existing bugs and edge-cases will be ironed out.


We adopted it in 2017 and got rid of it in 2021. It introduced a lot of complexity, while still leaving a lot of issues up to us to figure out. E.g. deployment strategies.

Also: our main reason to adopt Kubernetes was to stay cloud-agnostic, but we soon realized that this is as unrealistic as writing a complex app's SQL in a vendor-independent way.

Instead, we decided to embrace our cloud (AWS) by using their CDK tooling and leveraging their features as much as possible. If we ever need to switch to another cloud we will bear the cost then, but for now it is clearly YAGNI.


I am not so sure that Kubernetes itself is the issue, as far as the technology goes. I'm personally a fan of serverless/lambda style functions, but my understanding is that many of those can run on Kubernetes under the hood.

Same goes for heroku/digital ocean app services. Even elastic beanstalk. If you are large enough that you need to manage your own k8s cluster, that is one thing, but I would encourage you to look at your needs from a usage and compute perspective long before you start solutionizing with trendy technologies.


I really wish Rancher hadn't abandoned Rancher 1.6 and moved to k8s. It was a perfect solution for a small business and bare metal.

I am trying to move to k3s, but it is just too complex to run anything and there is still the unsolved problem of exposing services to the internet.

What I want is to declare that I want this service to be under this domain and this IP - so for that you still need to configure your load balancer (bare metal) manually, set up certificates, etc. I am writing a tool to automate this, but it's been a pain.


> What I want is to declare that I want this service to be under this domain and this IP - so for that you still need to configure your load balancer (bare metal) manually, set up certificates, etc. I am writing a tool to automate this, but it's been a pain.

After initial setup you can do it quite easily.

Exposing a service on a selected domain is several lines in an Ingress and adding certificates is several more. Example: https://cert-manager.io/docs/tutorials/acme/nginx-ingress/#s...
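
Something along these lines (the hostname and Service name are placeholders, and it assumes an nginx ingress controller plus a cert-manager ClusterIssuer called letsencrypt-prod are already installed):

  apiVersion: networking.k8s.io/v1
  kind: Ingress
  metadata:
    name: myapp
    annotations:
      cert-manager.io/cluster-issuer: letsencrypt-prod   # assumed issuer name
  spec:
    ingressClassName: nginx
    tls:
      - hosts:
          - myapp.example.com
        secretName: myapp-tls          # cert-manager writes the certificate here
    rules:
      - host: myapp.example.com
        http:
          paths:
            - path: /
              pathType: Prefix
              backend:
                service:
                  name: myapp          # placeholder Service
                  port:
                    number: 80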


So this is not going to work for several reasons. One being that on bare metal you don't have a cloud provider, so there is no load balancer it can talk to. Second - it will set up a hostname and a certificate on the ingress, but there is no way to contact it from the outside world. The domain still needs an A record pointing at the server, and in the cluster that may be a local IP or a set of IPs.

What I have in mind is an external server that is not part of the cluster that bears the role of load balancer. It will contact the cluster and look for services and then set up a reverse proxy based on their declared hostname, then set up certificates and update DNS records at the DNS provider.

As far as I know something like this does not exist.

Maybe Traefik has such a capability, but their documentation is so complex I have no idea.


Actually I'm using it on bare metal and it works. Initial setup wasn't very hard but I think it could be more intuitive. Overall I think the documentation for self-hosting kubernetes is sometimes a bit incomplete.

Yes, I need to add A records with IPs for each domain, but that's a one-time setup. I did it manually, but you can automate it [1] (depends on what you use for a DNS provider, but you can extend it to support your provider or maybe there is another existing solution).

I'm not sure that one server in front of the cluster is more reliable than using all cluster nodes for load balancing. I guess that in automated solutions like [1] cluster's node could be automatically deleted from DNS if it went down.

My setup is not so big so I don't have real need for load balancing, but it seems possible with existing solutions.

[1] https://github.com/kubernetes-sigs/external-dns


Sure it does. I ran kube-vip[1] (but there are many others, e.g. MetalLB) as my cloud controller; all it needs are valid static IPs/a range/DHCP and it will assign these to LoadBalancer services (of which you usually only need one, for your ingress), and it will either ARP or use BGP to route external traffic.

As for DNS records, external-dns[2] works perfectly as long as your DNS has some way of doing automatic updates.

1. https://kube-vip.io/

2. https://github.com/kubernetes-sigs/external-dns
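
Roughly, the ingress controller's Service ends up looking something like this (the hostname is a placeholder; kube-vip or MetalLB hands out the external IP for the LoadBalancer, and external-dns picks up the annotation to create the DNS record):

  apiVersion: v1
  kind: Service
  metadata:
    name: ingress-nginx-controller     # hypothetical ingress controller Service
    annotations:
      external-dns.alpha.kubernetes.io/hostname: myapp.example.com
  spec:
    type: LoadBalancer
    selector:
      app.kubernetes.io/name: ingress-nginx   # assumed controller labels
    ports:
      - name: https
        port: 443
        targetPort: 443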


The problem with kube-vip is that it has poor documentation. I have read it many times and still don't know how I could use it. Last time I was running something that assigned IP addresses to the dedicated server's interface, I got it null-routed and the provider threatened to terminate the service because it was interfering with other clients' network. So if I see things like ARP, BGP, DHCP, it is not clear what exactly it does on the network and how that would work in the real world. I am missing an example where I have a server with a static IP from which I want to access the exposed services that are on a private network. All I really want is an automatically configured reverse proxy that will direct traffic to the appropriate services and take care of certificates and DNS.

Before the Kubernetes I used Rancher 1.6 and that was super simple. For instance I would start a wordpress container and then all I needed to do was to add a reverse proxy entry with its hostname as a backend and point where the certificates are (that was before lets encrypt).

The closest I could get was exposing a NodePort and having nginx reverse proxy to the nodes at a given port, but that seems more complex / fragile, as I need to keep track of which service uses which port and it is still manual, so I might as well just use containers without Kubernetes.


Another option is running something like haproxy ingress in external mode on dedicated vms

https://www.haproxy.com/documentation/kubernetes/latest/inst...


My opinion on Kubernetes is that it's great orchestration software that is trying to fix poor underlying application architecture issues. The biggest underlying software application architecture issue today is the idea of a single responsibility worker. Why do applications have one worker only working on messages from one queue? This architecture looks great on a whiteboard, but has issues around spikes in traffic, uses tons of unused server resources, and requires lots of custom software plumbing. The solution is a generic worker that does work from any queue. It is such an obvious fix to lots of the scaling issues that large software applications face today. I'm personally only using Temporal.io, which is a generic worker orchestrator, from now on when making large distributed applications.


I think this is due to "needing teams to be independent" which for some orgs is a real thing (Conway's law) but there are small teams trying to build microservices, which, IMO, is a bad idea.


I don't think the two are mutually exclusive


To those who have used K8s extensively...

1. Is it really so complicated?

2. Is that complexity incidental or essential?

3. Could we get away with a simpler set of abstractions for 90% of applications?


In my experience, Kubernetes can be straightforward if:

1. You read the O'Reilly book first (or another good book). There are a few unexpected abstractions (replicas, services, deployments, etc) which the book explains nicely.

2. You pay for a hosted Kubernetes. Google's is great. EKS is workable, but you may need to spend more time configuring it.

3. You don't mess with the networking system, and nothing goes horribly wrong.

Our clusters peak out at close to 400 CPUs, and Kubernetes generally does what it says it will do.

One caveat: If your app can be deployed using a "platform as a service" (Heroku, Render, etc), that's usually a better idea than Kubernetes. Kubernetes makes sense when a PaaS starts feeling too limited.


On the PaaS front, I've also found that for smaller applications / new startups the pricing between starting with k8s and starting with a PaaS is pretty similar.


Definitely. Works for FaaS too up to some extent (especially if you need hot-standby/zero-latency startup). Problem is mostly that once you scale beyond the "look it says hello world" levels the price goes up so fast you can essentially pay someone to "make it cheaper by running it elsewhere" and still be better off.

Most questions seem to revolve around a tiny part of the puzzle, or a small "just starting out" phase, and completely forget about the lifecycle of the business process that it is built for, and the existing systems it needs to interact with. Even a startup will have that problem considering most are trying to get bought, which essentially means being absorbed into a legacy company. So even starting out with no legacy to worry about is just a stay of execution.


If you are based somewhere else than in USA and without VC backing, the pricing wall hits you even faster :V


Yep, and it's even worse if you also have to account for traffic, the usual lack of GDPR compliance in the USA and even time zone issues.


1. I used to use it at an old company in the early days of K8s. We ran our own setup, as EKS and AKS didn't exist. GCP did, but we were on AWS. It really is very complicated, however, with EKS, GCP, and AKS, it makes it a lot easier. Note, for users it's a lot simpler than the alternatives. Sure it's no heroku, but it vastly makes things easier compared to running on AWS, GCP, Azure, or worse, on bare metal.

2. Essential. K8s solves a problem that's quite complex. You can't really solve it in a simple manner.

3. Probably, but that 10% will require the additional abstractions and complications anyway, and it'll be easier to manage one system rather than 2.


It's too dynamic to have universal answers.. but I'll give it a shot:

1. It's only as complicated as you make it. Kubernetes is essentially PKI (which is a must in any case), a REST API, and a scheduler. It stores some stuff somewhere, and you can add more stuff for it do have more features. I wouldn't call that complicated and it's essentially what Swarm and Mesos do as well (minus the PKI part).

2. PKI is essential. If you think that's complicated that's a whole different problem. Everything else is incidental. If adding more OpenAPIV3 schemas or REST API seem complex, again, not really a Kubernetes thing, mostly a general software development thing.

3. Yes, as 90% of applications really only exist as mediocre CRUD viewers you could run on a potato. Also, 90% of applications don't need to be as highly available or scalable as people might think. Then again, ecosystem complexity in software development combined with the lack of general knowledge (i.e. how to use an RDBMS properly) means that while the software is simple and could be run as a single statically compiled binary, it generally is a mess, requiring more messes to make it run. But since that is cheaper (less developer time spent, more cheaper developers available to do that type of work), that is where we end up.


1 - No. People try to implement old, complex, stateful apps on K8 by just slapping on some stuff. That creates problems.

2- See 1.

3 - If you can containerize your app in a simple way, then yes.

Note that a stateful app that would require attention in a bare metal server or a singular VM would still require that kind of attention on K8 as well. K8 just removes the need to manage the VM infra. And makes running your infra as code much easier.

If you need to run a stateful app in a highly available manner, you can do it in K8 and it would be good - however you will spend a similar effort maintaining the highly available services like you do in other venues. I.e., if your stateful app requires a Percona cluster and an NFS cluster off of K8, you will still need to launch and maintain those services. K8 operators make these a lot easier to launch and maintain. But it's still maintenance nonetheless.

Using managed, hosted databases can work for the database part. But they are expensive. So launching a database cluster via a K8 operator would be cheaper to maintain. NFS is a problematic thing across all platforms. So if you need it, you either launch a rook-ceph cluster to provide a shared filesystem or use a hosted service like Google File Store.


K8s does not have to be complex. If all you're doing is hosting a bunch of various web services, it is really simple. Actually, it doesn't have to be just web: you can host services that can only communicate within kubernetes, or services that monitor and manage some XYZ resource, etc., and that will all be really simple.

Even hosting redis etc.. is really straight forward.

It is funny, but the complexity starts to happen where you want kubernetes to handle other stuff: like hosting databases, or other storage resources, and if you want to, for some reason I will never understand, have your external services essentially communicate directly with kubernetes rather than have some middleware service you pay for do that for you (like a load balancer, etc.)

One thing I did have an issue with was setting up SSL... that was surprisingly stupid. Should have been much easier to do that with LetsEncrypt.


The problem is that it's often simple at first until you dive into the management of the cluster.

Then you run into a litany of issues with networking (like you mentioned SSL termination) and stateful apps or databases.

Even in this thread, someone mentioned how Redis defaults lead to a lot of issues in containers.


Do you write code in python? Is it really complicated to write a script to fetch some data, extract info, upload it someplace else? It’s not.

But then, someone is trying to fetch 50GB files and now you need to play with buffers. The script misbehaves so the API rate limits you and now you need to handle credentials, back-off, etc. The script hangs in some strange state and you need to add structured logs to figure out what is happening. Now we need to upload multiple files in parallel, are we going multi-process or multi-thread? Is python the right language? Are we going to use one pod or many?

See how it quickly gets complicated? Add to all that the fact that it’s easy to spin-up rabbitMQ with some defaults with helm locally. So you do that in production as well and when it goes down you don’t know what’s happening.

As another commentator said, there is a level of knowledge that is required to run things in production reliably.


I feel like you might be listing a bunch of edge cases that won't affect most people, or at least are as likely as a bunch of other edge cases with completely different optional solutions


k8s is actually very simple as a user. It's complicated to operate it yourself without EKS, GKE, etc. But from an end user perspective you write some declarative manifests, they get put into an event bus, and then the state of the world is reconciled with your manifests. Easy peasy.


> It's complicated to operate it yourself without EKS, GKE, etc.

... and we're now fully back to the mainframe era with the people in white coats who "run" the computer.

The cloud truly is mainframe 2.0.


> 1. Is it really so complicated?

Depends. Are you a >500 Developer org with many services? Then it's easy compared to what's out there. Anything less than that I'd say it's complex and you'd be better off using a PaaS

> 2. Is that complexity incidental or essential?

Depends. If you're going to do simple things forever then it's an overkill. But if you expect to grow in unknown ways in the future and don't want to waste your time doing bunch of migrations in the future then it's essential.

> 3. Could we get away with a simpler set of abstractions for 90% of applications?

Maybe? Heroku, AppEngine, CloudFoundry tried, but didn't go too far. Let's see what new crop of PaaS offerings are able to do


> Anything less than that I'd say it's complex and you'd be better off using a PaaS

Kubernetes is available as a managed service in AWS, Azure, Google etc and this is likely to be the most popular deployment model.

By any definition this is a PaaS, and if you add in custom monitoring, logging, security, ingress, etc., it is going to be just as simple and significantly cheaper than using a managed solution.

If you're just building a basic website then sure it's an overkill but fewer people are building those these days.


it really comes down to what you're building. many web app startups would be better served paying for PaaS that manage this for them. as an example: Netlify/Vercel. if you need a database add FaunaDB to that. if that sounds risky or expensive, consider the cost of building a DevOps team.


It’s not any more complicated than doing it other ways if you want control of the full infra stack. Advantage of k8s is you get a hardened unified API that works everywhere.

> 3. Could we get away with a simpler set of abstractions for 90% of applications?

Yeah but you can do that in k8s too, check out knative serving for example. K8s encourages the creation of higher level abstractions, with the advantage that you always have the break glass to dig into the primitives, which you don’t get with a lot of other systems.
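
For instance, a Knative Service collapses the Deployment, Service and autoscaling bits into a single object - a rough sketch, with a placeholder name and image:

  apiVersion: serving.knative.dev/v1
  kind: Service
  metadata:
    name: hello                        # placeholder
  spec:
    template:
      spec:
        containers:
          - image: registry.example.com/hello:latest   # placeholder image
            ports:
              - containerPort: 8080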


1) No, the problem is it is all or nothing. Knowing a little k8s means your stuff doesn't work.

2) necessary at scale, incidental before then.

3) yes.


imagine trying to create an infinitely scalable system that is bound by aws account limits


a) AWS account limits are very flexible if you spend enough money with them.

b) Kubernetes clusters can span multiple accounts, clouds etc.


I tried using k8s for a personal projects cluster a while back and found it very frustrating to use in a whole bunch of ways, whether managed or not. I ended up just using straight docker swarm and it works fine for that level of need, especially combined with something like portainer. Much simpler and easy to understand what's going on. Obviously it's not a very useful solution on its own beyond a certain scaling point but it probably meets most small use case needs.

But it doesn't get brought up as an option very often because docker basically FUDed themselves by having two things called swarm and then loudly killing the older one making everyone think it no longer exists.


+1 for Swarm.

It's not for hyperscalers and it's got a limited feature set compared to k8s, but it's simple enough that you can really learn how it works and how to make it do what you want even if it's only a small part of your job.

If you just want redundant services, zero-downtime upgrades, and either manual-only scaling or very restricted autoscaling, Swarm is likely sufficient.

Main downside? Not available as a managed service, at least from major providers. Then again, if you're OK with managed services, you would probably prefer either a fully-managed PaaS (Heroku, Azure Web Apps, etc.) or a managed k8s.


This is kinda the mental-cage I am in right now: For some small amount of containers (300 at most, almost all webserver-like), I would like to have some basic high availability and scheduling on a few nodes. K8S, K3S and even Nomad feel overkill, I tried all of them. Swarm on the other hand is so easy to set up and get running, it seems like the perfect solution. The only thing stopping me is the stigma of Swarm being dead, which is not even the case right now (there is still support but no new features / communication). I feel like starting with Swarm right now would be perfectly fine, but using a technology which may likely be declared officially dead in about 1-2 years just somehow feels wrong. This is my own mental-cage-issue here, right?


This is what I mean about how they FUDed themselves. There is a thing called swarm that isn't supported anymore but there's no reason to think the newer thing called swarm is gonna go away, and the only way I think it matters if it gets new "features" is if docker as a whole does. If it started collecting new features unique to swarm it'd just become another k8s.


This is my first time hearing about swarm and swarm? I always thought they killed it and brought it back zombie-style soon after. How can I distinguish between them? Like is there any way to make sure I use the new one? Is there documentation? Now you made me question reality :D


This SO thread covers it I think https://stackoverflow.com/a/40045865

The messaging around this was terrible, but it's basically that a separate product got killed and they made it a core feature with the same name at the same time.


Never used it.

Didn't pass my BS test.

I am glad that people are moving on to something that will exhaust their creative juices on something ... pointless instead of focusing on delivering value for their customers :)

More people using the brand new tech - less competition :)


My company (and sector for that matter) is typically 10 years behind the mainstream, so we're just transitioning a giant legacy monolith to kubernetes/micro-services now (literally installing Longhorn today).

To be honest, even with the technical overhead it'll probably solve a lot of problems for us from a workflow perspective. We've (the engineers) been arguing for more component-level testing for years (as opposed to the all-up E2E testing we're required to do now, which typically turns into component-level testing anyway), and containerizing everything is a good excuse to push it into reality. It'll also make deployments a lot easier (just roll back to X image if there's a problem). Right now we have tens of thousands of lines of hand-written deployment scripts that manage everything and have to be maintained, and intimate knowledge of how they work is often limited to whoever wrote them (many of whom are no longer with the company), and if there's a problem you have to do surgery on the environment. Kubernetes will give us a unified deployment architecture with problems you can google.


From my experience, making a downgrade of a single component will not be easier with K8 unless you design for it.

Also from my experience, people will start complaining as soon as the new deployment with k8 starts failing and they have to fix it.

But it's a good opportunity to make the transition to a more stable architecture.

My suggestion is to take it slow and make changes one system at a time. Start with a stateless application with a less risky deployment and, as you learn, move the others.


Why would I leave Kubernetes? It's the best thing since the microwave and sure as heck way better than Mesos.


No. It's really good.

We have about 100 devs in multiple teams. Kubernetes provides great level of standardization and transparency - completely different experience than VMs, where admin team had too much ability to cut corners and build technology debt. People would riot if they had to go back to these days.

A few warnings:

* It takes some resources. Maybe this can be mitigated with k3s or similar, but I don't have first-hand knowledge here.

* It requires some time to learn and configure properly. If your entire team is 3 people and you are on a limited budget, probably not a good idea.

* Adopt some tools (helm?) and standardize deployments where possible. Bare k8s is a bit too much for daily work.

* Read good practices and don't try to be smarter, at least until you really know what you are doing. A limits misconfiguration may really burn you at the least convenient moment.


I've been dragging my feet on implementing k8s, suspecting that its complexity would eventually be reduced by its evolution.

And then a couple weeks ago, I was tasked with standing up a new Ansible AWX server, which is now done via a k8s operator. It was an exquisitely painful experience. This is potentially a bad example because I'm pretty sure IBM's plan with AWX is now to make me suffer, but through that entire process, k8s just felt like extreme overkill.

I'm pretty sure that's going to be the last time I use k8s. I know it makes sense for some use cases, but it just doesn't feel intuitive in any way. And although it may seem more efficient, I absolutely dread having to troubleshoot any problems down the road.

I'm probably not the target audience, but thought I'd leave a comment for fun anyway.


Can you share more about your awx-operator and/or k8s pains? I was tinkering with it this weekend and I'd like to compare notes. I got the basic install via kustomize working just now, because I got stuck with the helm-based method.


Yes. I manage hosting for a sizeable online business. I thought it would be more efficient to run our containers on Kubernetes, but it’s very complex. Now I run all production workloads on FreeBSD jails managed with iocage. It works very well for our needs and with less overhead.


No, and I wouldn't, since I absolutely love it. I've put our entire build pipeline and everything into one single cluster at the moment, and been finding it incredibly straight-forward and easy to build our CI/CD pipelines using it.

Do I recommend Kubernetes to other people/companies though? Absolutely not! The learning curve is incredibly steep, and it really does take investment into understanding how it works.

But to anyone who is looking to use Kubernetes, I highly recommend https://helm.sh since it actually makes templating deployments significantly easier.
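
As a small illustration of why (this is a made-up chart excerpt, not anything from helm.sh itself): the bits that differ per environment live in values.yaml, and the manifest becomes a template.

  # templates/deployment.yaml
  apiVersion: apps/v1
  kind: Deployment
  metadata:
    name: {{ .Release.Name }}-web
  spec:
    replicas: {{ .Values.replicaCount }}
    selector:
      matchLabels:
        app: {{ .Release.Name }}-web
    template:
      metadata:
        labels:
          app: {{ .Release.Name }}-web
      spec:
        containers:
          - name: web
            image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}"

  # values.yaml
  replicaCount: 2
  image:
    repository: registry.example.com/web   # placeholder image
    tag: "1.0.0"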


My guess is most people are hitting the Trough of Disillusionment

https://en.wikipedia.org/wiki/Gartner_hype_cycle


Personally, I never joined. I have tried, I really have. I spent so long trying to move my business over, because on paper it's a developer's dream. Everything in config files. Perfection!

But in reality, I think I developed an allergic reaction to complexity and hype. I took some metrics; things like recording the time taken, steps taken and happiness generated from my current build/release stages, then comparing to k8s.

In conclusion, struggling to learn k8s forced me to find joy in the simplicity - knowing that one day (that will never come), I can just hire someone to do this... "It's only a problem when it's a problem".

For now, I have a lovely bash script that is triggered on Github releases (using Actions), which uses doctl to do the following:

1) Create a new server from my baseline image

2) Run the setup steps as defined in the Dockerfile, although it doesn't use docker (it just makes sense to keep the configuration I used to have)

3) Copy the built-and-tested version of the repository to the new server

4) Run any post deployment scripts, like database migrations, whatever

5) Move the reserved IP to the new server
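
For what it's worth, the GitHub Actions side of this can stay tiny - a rough sketch, where the workflow, script name, and secret are placeholders for whatever you actually use:

  name: deploy
  on:
    release:
      types: [published]
  jobs:
    deploy:
      runs-on: ubuntu-latest
      steps:
        - uses: actions/checkout@v3
        - uses: digitalocean/action-doctl@v2
          with:
            token: ${{ secrets.DIGITALOCEAN_ACCESS_TOKEN }}
        - name: Provision the new server and cut over
          run: ./deploy.sh "${{ github.ref_name }}"   # placeholder for the bash script doing steps 1-5 above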

It takes about a minute from me clicking "new release" in Github to seeing the changes hit production. If there's a problem, I move the reserved IP back. Load balancers, database clusters, etc... they're all set up manually because "it's only a problem when it's a problem".

Kuberneeties only ever generated problems for me.


No. It's working so well for us. I love it.


We never started because we realized how silly it was. A clustering solution is a great thesis, but nobody has "done it right" yet. K8s was over-designed from the get-go and missed what actually causes issues in scaling.

We feel the same about Docker. People have no idea what's running when they download a docker image. Stuff can be buried deep deep within an operating system image. Security should be simple, transparent, and minimal so it can be reviewed easily. Reviewing a docker image is impossible. I'm convinced the correct place for isolation is systemd. This guy wrote a great starter for hardening the crap out of your services: https://docs.arbitrary.ch/security/systemd.html Systemd offers a bridge too with nspawn if you're not ready to undertake ultra minimal hardening of services.

Scaling is a "sexy" problem to have though, and software "engineers" love to think that their SAAS product with 100 users is going to take Google scale workloads; thusly what could be done in a LAMP stack on a single DO server, is inflated into a fantasy that will never come to fruition.


> People have no idea what's running when they download a docker image.

This is true of software packages and especially of third-party libraries; supply chain attacks are supply chain attacks. But similarly, supply chain controls are supply chain controls, and using Docker does not mean running someone else's container.

(For example, we build our own hardened base images, and on those we install our own services, and the result is precisely as trusted as building our own hardened AMI and installing our services on that.)


I only ever see Kubernetes mentioned on HN in two contexts: "Kubernetes was ruining my business" and "Kubernetes saved my business".


Well, "Kubernetes didn't really have much effect on my business" doesn't really make for an interesting or memorable post.


Not really. It is working well for us, but it wasn't as easy to begin with. Storage, Networking & Debugging are the biggest challenges.


My current company uses it and that is not a bad choice. I prefer nomad which I have used since 2016.

I have spent the past couple weeks working with kustomize since I do not like helm and while it gets the job done I think Tanka would be better.

We are on GKE which makes things a lot easier and I personally would not choose to run my own cluster.

(disclaimer: i worked at hashi for 4 months in 2020 but not related to nomad)


I switched jobs recently and became the de facto DevOps person so have been able to deploy mostly how I want. I’ve used kubernetes at multiple jobs, side projects and at home but for a cost and time constrained startup we are leveraging ECS/Lambda/Batch/Cloudfront. B2C application, mostly low traffic with nearly no traffic off hours. Occasionally we’ll get a big rush, 2 to 3 orders of magnitude more traffic than usual, from a marketing push and haven’t run into any issues yet.

I still run KEDA at home for managing plex, home assistant, some game servers and other of my own projects. But being the only one who is using the cluster is a different use case than getting RBAC, ingress and management set up correctly for a production cluster IMO. I’ve never had the sole responsibility or permission over a cluster before, so it was a daunting step I decided not to take for my own sake


The second question is my question as well. Because K8s has failed to deliver as a true "platform". i.e Application teams still have to care about infrastructure. That statement is true even for the "managed K8s" by cloud providers. But what's the alternative? We are stuck.


Fargate


Solo creator. Never went there. Don't miss anything. :-)


We switched to it in the last year. It's been good for us, but it really depends on your use case. We deploy lots of different applications with different scaling requirements, so it's a great fit. If you're deploying one app without any unusual deployment requirements, it's probably overkill. Certainly there is a learning curve, but once you've got it down it's easy to throw new things into the mix.

People talk about it being incredibly complex, and honestly I don't see it. Yeah there's a layer of jargon you have to dive into, but it all makes sense once you start building something with it. By far the most complex pieces for us are the integration points with AWS (we're using EKS.) The examples/docs available are just not that great.


Our story involves moving onto k8s, then moving off it.

We run most of our app on Google App Engine explicitly to avoid devops work. However, we have a stateless-but-memory-hungry image manipulation service that was just too expensive on GAE. We migrated that service to k8s on Digital Ocean.

It was a disaster. I mean, it worked, but suddenly we were spending a lot of time learning k8s and fussing with k8s and it slowed down feature development. K8s is a time sink. So we migrated the service to Digital Ocean App Platform and velocity returned to normal.

I'm not wholly thrilled with DO App Platform. It has some maturity issues, and while it's cheaper than GAE, RAM is still more expensive than Elastic Beanstalk (which charges you more or less the EC2 VM cost). So we'll probably move it there someday.


If RAM cost is an issue, why not rent dedicated servers? You can rent a dedicated server with 256gb RAM for less than $400/mo from various low cost providers such as Hetzner and OVH.


We don't need anywhere near that much RAM per instance, and we have somewhat bursty traffic. Cloud is a decent fit. I'm willing to pay for the convenience, but that doesn't mean I won't cost-optimize.


No. In my current company k8s + helm + istio + argocd + good support from the SRE/infra team has made things pleasurable. Seniors introduce the complexities of the system to juniors in a controlled, paced way. One thing I would change is to replace terraform with crossplane.


What I see as a consultant (and I have been with k8s since v1.2) is that companies try to get from 0 to 100 and then wonder why they fail. So you are going from Java 1.7 on JBoss with a tightly coupled monolith to microservices on k8s with Docker. All the nice things k8s can provide go hand in hand with the ability to work together: cert provisioning, network infra, storage, DB offerings, etc. There is so much k8s needs to succeed, and all the teams have to work hand in hand, which is what companies brutally underestimate.

I would stick, no matter the company size, with IaaS + $Deploymenttool (Ansible or so) and Docker, get comfortable with that, and only then, when everything works as intended, make the switch to k8s.


we have used k8s for about 4 years and are now slowly moving back from k8s to fargate

creating a scalable system is complicated within aws account limits

all we really want is to shove docker containers behind a load balancer and not worry about having to manage yet another system


I'm curious what AWS "account limits" you ran into. I've very rarely come across a quota/limit that wasn't increasable upon request in AWS.


Fargate runs with EKS or ECS. Does this mean you dropped EKS for ECS?


Fargate just sort of registered as a “hosted Kubernetes” in my mind, guess not.


It can be "hosted serverless Kubernetes" (in the EKS flavour), or "hosted serverless Amazon ECS" (in that flavour).


Yes, the last two companies I've worked for migrated to AWS Fargate and Serverless/Lambda. There are some advantages to using k8s when you have large stateless applications that need to scale, but it requires well thought out patterns for things like caching and involvement with the dev team from the beginning. Most small to medium size companies get no real benefit from Kubernetes as it introduces a lot of devops overhead and rot (examples: the companies that chose to use Skaffold instead of Helm, or customized deploy scripts that don't make sense to developers and aren't integrated into a CI/CD pipeline)


I use remote Docker contexts with my home cloud now instead of Kubernetes. I'm actually looking at moving to Docker Swarm. As a single dev mostly satisfying my own needs, it's pretty much all I need. Happy to answer questions.


I've also looked at docker swarm mode, but as a noob in these things, it's the persistent database that makes me nervous - how do you do it? And do you have any tips or tricks?


I keep it simple by scheduling services to particular nodes: https://docs.docker.com/engine/swarm/services/#control-servi...
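
For example, in a stack file the database service can be pinned to a particular node (the hostname, image, and volume are placeholders):

  version: "3.8"
  services:
    db:
      image: postgres:14
      volumes:
        - db-data:/var/lib/postgresql/data   # local volume on the pinned node
      deploy:
        replicas: 1
        placement:
          constraints:
            - node.hostname == db-node-1     # placeholder hostname
  volumes:
    db-data: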


I hacked together https://github.com/piku and run all my personal projects using it. Haven’t looked back in years, although I do use k8s at work.


Approaching the 1 year mark of running my team's various data pipelines on k8s. Keeping up to date with k8s/EKS version lifecycles has been more work than expected. No plans to stop using it anytime soon.


Luckily I never picked up K8s; I found Docker Swarm simple and easy to use.


It's a little sad about Docker Swarm, because up to a certain critical mass of nodes Docker Swarm is actually quite nice, with lower overhead and complexity. The problem is that Docker Swarm seems to be a dead project now.



In my lab at home I ditched everything, and I now run k8s on my servers with everything managed using ArgoCD and GitOps; the best config I've had by far.

At work, we're currently trying to migrate our stack to k8s. Why? Because our startup is getting bigger and bigger, our current platform sucks, and our products are becoming a lot more complex as time goes on. We benched a few platforms and landed on EKS + ArgoCD + Vault. Works really well.
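For anyone wondering what the GitOps part boils down to in practice: the core of it is an Argo CD Application that points the cluster at a Git repo and keeps it synced. A minimal sketch (the repo URL, path and names are made up):

    apiVersion: argoproj.io/v1alpha1
    kind: Application
    metadata:
      name: homelab-apps
      namespace: argocd
    spec:
      project: default
      source:
        repoURL: https://github.com/example/homelab-gitops   # illustrative repo
        targetRevision: main
        path: apps
      destination:
        server: https://kubernetes.default.svc
        namespace: default
      syncPolicy:
        automated:
          prune: true      # delete resources that were removed from Git
          selfHeal: true   # revert manual drift back to what Git says

Everything else is just committing manifests to that repo.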


Still going strong in client work at big companies.

For personal projects I roll with just Docker.

I'm bothered by the minimum requirements of k8s; I want to deploy on $5 machines.


Take a look at k3s[0] for lighter k8s.

[0] https://rancher.com/docs/k3s/latest/en/


K3s is awesome!

I used it to host several small projects on cheap virtual machines. The setup is very straightforward. I guess we just need better editing support for YAML.

Thanks for the recommendation!


do you run docker swarm or anything?

or just containers on the virtual machine?

I would love to deploy with docker and no other orchestration tools.


We run a few largish Rails apps. Main Rails containers, nginx, Sidekiq, and almost everything else in ECS. RDS for Postgres. ElastiCache for Redis. OpenSearch for ES. We tried moving to k8s but found the current setup simpler. Our Terraform state is a huge blob though, and if we started from scratch I think we would have defaulted to EKS for more things.


Story of one of the projects I am involved in:

We came from Ansible-managed deployments of vanilla Docker, with nginx as a single-node ingress and another load balancer on top of that.

Worked fine, but HA for containers that are only allowed to exist once in the stack was one thing that caused us headaches.

Then we had a workshop on Rancher RKE. It looked promising at the start, but operating it became a headache as we didn't have enough people on the project team to maintain it. Expiring certificates were an issue, and the fact that you actually kinda had to baby-sit the cluster was a turn-off.

We killed the switch to Kubernetes and moved back to Ansible + nginx + Docker.

In the meantime we were toying around with Docker Swarm for smaller-scale deployments and in-house infrastructure. We didn't find anything not to like and are currently moving in that direction.

How we do things in Swarm:

1. Monitoring using an updated Swarmprom stack (https://github.com/neuroforgede/swarmsible/tree/master/envir...)

2. Graphical Insights into the Cluster / Debugging -> Portainer

3. Ingress: Traefik together with tecnativa/docker-socket-proxy, so that Traefik does not have to run on the managers (see the compose sketch after this list)

4. Container Autoscaling: did not need it yet for our internal installations as well as our customer deployments on bare metal, but we would go for a solution based on prometheus metrics, similar to https://github.com/UnclePhil/ascaler

5. Hardware Autoscaling: we would build a custom script for this, based on Prometheus, that automatically orders servers from Hetzner using their hcloud CLI

6. Volumes: Hetzner Cloud Plugin, see https://github.com/costela/docker-volume-hetzner - Looking forward to CSI support though.

7. Load Balancer + SSL: in front of the Swarm using our Cloud Provider
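Here's a minimal sketch of the ingress pattern from point 3, assuming Traefik v2; the service names, network, and exact proxy permissions are illustrative rather than our literal config:

    version: "3.8"
    services:
      socket-proxy:
        image: tecnativa/docker-socket-proxy
        environment:
          CONTAINERS: 1   # read-only access to what Traefik needs for discovery
          SERVICES: 1
          TASKS: 1
          NETWORKS: 1
        volumes:
          - /var/run/docker.sock:/var/run/docker.sock
        networks:
          - traefik-net
        deploy:
          placement:
            constraints:
              - node.role == manager   # only the proxy touches a manager's socket
      traefik:
        image: traefik:v2.9
        command:
          - --providers.docker.swarmMode=true
          - --providers.docker.endpoint=tcp://socket-proxy:2375
          - --entrypoints.web.address=:80
        ports:
          - "80:80"
        networks:
          - traefik-net
        deploy:
          placement:
            constraints:
              - node.role == worker    # Traefik itself stays off the managers
    networks:
      traefik-net:
        driver: overlay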

Reasons that we would dabble in k8s again:

1. A lot of projects are k8s only (see OpenFaaS for example)

2. Finer-grained control over user permissions

3. Service mesh to introduce service accounts without having to go through a custom proxy


I went from Kubernetes to Ansible to Nix for my personal installation. Kubernetes was too complicated, and Ansible too brittle.


Never started. My work infrastructure is Elastic Beanstalk and my personal infra is either hand-managed containers or Dokku. Previous gig maintained an internal abstraction on top of Kubernetes but it wasn't something I ever had to mess with.


No. I do feel like k8s will be superseded by vendor-specific offerings though. Even with a CKA, I think there's just too much overhead for what is fundamentally (usually) compute at scale.


No. Nor do we plan to.

Honestly - I even use it personally for my self-hosted stuff at this point. The learning curve is... steep. But once you come out the other side, it's a great tool.


Yes, Kube is a mistake for many. I'm a consultant who specialises in Kube, but I actually spend most of my time decommissioning it and migrating to other solutions.


- Terraform to spin up VMs in the cloud (e.g., "give me an Ubuntu machine with 4GB of RAM")

- Ansible to provision such VMs

- Docker to start/stop containers on such VMs

It feels like a breath of fresh air!
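If it helps anyone picture the Docker step, here's a hedged sketch of the Ansible side (the host group, image and ports are invented; assumes the community.docker collection is installed):

    # provision.yml -- run one app container on each provisioned VM
    - hosts: app_vms
      become: true
      tasks:
        - name: Ensure Docker is installed
          ansible.builtin.apt:
            name: docker.io
            state: present
            update_cache: true

        - name: Start (or restart) the app container
          community.docker.docker_container:
            name: myapp
            image: registry.example.com/myapp:latest   # illustrative image
            state: started
            restart_policy: unless-stopped
            published_ports:
              - "80:8080"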



If you are using AWS, you don't even really need Kubernetes there. It goes without saying that you can handle pretty much any task and any load with AWS.


Good luck managing a labyrinth of virtual machines in a cattle-friendly way, like a Kubernetes cluster can, without a bucket of other tooling to invest in.


Depends on your workload. AWS is pretty slow and constrained on ECS-on-EC2 and Fargate. App Runner is even more limited and more expensive (to the point where we could hire a full-time, 6-FTE team to run on-prem K8s instead).

AWS has EKS for a reason.


Yes, after using it for 5 or so years.

Now building fully with serverless.


Do you have your full server code run in a cloud function, and let the function handle routing based on inputs? Or do you have a function per behavior?


I switched back to Docker Swarm managed by Portainer. So much easier, and all I really needed is one node taking over when another dies.


I haven't tried it yet. The docs look promising, but if I can't run it on bare metal with officially supported docs I won't bother.


a lot of these "just use bare VMs" arguments don't address how you do service discovery, load balancing, networking, rolling deployments, etc. A lot of the benefit my team gets from K8s is from these abstractions, not anything to do with "scale" per se


Sure, we are replacing Kops with EKS now.


Managing k8s requires a strong DevOps team. A strong DevOps engineer is a solid software engineer plus a specialty. These people are rare and expensive and extremely hard to attract to startups.

Therefore you either can’t find anyone or more likely you hire less good DevOps engineers.

The solution is to not use k8s as a startup. The less a DevOps engineer can shoot themselves in the foot the better.


We migrated all Erlang, Golang and Node.js code to edge workers on Cloudflare and simply did not need to run containers anymore, so there was also no need for k8s. I would say this reduced the operational complexity by two orders of magnitude.


Do you mean that you got erlang and golang code running in Wasm under Workers, or did you convert the logic to JavaScript?


No, we rewrote all the Golang and Erlang code in JavaScript. The Erlang and Golang runtimes and concurrency primitives do not really matter for most use cases when you have unlimited edge functions running wherever users are. On top of that, hiring is just so much simpler when you have a single language everywhere.


In our case we integrated k8s into our dev pipeline, and all devs need to do to release an app is merge a PR to the main branch.

It all scales and works fine. There are one or two problems around, but not enough for me to consider that it doesn't work.


Kubernetes is horribly complex.


Kubernetes is not really complex for what it tries to achieve, but there might be some scenarios where you need a more tailored tool for the job.

Would I use k8s just for static websites or a single API? No. Would I use k8s for a rarely updated solution where low cost is the #1 priority? No. Would I use k8s for a complex microservice architecture with a long list of ever-growing implicit/explicit requirements and a lot of moving parts? Definitely, because then you just need to either use some built-in k8s feature and/or reach into the CNCF ecosystem to supply almost anything you need.

Kubernetes gives you standardization; it's good in the enterprise, where high complexity and poor communication are normal. It covers a lot of typical application requirements, and you can learn a lot about a solution just by looking at its k8s cluster. However, it's a time sink if you really want to learn more about k8s and the CNCF ecosystem.


Can you be more specific?


It's often very difficult for developers to test applications in development because of the complexity of setting up `minikube` or `k3s`, so if your small to medium sized company doesn't have a dedicated devops or QA team, it can be too much overhead. I've worked with talented engineers who struggle getting up to speed with networking in k8s because there is a litany of tools, best practices, and terms to learn (NodePort, LoadBalancer, and various ingress controllers), and it's heavily dependent on which cloud provider you're using (e.g. GKE, AWS, or Azure), so that adds weeks to dev time.

This is a major problem if the team isn't well versed in devops tools (which is often the case at smaller companies) and can lead to lots of issues, pushing a lot of work onto a devops resource (or team), which in turn requires setting up a separate dev/staging cluster.

I think the preferred alternative for companies that struggle with this overhead is to use a slightly more expensive managed service, especially if you're just developing a typical MVC/MV* app.


Not definitively; it was more of a "see you later". The company didn't have enough resources (people) to maintain the cluster, and we decided to use ECS while we were still maturing as a team.


Too soon. Everyone is still milking it. Ask again in 10 years time.


Hell no. I'm a big convert. It solves a !lot! of problems.


WellYesButNo.gif

The two philosophies at megacorp here seem to be "I built it from the ground up to target X service" where X service is usually amazon serverless or something, and "I built it in docker containers but I don't know about the cloud".

The former is a conscious decision, and we (Architecture) have a serious, sit-down discussion with them about what it actually means to be fully cloud native for that particular service. This discussion ranges from cost analysis, to things like "is your application actually built correctly to do this", to "you're not going to have access to on-prem resources if you do this", to even asking them simply "why".

A lot of the time, when the teams realize they're going to be on the hook for the cost alone, they back out; and a lot of teams try to do it because "we don't understand K8s". Well, it doesn't get much better in Cloud Run either, folks, because you're trading K8s YAML for Terraform or CloudFormation.

Where it has been successful is for teams which own APIs which only get called once a month, or very low traffic APIs. I hate to say it boils down to cost, but a lot of the time it really does boil down to cost.

Additionally we've seen a weird boomerang effect as clouds offer K8s clusters which are simply priced per pod rather than per worker node (like GKE Autopilot). A lot of teams which straddled the middle of "low traffic but not low enough to really migrate" have found they're quite happy in GKE Autopilot. They use autoscalers to provide surge protection, but they just use Autopilot with 1 or 2 pods running and it keeps the costs down. That also means we can migrate them to beefier clusters in a heartbeat if they get the Hug of Death or something from HN. ;)
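The "couple of pods plus an autoscaler for surge protection" setup is basically just a HorizontalPodAutoscaler with a low floor; a hedged sketch (the target name and thresholds are made up):

    apiVersion: autoscaling/v2
    kind: HorizontalPodAutoscaler
    metadata:
      name: low-traffic-api
    spec:
      scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: low-traffic-api
      minReplicas: 2       # the cheap steady state on Autopilot
      maxReplicas: 20      # headroom for a sudden spike
      metrics:
        - type: Resource
          resource:
            name: cpu
            target:
              type: Utilization
              averageUtilization: 70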

The second use case I discussed gets railroaded into the K8s clusters we built ourselves, because we can typically get those teams to use our templates, which provide ingresses and service meshes so the developers don't have to think about it too much, and the DevOps team is comfortable with the technologies. While it means that there's a bit of "rubber stamping" and potential waste, it's allowed us to use K8s and the nice features it provides without having to invest too much in thinking about it for an individual application.


At work: no.

Personal: I use docker-compose on VMs.


Any love for LXC here?


LXC is nice, but different scope.


Okay, good point.


Nope, moving to Kubernetes and happily staying there.

For context, I ran a DevOps team for the last 4 years that managed two products on AWS - one on EKS and one on ECS. I was mostly managing by the time we had k8s in our stack, so I didn't get to interact with it much directly, but I know infrastructure generally (and I know ECS inside and out, unfortunately). For that infrastructure, we had a whole team managing the ECS deployment. We managed the EKS infrastructure with the equivalent of one DevOp's time or less for years. It was only when it started scaling to millions of users that we needed to give it more time and attention. Both infrastructures (ECS and EKS) were pretty complex with multiple services that needed their own configuration and handling.

I left that company a few months back to try to build my own thing and I just finished building out the alpha infrastructure for it on Kubernetes. I can now safely say, as an infrastructure engineer, Kubernetes is an absolute joy to work with compared to lesser abstractions. At least, when someone else is managing the control plane for you. It has exactly the right abstractions, with the right defaults, and it behaves basically exactly as it should.

Yes, it's complicated. Yes, there are a lot of moving pieces. Yes, there are hard problems. That's just the reality of software infrastructure. That's not kubernetes, those are just the problems of infrastructure. There's a whole set of problems kubernetes is working to solve in addition to those. Remove kubernetes and you still have those problems, but then you also have the whole set of problems Kubernetes solves as well.

I think what's really happening with this whole "Kubernetes is too complicated" thing is that a lot of teams expect to be able to use it like Heroku. That's not what it is. Or they try to build out infrastructures with javascript/php/python/etc engineers. You wouldn't try to have a team of front-end engineers build your REST backend. It's not reasonable to expect JavaScript engineers to know how to build and operate an infrastructure - at least not without dedicating themselves to learning the tooling and space full time for a while. Think of it from the perspective of a frontend engineer learning Python and Django to build out a REST backend, and then multiply the complexity by 4. That's just infrastructure, regardless of what you're using.

If you just need to run and scale a container fast and simple, with maybe a single database - then sure the PaaS providers might fit your needs for a while. But eventually the trade off is going to be cost and limitations. You'll eventually need a piece of infrastructure they don't provide.

TL;DR Kubernetes isn't the problem here. Infrastructure work is just plain complicated. If you want multiple services, high availability, reliability, scalability, security, and performance, it's just complicated and hard. Don't short change it. Dedicate someone to learning it or hire someone who knows it.


This is what people tend to forget: it's generally the combination of requirements and the reality of technology that is "Complicated". Not some individual tool or scheduler.

It's like asking people if they have moved from VLANs to duplicated flat networks because that is "better". You're just trading one complexity for another.



