Launch HN: Runops (YC W21) – A better cloud shell for production apps

0xbadcafebee · on March 8, 2021

What you've basically created is automated change control, but lacking some change control features. You might want to add a set of features specific to managing change control, because otherwise I'll have to build a change management system around this.

> You run an AWS CLI command in the terminal and it goes to Runops instead of AWS

Most enterprises are wary of handing over control to a vendor, especially if it's the underpinning of all operations in the company. I suggest a self-hosted/Enterprise release. After a few years of trying to make it work, the Enterprise will gladly pony up more money for a hosted cloud solution, but the self-hosted will get you in more doors.

> We do everything using Lisp. The CLI uses Clojurescript; the REST API uses Clojure.

Do you expect regular people to be able to contribute to or modify this? Do you find a lot of Lisp/Clojure devs out there for when you need to expand?

> Painless audit trails: No need complex for ETL to connect trails from Cloud Trail, Database Audit Logs, Kubernetes audit, etc.

You still have to audit those things. If a hacker gets in to your infrastructure, you have to know what they did.

andriosr · on March 8, 2021

I agree with many of these points, here are some thoughts on how we deal them:

> Most enterprises are wary of handing over control to a vendor

Great point, we do have the Enterprise version, which is self-hosted.

> Do you find a lot of Lisp/Clojure devs out there for when you need to expand?

We won't hire engineers based on the language they know, but instead in general engineering skills, and they can learn Clojure here (already worked for the first one:)

> You still have to audit those things

Yes we do, but mostly to trigger alerts if anything happens there and to show that the accesses are either from Runops or the applications during audits. This is way lighter than relying on these as the source of truth for trails.

I'm curious about the Change Management features you think are missing. We do have review workflows and other CM-related features I didn't add here, this demo shows some of it: https://see.runops.io/videos/demo

0xbadcafebee · on March 9, 2021

That's great, I didn't know about the Enterprise version. I'll look closer at this and see if it fits our use cases. (fyi, the more docs we can read [esp. operational docs, but also technical], the more likely we are to push for adopting this in our enterprise)

Here's some of the things a typical IT CM process handles: How do you properly surface and acknowledge the risk & impact of a change? What are the order of operations & how do you track them? How do you validate an operation? Roll back? Are there governance structures in place (X team can run Y task in Z environment)? With multiple stakeholders, how do you get approval from everyone, or handle change overrides? How do you run 'almost' the same thing in different environments? Do you use multiple communication/coordination methods, like e-mail, MS Teams, Slack w/multiple workspaces/orgs, Zoom, Jira, ServiceNow, etc? Is there a "change plan" which is drafted, edited, published, approved, and executed, in coordination with multiple teams?

All that may be overkill for your tool, but it's some of the stuff I would need to build around it to use it in my enterprise. Otherwise I (the ops guy) would need to build & run all those steps and coordinate with others when it's time for them to perform their steps (release, validation, debugging, rollback, etc).

andriosr · on March 9, 2021

This is great, thanks for sharing! We handle most of these in Runops itself, but you can also use your existing CM tool and leverage only the UX and automations from Runops. We have APIs and webhooks that enables you to extend Runops, one company is doing this and integrating Runops to ServiceNow.

jeremyis · on March 8, 2021

I haven't personally been on an infra team but I've seen Infra / Dev tools teams being overwhelmed with requests. This seems like a really helpful and elegant solution!

andriosr · on March 8, 2021

Curiously I started in the dev team and migrated to infra in an attempt to fix things :)

ystad · on March 8, 2021

The information on https://runops.io/ is light, does not have information on examples, workflow etc.

what is the setup like (is it cloud hosted or hosted by onself in a cloud). Is your code open source? how is authn/authz if I want to use this?

andriosr · on March 8, 2021

Yes, we have a lot of work to do on our landing page to better explain these points. It's early days, but we will get there! Here is some light on them:

It's cloud hosted, and we do support self-hosting for enterprises. The code is not open-source. We support Okta, Google, and other OAuth providers for Authentication. For Authorization we have the concept of Targets, which are abstractions of your cloud resources to users/developers. Say you have a Mysql database, you can create a read-only Target and let everyone use it, and create a second Target for the same database with write access. In the second Target you require reviews from tech leads, or let selected groups run queries.

ystad · on March 8, 2021

Thanks!.

how does your service compare to services such as teleport

https://github.com/gravitational/teleport

andriosr · on March 8, 2021

Teleport is a fantastic tool. The main difference are: 1) Runops doesn't require you to have tools (kubectl, psql, etc) installed locally and don't download temporary credentials to access resources, commands execute in the Cloud. 2) Runops has synchronous reviews workflows on the command/intent level, again as opposed to getting an open session for a period of time. 2) We automatically remove sensitive data from the results of every command. 4) Runops uses Git as the source of truth for the audit trails.

tyingq · on March 8, 2021

I'm curious if it's visually obvious that commands are running in a non-dev environment. Saving people from the scenario where they walk away for a coffee, return, then accidentally start typing into the wrong terminal window.

andriosr · on March 8, 2021

I've done that and can relate to the problem! It's common for Kubernetes, where you never know which cluster kubectl is pointing to. The Target (what we can where you are running things), is one of the options in the CLI. So you have to at least provide: the Target and the script to run a command. This way you always know where you are running things, it's something like this:

runops tasks create --target mysql-demo --script 'select * from dundermifflin.customers;

tyingq · on March 8, 2021

Ah, great, thanks. Some of the wording made it sound like perhaps it was hooked transparently. This appears very clear.

cpressland · on March 8, 2021

We’re an Azure/M365 house, but some of the things this tool explicitly solves were mentioned as areas of improvement for us during PCI assessment recently. I’ll be keeping an eye on this. Great work so far!

andriosr · on March 8, 2021

Glad to hear we could help in the future, feel free to reach out any time. We would love to hear more about your use cases and the alternatives you guys have in mind to improve the PCI assessment results. We support Azure :)

thisisxavier · on March 8, 2021

How did you acquire your first customers?

andriosr · on March 8, 2021

It was a combination of multiple things. The first customer came from the newsletter I run called SRE Teams (https://sreteams.substack.com). Others came from intros from my network and from reaching out to people I thought we could help. When I was running the DevOps team at Pismo we used to organize meetups and knowledge sharing sessions with other companies having similar problems, this also helped.

candiddevmike · on March 8, 2021

So hosted PAM, but I don't see any compliance certifications on your website? How do you have fintech customers today using it (or really, anyone using it)? Why would anyone trust you guys to proxy access to their environments?

andriosr · on March 8, 2021

Yes, I like your definition. We don't have certifications yet, but the team has done the biggest ones before we are keeping everything ready to get them. We should start the processes in the next couple of months. That being said, not all certifications require all software you use to also have the certifications. I understand this is critical for PCI, where anything with access to the data is also scope, but for SOC2 this is not the case. Most of our customers today are fintech, we are very transparent about our architecture and how we do things with our customers, that is where the trust come from. We the best solutions available to deal with things like storing credentials and sensitive data. That being said, you can always opt for the self-hosted enterprise version.

meow112012 · on March 9, 2021

That looks promising. Make me think of my favorite tool rundecks...

andriosr · on March 9, 2021

Yes, Rundeck is nice, but has its downsides. We have a lot of companies migrating from it. Runops is the perfect alternative :)

1vuio0pswjnm7 · on March 9, 2021

Could one refer to this as a so-called "API Gateway".

andriosr · on March 9, 2021

This is an interesting way to put it. Yes, you could say it's an API Gateway for Cloud tools. I like it!