Kernel-WASM: Sandboxed kernel mode WebAssembly runtime for Linux (github.com/wasmerio)
103 points by nateb2022 on Jan 21, 2023 | 83 comments



It's amazing to see https://www.destroyallsoftware.com/talks/the-birth-and-death... becoming more and more true with every year.


Wasn't it kind of obvious that a virtualized hardware layer is to be preferred over heterogeneous hardware?

Granted, it is surprising that browsers are increasingly taking this role rather than operating systems. But this module runs in the kernel.


Not until hardware became fast enough and compilers performant enough to support this, and not until everyone had to learn their lessons with approaches like the JVM.


> Not until hardware became fast enough and compilers performant enough to support this

Yes, but the easy part is to think about what would be possible if hardware were fast enough. Not sure what you mean by lessons learned from the JVM.


I meant that automatic garbage collection is not always the way to go for some people, especially in the days when JVM GCs were not as advanced as they are now.


Ok. Yes, ideally you'd allow the user of your VM to implement their own GC. But to make this fast you typically need to support special memory barrier instructions, like a modern CPU does.

Anyway, the GC-included approach is of course the one taken by Javascript and JVM.


I assume that's one of the parts of the work done at https://github.com/WebAssembly/gc - not happening any time soon, but it'll eventually be done.


I doubt it will ever happen; everyone will keep shipping their runtimes inside their WASM blobs and that will be it.

Instead of adopting PNaCL in 2011, here we are a decade later still trying to figure out how WASM will be able to support all computing models.


System/360, Burroughs B5000, Xerox Alto, Xerox Star, Xerox Dandelion, AS/400, Lilith, Ceres, ...

Just a couple of examples of platforms using bytecode as their main executable format.


> Wasn't it kind of obvious that a virtualized hardware layer is to be preferred over heterogeneous hardware?

We've been trying for decades, and it only kind of caught on with Java, so no it's not really obvious that it's preferable.


In the mainstream it already caught on with UCSD Pascal and VB (VB only compiled to native in VB 6.0, and P-Code was still an option), and in the server room it has for a few decades, with IBM and Unisys mainframes being the survivors of that approach.


Hum... Yes, but why did you jump to runtime virtualization instead of compile time? That jump is not obvious at all.


Yes, it is sad. Because nobody wants a nuclear war and 40 million refugees off the coast of Australia.


Yes. Probably the most scary thing is that this part is becoming more and more true as well.


He pronounces JavaScript as YavaScript.

Is this a reference to another old language whose name we mispronounce today? Or just a joke to say that by 2035 JavaScript will be so irrelevant that we won't even know how its name was pronounced?


I think it's a joke because I think I remember him pronouncing it differently in another talk.


It's very likely an accent inherited from his first language.


WebAssembly isn’t JavaScript nor does it require JavaScript.


The direct inspiration for WebAssembly was asm.js, which is what the talk is about: a future where everything targets asm.js and vendors never take the next step of creating wasm.

The talk predicts a lot of what is happening around wasm.


Only because Mozilla refused to adopt PNaCL.

The irony is that, with Firefox at 3% market share, if this were happening today in the age of the Chrome-based Web, Google would have been able to push it through no matter what.


Why would PNaCL have been better than WASM?


It was there first, instead of delaying progress for a decade and still not moving beyond MVP 1.0 as the baseline.


PNaCL is native.


You're confusing PNaCL (based on LLVM bitcode) with NaCL.


What was the rationale for opposition?



Looks like Google learned EEE from Microsoft quite well.


Chrome OS is what IE wanted to become.


Thanks, that seems reasonable


Your response is technically correct but misses the entire point of the video.

To give a tl;dr: the video describes a technology called asm.js which other applications end up being compiled to, so they can run at near-native speed on any system that supports asm.js. It becomes a compilation target, meaning that JS eventually gets forgotten as a language because everyone just compiles everything to asm.js, which ends up being executed natively by the kernel.

Just do a s/asm.js/WASM/g and it'll suddenly start making sense.


Moreover, Mozilla's asm.js paved the way for WebAssembly. Without asm.js, WebAssembly would never have existed.


asm.js was Mozilla's answer to Google's PNaCL and Adobe's CrossBridge.


Okay, fair enough. I read the description but I didn’t watch the video, so it’s possible I’m lacking context. Thank you for clarifying.


Yes, the video is pretty amazing - Gary in 2014 managed to predict pretty much everything that's happening with WASM nowadays: the push for safe execution of software written in unsafe languages, a common VM-esque instruction set, web browsers implementing it first and pushing the frontier in this regard, and now kernels also starting to be capable of running it in order to avoid the cost of switching rings.

What we've yet to see is this last point becoming the default, so that execution of user programs comes full circle by returning to kernel mode, except that now a trusted WASM-to-native compiler/runtime is responsible for ensuring system safety. That might happen in the next few years.


Reminds me of Singularity OS[1], where all programs were verified bytecode, allowing them to share address space with all the benefits that has.

[1]: https://en.wikipedia.org/wiki/Singularity_%28operating_syste...


Also the Common Lisp OS Mezzano[0] runs like that. It runs Doom and Quake by lowering C to LLVM IR, which is then lowered to a subset of CL, which is then compiled by the trusted native Mezzano compiler.

[0] https://github.com/froggey/Mezzano/


I was playing around with server-side wasm, by way of OpenCL compiled to it and interfaced via Deno. It felt pretty cool until I realized wasm is 32-bit, which limits memory to 4 GB for whatever process you've compiled into it. Unfortunately that ruled it out as an option for our use case.

The idea is still really interesting, though. It's almost like micro-containerization. I was reading this and it's what sold me on attempting it:

https://www.techtarget.com/searchitoperations/news/252527414...


The Memory64 proposal is coming along, with experimental support in Firefox, Chrome, Wasmtime, Node.js, and Deno. ([0] has the status of the current proposals, with a support matrix.) Hopefully it'll get fully supported soon!

[0] https://webassembly.org/roadmap/


This actually seems to be experimentally supported by some runtimes, like wasmtime[0].

[0]: https://github.com/bytecodealliance/wasmtime/pull/3153
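
For anyone curious what opting in looks like from the embedder's side, here is a minimal sketch using wasmtime's Rust embedding API (hedged: it assumes a recent wasmtime crate and the anyhow crate for errors, and the guest module filename is a made-up placeholder):

    use wasmtime::{Config, Engine, Instance, Module, Store};

    fn main() -> anyhow::Result<()> {
        // Opt in to the experimental memory64 proposal (off by default).
        let mut config = Config::new();
        config.wasm_memory64(true);

        let engine = Engine::new(&config)?;

        // "guest_wasm64.wasm" is a placeholder: a module built for the wasm64
        // target, whose linear memory is indexed with i64 and may exceed 4 GiB.
        let module = Module::from_file(&engine, "guest_wasm64.wasm")?;

        let mut store = Store::new(&engine, ());
        let _instance = Instance::new(&mut store, &module, &[])?;
        Ok(())
    }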


wasm doesn’t have threads right now, so I wonder if you couldn’t just make multiple wasm processes and implement message passing between them to get around this limitation? I imagine that if you’re building something where 4 gigs of RAM is a limitation, it would also benefit from using multiple CPU cores, and you’d have to go multi-process for that anyway.
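
As a rough sketch of that sharding idea (wasmtime used purely for illustration; whether the workers are threads or separate processes is a detail, and the "worker.wasm" module with its exported `process` function is made up), each instance keeps its own sub-4 GB linear memory and work is passed between them over channels:

    use std::sync::mpsc;
    use std::thread;

    use wasmtime::{Engine, Instance, Module, Store, TypedFunc};

    fn main() -> anyhow::Result<()> {
        let engine = Engine::default();
        // "worker.wasm" is a placeholder module exporting `process(i32) -> i32`.
        let module = Module::from_file(&engine, "worker.wasm")?;

        let (result_tx, result_rx) = mpsc::channel();
        let mut work_senders = Vec::new();

        for _ in 0..4 {
            let (work_tx, work_rx) = mpsc::channel::<i32>();
            work_senders.push(work_tx);
            let (engine, module, result_tx) = (engine.clone(), module.clone(), result_tx.clone());

            thread::spawn(move || {
                // Each worker gets its own store, hence its own <4 GB linear memory.
                let mut store = Store::new(&engine, ());
                let instance = Instance::new(&mut store, &module, &[]).unwrap();
                let process: TypedFunc<i32, i32> =
                    instance.get_typed_func(&mut store, "process").unwrap();

                for item in work_rx {
                    let out = process.call(&mut store, item).unwrap();
                    result_tx.send(out).unwrap();
                }
            });
        }

        // Shard the work across instances instead of cramming it all into one memory.
        for (i, tx) in work_senders.iter().enumerate() {
            tx.send(i as i32).unwrap();
        }
        drop(work_senders);
        drop(result_tx);

        for out in result_rx {
            println!("worker result: {out}");
        }
        Ok(())
    }

(The same shape works with OS processes and pipes instead of threads and channels, which is closer to what the parent describes.)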


So the performance win appears to come from relying on sandboxing to safely run everything in ring zero, so they can bypass the overhead of system calls. Which is actually pretty cool, and depending on the workload could reasonably lead to better performance than user space.


It also introduces DoS opportunities because resource ownership is murkier in the kernel.


It’s always refreshing to see interest in WebAssembly and the Wasmer projects!

This kernel project is a bit outdated since it is not using the latest Wasmer API (3.1), but it would be great to see the community pick it up and move it further. By making Wasm run in the kernel we were able to get really great syscall speeds at runtime, since the program does not have to pay any cost for crossing kernel protection rings.


Or you could just use io_uring and stay in userland without context switching.


90% accurate. If you are on a single core system, you might avoid a lot of syscalls, yes. But the kernel is still going to be context switching your work out to actually do the io.


Single core systems don't really exist anymore outside embedded systems.


Sure. I guess the point is that the kernel still does work.

There's no official kernel mechanism where IO is entirely done in userland (like Intel's DPDK and SPDK). The kernel will be context switching & doing IO. Single core was just an extreme example to make that extremely visible & obvious, but the core point is that the kernel needs time too during io_uring, whatever the core count.


So long as you have multiple cores, you can make it so that userland and the kernel are on different cores with nothing else on them. You can also disable interrupts and just spin both the user and kernel threads continuously.

No context switches ever need to occur.
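
For concreteness, a rough sketch of that setup with io_uring's SQPOLL mode, using the Rust io-uring crate (an illustration only: method names assume a recent version of the crate, SQPOLL needs a reasonably recent kernel or elevated privileges, and the file path is arbitrary):

    use std::fs::File;
    use std::os::unix::io::AsRawFd;

    use io_uring::{opcode, types, IoUring};

    fn main() -> std::io::Result<()> {
        // IORING_SETUP_SQPOLL: the kernel starts a dedicated poll thread that
        // watches the submission queue, so submitting work normally needs no
        // syscall at all (only a wakeup if the poll thread has gone idle).
        let mut ring = IoUring::builder()
            .setup_sqpoll(2000) // idle timeout for the kernel poll thread, in ms
            .build(256)?;

        let file = File::open("/etc/hostname")?;
        let mut buf = vec![0u8; 4096];

        let read = opcode::Read::new(
            types::Fd(file.as_raw_fd()),
            buf.as_mut_ptr(),
            buf.len() as u32,
        )
        .build()
        .user_data(0x42);

        // Pushing the entry is just a write into the shared ring buffer; the
        // kernel-side poll thread picks it up on its own core.
        unsafe { ring.submission().push(&read).expect("submission queue full") };

        // Waiting for the completion is where this thread may still block.
        ring.submit_and_wait(1)?;
        let cqe = ring.completion().next().expect("completion entry");
        println!("read {} bytes", cqe.result());
        Ok(())
    }

Submission avoids syscalls while the poll thread is awake; the dedicated kernel core and the completion wait are the trade-offs being debated below.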


Are you proposing giving up a core or more of your system to the kernel (and moving your data between cores to ship it!) as a win? There are some scenarios where I can picture that being a win, where ultra-low-latency compute is key, but elsewhere higher latency is fine, and this technical possibility doesn't generally excite me.

We may technically be at 100% no context switching, but we did kill 1/8th of our cores or whatever & stress the fabric a lot more, just to satisfy a technical constraint we set up that sounds good but is actually foolish to ask for.

Going totally userland is probably smarter. Go all in on SPDK or DPDK (storage/networking). You'll need dedicated networking for the app. In general though, I think your top post was 90% right, and that's good enough; right now hunting for 100% is a mis-goal.


Using DPDK not only requires dedicating one core to it, but also very often dedicating the NIC to your app. It's also a very heavyweight and complex framework, and disables all kinds of security.

io_uring is a compromise with a lot of advantages over that, and can be configured in different ways depending on how hard you want to optimize things.


I already mentioned the dedicated NIC in DPDK.

I very much agree io_uring is a great way to go & it was a good mention. But you mischaracterized it at the start & don't acknowledge that at all.


I didn't. io_uring allows you to remove context switching entirely. That is a factual statement.


If you give up extra cores and make your fabric soak the traffic, sure. The first alone is probably, in almost all cases, worse than just using io_uring and allowing context switching, and then it gets worse.

This is a parade of shitty, disingenuous, crappy posting that favors pathetic minor technicalities over the main use & actuality. Shame on you. I was trying to inform a little bit while 90% agreeing, & you have brought confusion & misdirection at every turn. Seriously bad mojo dude, just awful. Almost no users will experience what you describe. It's still phenomenally good & great, excellent, a world better, but you are still using a 0.001% case, a technical possibility, to say you aren't wrong, while admitting nothing, no caveats, to the 99.999% who won't experience what you are proposing. Kill your insane ego & be reasonable; think of all the people you are misleading just a little bit.

If you seriously think io_uring completely supplants any and all reason for projects like kernel-wasm, I continue to disagree. I'd agree that io_uring captures a huge amount of the potential value and makes things much better. But there are still barriers that get crossed with io_uring, & considering other options is, in my humble view, interesting. I have seen zero willingness on your side to entertain any such possibility.


You're the one derailing the thread. I made a factual statement that is true. You argued it wasn't. I explained how it was true. Then you keep insisting with "but [...]" objections that are all irrelevant to the problem that this solution addresses.

Being in the kernel alone doesn't remove interrupt-driven context switching, or being scheduled out by other kernel threads. Those are all completely orthogonal concerns, which can all be fixed without needing to put more user code in the kernel.


The single-threaded example I gave is exactly the counter to this kind of narrow-minded disregard. Yeah, there are interrupts happening. But whether IO has to bridge the user/kernel barrier is still the main question here, and io_uring reduces that but emphatically does not free us from it; it simply defers/batches/reduces the number of syscalls. You seem unwilling in the extreme to recognize this, and I don't get why you falsify your statements again and again to resist it, with your reliance on 0.001% technical possibilities almost no user would experience to justify your essentially irrational & misleading claims.

You still haven't shown any compromise. I think almost no one would agree with you. io_uring is great, but as much as you dodge around the fact, there's still a kernel/user barrier, and there are systems like kernel-wasm or DPDK that keep processing on one side or the other, and, for some asinine reason or simply a personal failing, you are unable to admit that obvious, clear & essential advantage.

> I didn't. io_uring allows you to remove context switching entirely. That is a factual statement.

If you use io_uring, there is more kernel work than there otherwise would be. Work must transit the user/kernel barrier. And this has a cost, which can be avoided by other schemes. Simple as that. Fact. Stop throwing smoke bombs & be real. Can you show any recognition whatsoever, any attempt to acknowledge a single statement I've made (as I have done with yours), rather than blowing up every single thing I've said? It seems not. This seems like enormously bad-faith posting, not done in the spirit of finding things out & discussing.


This is 2 years old, does anyone know of any newer progress in this area?


It might be just my impression, but it seems WASM itself has not moved forward in the last 2 years.



Faster than native?

So, WebAssembly has achieved what Java has tried to sell us since its inception?


Yeah, basically by bypassing the OS's virtual-memory switching/context switching with their own brand of memory switching/context switching, but with one notable difference: stackless internal representation (IR) bytecodes.

Can we open up a new class of vulnerabilities in MITRE?


Could operating systems use this technique to speed up things?


We already have things like io_uring to achieve this speedup.

As usual, this benchmark is comparing against an extremely naive implementation. It even uses a non-optimizing compiler!

A fair benchmark would take advantage of all the features that native code has available - huge pages, CPU acceleration, io_uring, and, most obviously, a compiler that supports -O2.

If they did that, the native code would smoke wasm.


To be fair, almost any JIT VM is capable of being "faster than native" (by which is meant pure AoT) in some circumstances because it has more information on the environment and circumstances of execution.


That's what I've been reading since the first release of Java. Still, C/C++ and Rust run circles around it.


Depends on the benchmark, and how it actually matters in production code.


What are good examples?


Any product that has managed to make its customers achieve business goals within profit margins.

Winning benchmarks is meaningless without a business case for them.


> Safely run WebAssembly in the Linux kernel, with faster-than-native performance.

That's the first sentence on the GitHub page. It seems they decided to mix the marketing in first.


And faster than "native" means faster than userspace...

And by "faster" they mean IO performance, which is fine in principle, but people generally think about compute when they care about the performance overhead of WASM: there is nothing about it that should make IO slower than native, but everything about it makes compute slower than natively compiled programs.


I would love to see an actually fair comparison, where the native implementation uses io_uring and huge pages.


I'm surprised no one is mentioning eBPF. I know the goals are slightly different but there seems to be a lot of overlap, e.g. would we really want both runtimes in the kernel?


eBPF is mentioned in the README, but I think you could trivially implement eBPF on top of this (and quite possibly have it be at least as performant).


Today is January 21, 2023 and JavaScript must die.


I don't quite understand what Javascript has to do with the topic of the discussion.

WebAssembly is to Web what JavaScript is to Java.


I was thinking that these comments about the death of JS, in a thread about running in OS kernel mode rather than inside the browser, are coming from ChatGPT-type bots, or that the users are product managers with no clue.


Haha. Now the browser runs in the kernel. What can go wrong?

I think it was Microsoft that saw that running graphics in the kernel is a bad idea.

But, yeah, that was a very long time ago, so why not make the same mistakes over and over again.


Sorry but respectfully, you might want to research things before jumping to conclusions. WebAssembly doesn’t have anything to do with a web browser. It’s literally just a virtual machine standard. The standard was written with web browsers in mind (i.e., secure sandboxes within the browser environment), but nothing about WebAssembly’s semantics and runtime environment requires a web browser.


When I put webassembly into google the very first result has this to say:

> WebAssembly (abbreviated Wasm) is a binary instruction format for a stack-based virtual machine.

Yes, browsers implement this nowadays, but there are other implementations as well. You can compile them to LLVM bitcode, for example, or there's another implementation for GraalVM, etc.


It's not a browser (or even a piece of a browser), it's a method of statically sandboxing a native binary.


WASM isn't really "native". A C/Rust/etc compiler can target it, but it's very much a non-native ISA, with its own "syscall equivalents", its own limitations, etc.


But it doesn't have to be run in a runtime; it can compile directly to a platform-native binary. Internally it has limitations, as any sandboxed code would, but it lets you take native code and compile it to a native binary with sandboxing built in.


By that argument, Python is "native" too then, with e.g. https://github.com/exaloop/codon

The reason why it doesn't quite work out is that typically such a thing, AOT-compiled into native code, still carries with it its limitations and assumptions. It's technically native, but experiencing the native APIs only through a narrow peephole in the fence.

If we let this meaning of "native" take over, we're gonna need another word for actually native things.

Sometimes, that works well enough, though!


That's fair, although I guess "having a runtime" can also refer to things like running a garbage collector (which WASM doesn't but Python does, and even Go does), even if it's part of the same binary. But we're entering territory where these terms start to get poorly defined, I think (new tech is pushing the boundaries!).

Subjectively, I think WASM's feature and non-feature list makes it plausibly kernel-friendly in a way that e.g. Python isn't, but there may not be a concrete line to draw around that.



