Choosing between programming in the kernel or in user space

tptacek · on Jan 21, 2018

This answer is malpractice on at least two levels:

1. It entertains a performance argument without interrogating the task and environment the appliance is engaged in. You can always in theory make a typical program faster by rewriting it in carefully scheduled assembly. We don't often do that because the performance win would be irrelevant to the problems we're trying to solve.

2. It pretends that the only reliability benefit from moving to u from k is in virtual memory. But, no: the major benefit is from moving from C to something more expressive and safer. The notion of "defect density" captures this argument: with it held constant, a solution that can be expressed in fewer lines of code will have fewer faults. But, of course, defect density between C and a high-level language won't be constant, and it won't favor C, where you can't even count safely with the basic types provided by the language.

codemac · on Jan 22, 2018

Once the driver doesn't exist in the kernel (vfio/iommu), and you have processes and threads and efence/mprotect, this all doesn't make sense for practical purposes. The author's reasoning just doesn't hold up in modern networking and storage appliances when it comes to state of the art performance in Linux userspace.

aidenn0 · on Jan 22, 2018

RE: 2

There is no reason HLLs can't be used in kernel space, is there? Excepting shell-like languages (which require fork, which you typically can't do in a kernel), most embedded kernels provide everything a HLL might expect in kernelspace.

koverstreet · on Jan 22, 2018

Malpractice? Strong words, considering you're mischaracterizing the article.

"Virtual memory—which gives us the user/kernel space split and the process model of programming whereby programs are protected from each other—is just the most pervasive form of isolation."

That's not claiming virtual memory is the only reliability benefit, and you can't expect a short article to be a treatise on all the differences between kernel space and userspace.

Furthermore, the article isn't claiming that kernel programming is _better_ than userspace programming, it's just pointing out that there's a tension between performance and safety - which you acknowledge!

tptacek · on Jan 22, 2018

There isn't necessarily even that tension. Modern high-speed networking does TCP/IP entirely in userspace.

koverstreet · on Jan 22, 2018

When networking is faster in userspace it's because it's a clean slate implementation, not because userspace is faster.

JoachimSchipper · on Jan 22, 2018

A TCP/IP stack isn't fundamentally faster for running in user- or kernelspace, but avoiding user/kernel transitions helps. So maximum-performance implementations that don't need to care about playing nice with others might be best off choosing one or the other and staying there; running in userland is then more convenient (but e.g. the TUX in-kernel webserver exists/existed.)

Mind you, most high-performance software doesn't need to go this far, and stuff like PF_RING / Linux' packet_mmap or "batched" systems calls like recvmmsg() (or even read()) further reduces the number userland/kernel transitions.

koverstreet · on Jan 22, 2018

The vast majority of the time though you need to let multiple processes use the same TCP/IP stack. As soon as you have to do that, moving the TCP/IP stack to userspace means you have to do context switches instead of syscalls.

That is not a good thing if you care about performance.

justincormack · on Jan 22, 2018

The kernel implementation does context switches and syscalls. It depends on your usersoace architecture if you have to do context switches, you get to decide.

koverstreet · on Jan 22, 2018

No. If I use netcat on my Linux machine to connect to a remote machine and send and receive packets, no context switches are involved.

If I want to move the TCP/IP stack to userspace on my Linux machine, how do all the programs I use (firefox, ssh, IRC client) send and receive packets without context switches?

If your answer is to change the userspace architecture and shove everything into one process, you've made isolation _worse_. If you don't, how do you avoid the context switches to talk to the network stack?

The key point is that the network stack has to own shared state, and sending and receiving packets _must_ touch that shared state. If it's accessible to every process (shared memory), you've tossed out isolation and security. If it's not, you have to context switch.

lmm · on Jan 22, 2018

> If your answer is to change the userspace architecture and shove everything into one process, you've made isolation _worse_.

Processes are one way of doing isolation but they're a very crude one (give each thing that needs to be isolated its own address space, swap the whole stack every time you switch tasks). A userspace scheduler (which, sure, would be running within a single process at the UNIX level, but that's the wrong level to be thinking about it) that knows about the details of what it's running can achieve the same level of isolation in a more lightweight way.

JoachimSchipper · on Jan 22, 2018

netcat does switch into and out of the kernel; you're otherwise correct that userland networking pulls towards a one-process appliance (hence my original "... that don't need to care about playing nice with others").

pjmlp · on Jan 22, 2018

Microkernel operating systems, like L4 and QNX beg to differ.

koverstreet · on Jan 22, 2018

What is your claim?

Are you going to claim they do networking faster than OpenBSD?

pjmlp · on Jan 22, 2018

Given the actual use in production systems, I guess so.

How fast does OpenBSD do radio networking on mobile phones?

How fast does OpenBSD do CAN networking on car systems and factory installations, with real time constraints?

koverstreet · on Jan 22, 2018

You're arguing a strawman.

pjmlp · on Jan 22, 2018

No, if OpenBSD was actually better for those use cases I guess companies would have noticed it already, instead of paying for microkernels with slower performance.

koverstreet · on Jan 22, 2018

They absolutely do have worse networking performance. Just not enough to matter for those applications.

pjmlp · on Jan 22, 2018

If OpenBSD is so great serving network requests why isn't it taking over the world of cloud services?

lmm · on Jan 22, 2018

Because cloud providers would rather pay for a little more hardware than have to hire or train an ops team that understands OpenBSD?

maga_2020 · on Jan 22, 2018

The succinctly noted

"The tension in any of these systems is between performance and isolation. "

Also caught my attention. In my mind it is a bit analogues to discussion about 'microservices'.

Although micro services, are probably at the peak of the Hype Cycle. Where isolation is projected as 'productivity enabler', across teams.

Very difficult to challenge this, especially if you work for a company where revenue is declining/not growing because of lack of ideation/innovation/market conditions.

So, when company fails to ideate and grow revenue, the product management/CIO will be turning onto the productivity.

I suspect the next move for the company, where the above is discussed, -- is moving Development to a lower cost area.

Especially if the 'hiring problem HR is noting' is not just with requirements, but also salary ceilings.

cjbprime · on Jan 21, 2018

Answer seems like a bad take to me. There are plenty of zero-copy userspace APIs, if memory copies are even a bottleneck.

jeff571 · on Jan 22, 2018

Yeah exactly! Just look at DPDK for example.

jacksmith21006 · on Jan 21, 2018

Be interesting to include Meltdown as a factor in the discussion at this point from a performance standpoint.

andyidsinga · on Jan 22, 2018

THIS : " If your problem must be decomposed into a set of cooperating processes, then programming in user space is the exact same problem as programming in the kernel"

Corollary (in my experience): many systems evolve to have many cooperating processes.

and THIS: "The tension in any of these systems is between performance and isolation."

To solve for the root issue described - hiring(1) - they should be looking for people who have an aptitudes for these those types of issues ..more so than say kernel or user space skills & experience.

(1) "HR complained that they could not find sufficient, qualified engineers to meet the demands of management for more hands to make more features"

TheAceOfHearts · on Jan 22, 2018

Minor unrelated complaint, but when you write "hiring(1)" like that, it immediately makes me think there's an unknown man page I need to read rather than a footnote reference. In my experience, citations and footnotes generally use brackets, e.g. "hiring[1]".