QEMU 6.2 (qemu.org)
210 points by Tomte on Dec 16, 2021 | 48 comments



QEMU 6.2 introduces full hypervisor support for M1 Macs. I've installed ARM Ubuntu Server in a VM on my MacBook and written a blog post explaining how. It uses libvirt to start the VM in headless mode.

https://www.naut.ca/blog/2021/12/09/arm64-vm-on-macos-with-l...

And this is the script to force QEMU to run on the M1 efficiency cores to save battery life:

https://github.com/yoonsikp/vm_configs/blob/master/power_sav...


> It uses libvirt to start the VM in headless mode.

So then, I could be running virt-manager and manage VMs on an M1? That would slide nicely into my existing virt-manager usage. But, not with a soldered-in SSD. I tend to keep machines running a LONG time, and I'd hate to pitch an otherwise perfectly-good machine because the SSD wore out and couldn't be replaced. The "expensive disposable" situation with cell phones is bad enough, let's not have this in our PCs as well.

As a counterpoint, the only SSDs I've ever had fail on me were some early OCZ units (big surprise), but I still check the SMART data on drives I'm using and see the wear indicators slowly advancing. I keep stuff backed up on ZFS spinning rust.


I'm sure Apple will introduce a good "SSD Replacement" program when they wear out. We'll be without our machines for a few weeks, it'll cost $200 more than it should - but they will find a way to replace them and stop everyone complaining.


Thanks for that blog post! I downloaded https://mac.getutm.app/ a couple of weeks ago but haven't used it yet - this sounds like a similar but more automated way to do what you do?

Coming from Windows, Vagrant & VirtualBox to an M1 Mac, I'm looking for an easy way to manage multiple VMs for developing different projects.


I'm using UTM to run an Ubuntu server where I do all my dev work. Since it's a VM, I'm a lot less scared of messing up my main system. It's very convenient with VS Code Remote, which only needed to be configured once and automatically connects to the VM.


Interesting! I tried to install the Windows 11 ARM preview with it (https://www.microsoft.com/en-us/software-download/windowsins...) but no luck. Do you have a working config with Win11?


    qemu-system-aarch64 \
      -M virt,accel=hvf \
      -smp cores=2 -m 4G -cpu host \
      -bios ../QEMU_EFI.fd \
      -device ramfb \
      -device nec-usb-xhci -device usb-kbd -device usb-tablet \
      -drive if=none,file=Windows.vhdx,id=windows \
      -device nvme,serial=1234,drive=windows
And for networking, use the virtio drivers for Windows. After installing those, you can remove the USB devices and switch to -device virtio-keyboard -device virtio-tablet for input too.


I'd be curious to see startup time, performance, etc, vs the more typical way to run docker on MacOS (xhyve).


That's really neat stuff, thank you for writing that article. I'm going to give it a go :-)

If you hadn't, I probably wouldn't have had any reason to try it!


Can we also use QEMU to emulate M1 Macs on x86 hardware?


Probably this fork[1] could be a good starting point.

[1] https://github.com/alephsecurity/xnu-qemu-arm64


PowerPC versions of Mac OS X run well under QEMU/UTM on Apple silicon. I cobbled together this guide using Tiger as the guest: https://tinyapps.org/docs/tiger-on-m1.html . More than performant enough for running OED[1] and MPEG Streamclip[2], two old favorites.

[1] https://global.oup.com/academic/help/technical-support/cdrom...

[2] http://www.squared5.com


I'd love to understand virtualization/emulation better on the technical level. If I understand correctly, in modern CPUs it's all done with the help of specialized support in the instruction set, but I've never learned in depth:

   - how that works
   - how it differs between Intel/AMD/ARM
   - whether there are competing approaches
   - what are the main software project "players" in the field
   - which of them are open-source with humanly understandable source code
Is there a summary/tutorial about all this stuff someone could recommend (or write)?


For the first two you can go straight to the sources:

Volume 3 of the Intel 64 and IA-32 Architectures Software Developer’s Manuals, Chapter 22: Introduction to Virtual Machine Extensions:

https://www.intel.com/content/www/us/en/develop/download/int...

Volume 2 of the AMD64 Architecture Programmer's Manual, Chapter 15: Secure Virtual Machine:

https://www.amd.com/system/files/TechDocs/24593.pdf


Well, yes, you can do that, but Intel's Software Developer's Manuals are terribly written. They are good as references for details, but only after you understand the grand picture. Learning something new from them is quite a torturous process.


Maybe it's that brain worm that I got from staring at too much assembly, but I actually find that part of the Intel manuals to be quite approachable.

Even to get a general overview, the introduction section I linked seems fine?


Here are the really simple explanations.

Emulation is pretty much literally just mapping instructions between processors. So there may be an instruction in my custom chipset called "ADD4", which adds 4 inputs. I would emulate ADD4 RAX, 0x1234, 0x2345, 0x3456 by

ADD RAX, 0x1234; ADD RAX, 0x2345; ADD RAX, 0x3456;

It gets a bit more complicated with architecture differences like memory configurations. But that's all emulation is.
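The instruction-mapping idea above can be sketched in a few lines. This is a toy model only: "ADD4" is the commenter's hypothetical instruction, and the tuple encoding and register dict are made up for illustration, not any real emulator's representation.

```python
# Toy sketch of emulation as instruction mapping: a hypothetical guest
# instruction ADD4 (add several operands into a register) is rewritten
# into multiple host ADDs. Encodings here are invented for illustration.

def translate(guest_instr):
    """Map one guest instruction to a list of host instructions."""
    op, dest, *srcs = guest_instr
    if op == "ADD4":
        # One guest instruction becomes three host instructions.
        return [("ADD", dest, s) for s in srcs]
    return [guest_instr]  # already a host instruction, pass through

def run(instrs, regs):
    """Execute translated host instructions on a register file (a dict)."""
    for op, dest, src in instrs:
        if op == "ADD":
            regs[dest] = regs.get(dest, 0) + src
    return regs

host_code = translate(("ADD4", "RAX", 0x1234, 0x2345, 0x3456))
regs = run(host_code, {"RAX": 0})
# regs["RAX"] == 0x1234 + 0x2345 + 0x3456 == 0x69CF
```

Real emulators do this translation at the level of machine code (often caching translated blocks, as QEMU's TCG does), but the mapping principle is the same.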

When you're virtualizing, you pretty much just need to manage hardware. The hypervisor does this for you by managing which resources go where. You could virtualize by just running the guest like a program, but that's really painful and tedious, so you rely on the CPU to support it. Each chip has its differences, but it's effectively just like a syscall. You have VMCALL instructions and VM exits. And then you have a handler in your VM-exit table, which is exactly like a syscall table. So if (exitreason == CPUID_EXIT) cpuid_handler();
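The syscall-table analogy can be made concrete with a toy dispatch loop. Everything here is illustrative: the exit-reason codes, handler names, and vcpu dict are invented, while in real VMX/SVM the exit reason and guest state live in hardware structures (the VMCS/VMCB).

```python
# Toy sketch of a hypervisor's VM-exit dispatch loop, analogous to a
# syscall table. Exit-reason codes and handlers are made up for
# illustration; real exit reasons come from the VMCS/VMCB.

CPUID_EXIT, HLT_EXIT = 10, 12  # illustrative exit-reason codes

def handle_cpuid(vcpu):
    vcpu["rax"] = 0xDEADBEEF  # pretend to report CPU features
    return True               # resume the guest

def handle_hlt(vcpu):
    return False              # guest halted; stop running it

# The "vmexit table": exit reason -> handler, just like a syscall table.
EXIT_HANDLERS = {CPUID_EXIT: handle_cpuid, HLT_EXIT: handle_hlt}

def run_guest(exit_stream):
    vcpu = {"rax": 0}
    for reason in exit_stream:  # each iteration is one VM exit
        if not EXIT_HANDLERS[reason](vcpu):
            break
        # on True, a real hypervisor would VMRESUME the guest here
    return vcpu

vcpu = run_guest([CPUID_EXIT, HLT_EXIT])
```

The guest runs at full speed on the real CPU; the hypervisor only gets involved at these exit points, which is why hardware virtualization is so much faster than emulation.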

For a good book, look up "Hardware and Software Support for Virtualization" https://www.amazon.com/Hardware-Software-Virtualization-Synt... . It's honestly the only good resource I've found on what really makes this work.


Thank you for a good explanation and book recommendation.


I know next to nothing about hardware / instruction set support from the CPU side, but there are several hypervisors that are open source and thus available for study: QEMU itself, Xen, Linux KVM, FreeBSD's bhyve, OpenBSD's vmm. Oh, and there's VirtualBox.

I vaguely recall someone telling me that the qemu source code is fairly readable, but I have not looked at any of these myself, so consider it hearsay.

I don't know about the others, but the FreeBSD developers have a separate mailing list for discussion of virtualization, https://lists.freebsd.org/subscription/freebsd-virtualizatio...


> I vaguely recall someone telling me that the qemu source code is fairly readable [...]

That someone was either pranking you or knows a mysterious part of the qemu codebase that's unlike the rest of the criminally underdocumented qemu soup. Or my standards are too high.


The source is okay; the problem is that there are really no docs/examples for even the basic stuff. Do you want to boot from a virtual block device and PXE from the network ... well, maybe these 300 command line arguments in some random order will work. Eventually. Have fun. So reading the source is a big help there, but it's not ideal. And libvirt/oVirt/etc are helpful, but not for that quick and dirty "let's try something out with the linux kernel in qemu" thing.


That person might have been sarcastic; I've been known to miss that, especially in written conversations. I also might misremember.


The xhyve fork/port of bhyve for MacOS is worth mentioning: https://github.com/machyve/xhyve


a quick bird's eye view summary to get your bearings:

(caveat, I'm taking shortcuts so things may not survive scrutiny but it should be good enough for an overall understanding and getting down to the truth if you so desire)

emulation: software that implements and simulates hardware, mimicking it as expected by the software that runs inside the emulation

this is slow, because you have to really pretend to be the hardware down to the details (simulate clocks, irqs, video output, etc). often you can get away with not being 100% exact and accurate, depending on the emulated software's assumptions, or if you don't care that some specific output is incorrect (e.g a graphical difference or oddity in games), which gives you some nice speedups. 100% accurate emulators can and do exist, but you can see the difference in hardware requirements between them (see mesen, higan) and inaccurate emulators.

some people quickly figured out that if you are running software compiled for the same target as the host (e.g an x86_64 CPU), then it kinda doesn't make sense to emulate the CPU... you can kinda obviously pass through cpu instructions instead of simulating the very hardware you're running on... this is:

virtualization: instead of emulating, expose a non-emulated device from the host that the software is known or expected to be compatible with, thus saving a lot of cycles.

for the CPU this is typically done through instructions that create an isolated context in which the software will run. reminder: a CPU already has multiple "rings" under which software runs, for security and stability reasons (memory protection, etc...), e.g the kernel runs under ring 0 and userland processes under ring 3, each in a different context preventing one from accessing another in the same ring, or from climbing up rings, all checked at the hardware level. the virtualization instructions roughly do the same thing and nest things so that a virtual cpu runs code in a separate ring+context, in a way that the guest OS thinks it's alone. see also type 1 (xen, hyper-v) vs type 2 (qemu, vmware, parallels) hypervisors.

other devices can be implemented in lightweight, efficient ways, and exposed as virtual devices to the virtual machine for the nested OS to pick up. e.g instead of emulating a full PCI network card complete with pseudo-physical hardware matching a real one, one can expose a simpler device that merely takes ethernet packets and injects them into the host kernel's network stack. or the same for IO: instead of emulating a sata device complete with commands and all, one could just expose a block device and ask the host kernel to read its data from some host file directly. this requires dedicated drivers for these virtual devices in the guest OS though.
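The contrast between emulated and paravirtual devices can be sketched as a toy model. Both classes, the packet format, and the method names are invented for illustration; they are not real QEMU or virtio interfaces, just the shape of the trade-off.

```python
# Toy contrast: an "emulated" disk that must parse hardware-style command
# packets vs. a "paravirtual" disk that exposes a direct block interface.
# All names and formats here are illustrative, not real QEMU/virtio APIs.

class EmulatedSataDisk:
    """The guest's stock driver builds ATA-style command packets;
    the emulator must decode each one field by field."""
    def __init__(self, backing: bytes):
        self.backing = backing

    def submit(self, packet: dict) -> bytes:
        # Emulator work: decode opcode, LBA, sector count, ...
        assert packet["opcode"] == "READ_SECTORS"
        off = packet["lba"] * 512
        return self.backing[off : off + packet["count"] * 512]

class ParavirtDisk:
    """The guest runs a dedicated driver, so the host can skip the
    hardware protocol and service plain block requests directly."""
    def __init__(self, backing: bytes):
        self.backing = backing

    def read(self, offset: int, length: int) -> bytes:
        return self.backing[offset : offset + length]

disk_image = bytes(range(256)) * 8  # a 2 KiB fake disk image

sata = EmulatedSataDisk(disk_image)
virt = ParavirtDisk(disk_image)

# Same data either way; the paravirtual path just skips packet decoding.
same = sata.submit({"opcode": "READ_SECTORS", "lba": 1, "count": 1}) == virt.read(512, 512)
```

The speedup of real paravirtual devices comes from exactly this: fewer traps into the hypervisor and no hardware-protocol decoding, at the cost of needing a guest driver.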

virtualization is as good as native because often it's literally native, only with hardware sandboxes and maybe a few added indirection wrappers. so much so that Windows on PCs, and apps and games on Xboxes, actually run in VMs under Hyper-V all the time!

so back to the CPU side of things: you can notably see how it is impossible, with virtualisation, for a CPU to execute anything other than its own instruction set.

now, executing foreign instructions more efficiently than literally interpreting them can be achieved through various means such as binary translation or dynamic recompilation, but it still has to take some guest assumptions into account. e.g Rosetta 2 (which is neither virtualisation nor emulation) translates x64 code to arm64, but x64 assumes a certain ordering of operations between cpu and memory that just cannot work as-is on arm64, so there's a special Apple-specific instruction to switch a CPU thread to a different memory model, still running arm64 instructions (translated from x64) but in a way that matches the x64 behaviour.

there are many more gritty details, and things are not always as clear cut as I described (or flat out lied about, the Feynman way) but it should be a good enough model to start with.


A wonderful answer. Thank you!


Nice, I've been using qemu on my Mac as an alternative to Docker. I run Arch Linux in a VM with Docker installed and an ssh daemon running on port 2222. On my Mac, I simply point the docker CLI at the VM by setting DOCKER_HOST=ssh://jilles@localhost:2222

Works great and it cut me loose from docker for mac. The networking was a bit of a pain in the ass to get going until I figured out that it basically does not work without libvirt installed as well. Basically none of the networking options do anything until you do that (I've tried them all).


I do a similar thing using Lima (which uses Qemu under the hood) https://github.com/lima-vm/lima

(I'm not a fan of Docker, but unfortunately work on some projects that use it)


> Basically none of the networking options do anything until you do that (I've tried them all).

Can you share more? Is it completely impossible to get networking working in a VM (say, a Debian x86 VM running on an M1 Mac) without libvirt, or is it just very hard to configure?


How are you doing volume mounts? That's where I tend to come unstuck.


I’ve done something similar, although I ditched Docker at the macOS layer entirely and use Docker within my (Ubuntu) VM.

I use Mutagen (https://mutagen.io/) for syncing files between macOS and the VM. Works well.

(And overall, my Drupal project is about 20x faster through the VM than Docker for Mac, which I still find incredible)


I run Debian in a VM, use ssh and VSCode SSH tools to do work within the VM.

Compilation, docker, everything happens in the VM and my MacOS host is just a thin client that hosts my code editor and browser.

VSCode will automatically forward ports, but I prefer to manually manage port forwarding using `-L` and `-R` ssh arguments.


>I run Debian in a VM, use ssh

Can you share how you configure networking for your Debian VM? Is it different for qemu running on Apple M1 silicon?


I don't tend to use those for development, but I imagine they would end up mounting directories inside the VM. You could get to those via scp or something like that.


The problem tends to be getting inotify to work properly across the boundary. You want to propagate filesystem changes into the VM, in the ideal world. Docker-desktop's got some closed-source wizardry that makes it Just Work, I'm just not sure what's as slick in the open-source ecosystem.


is official NFS over vsock a thing yet in QEMU?


The main interesting bit (to me, at least) is:

* On macOS hosts with Apple Silicon CPUs we now support the 'hvf' accelerator for running AArch64 guests


Is Windows 10 ARM64 or AArch64?


Both names refer to essentially the same thing. (Much like "x86_64" vs. "amd64".)


Only for AArch64 guests though

So not so interesting for me

https://wiki.qemu.org/Features/HVF


Well, yes. HVF (Hypervisor.framework) passes through to the native CPU, which can't run code for other architectures.


Unless Apple implements some kind of support for the x86 ISA in its CPUs (which is unrealistic), hvf will always only support arm and aarch64 guests. Running a non-native architecture requires emulation, which will always be slower than just running a guest on the native CPU.


Rosetta emulates x86 quite fast. I think it's a matter of the engineering effort required to implement that emulation at the VM level.


Rosetta recompiles ahead of time. That's why it's so fast. Would probably not be possible with a full VM


This would definitely be possible with a full VM, it’s what VMware did before hardware accelerated virtualization became a thing.

It’s just more difficult and less efficient than accelerated virtualization.


This, exactly. Rosetta 2 emulates x86_64 Darwin binaries on AArch64 Darwin - it means that tricks like jumping to native code as soon as possible are feasible.

For instance, you can call a native counterpart of a system library instead of emulating it down to the syscall level, and generate native code that calls into that.

Running kernel level code requires you to emulate an entire machine, which is way more complicated than just emulating the userland like Rosetta does. Running an unmodified OS kernel requires emulation of stuff like interrupts, firmware, peripherals ... Yes, you can dynamically JIT compile some codepaths into native code, which is something that AFAIK most emulator already do, but there's still a much higher performance cap than with something like Rosetta or Wine.


Well no, you can't use the native virtualization on a different architecture. You should be able to use TCG acceleration, though.


Great news, QEMU is really great. I'll have to give this a spin on the MiSTer again. I was experimenting with emulating the 68K in QEMU in combination with an FPGA reimplementation of the Amiga chipset. I ran into performance issues with the way AmigaOS manages memory, where stack and code can be in the same page. QEMU seems very inefficient in that case since, I think, it constantly invalidates the translated code; I wonder if that aspect has been improved at all.




