X86 instruction encoding and the hacks we do in the kernel [pdf]

userbinator · on Sept 4, 2015

The reference to the octal format (2-3-3) of the instructions is to this document:

http://reocities.com/SiliconValley/heights/7052/opcode.txt

If you're at all interested in the x86 architecture, it's highly recommended reading.

(This used to be one of the first search results for "x86 octal instructions". Now it's nowhere in the first 10 pages of results and searching for its link directly doesn't even cause it to appear. What happened, Google...?)

kens · on Sept 4, 2015

The interesting thing is the original source of the octal format. The 8008 had octal-structured instructions because that's what the Datapoint 2200 had, and the 8008 took its instruction set. The Datapoint 2200 used octal because it was built out of TTL chips and used 7442 decimal decoder chips for instruction decoding (which could fully decode 3 bits).

ianlevesque · on Sept 4, 2015

Not enough adsense on the page. But more seriously the fact that it is a usenet post copied to reocities with probably zero inbound links hurts its ranking.

cbd1984 · on Sept 4, 2015

> Not enough adsense on the page. But more seriously the fact that it is a usenet post copied to reocities with probably zero inbound links hurts its ranking.

Also not helping: Reocities is a website hosting the Geocities dump Jason Scott and his merry band of guerrilla archivists made back before Yahoo killed Geocities. The point of Reocities (and Oocities, and possibly others) is to allow people to fix their old Geocities links by changing one letter in the domain name.

So it's not only obscure, it's an ancient document which has been rehosted, so none of the original links to it still work.

userbinator · on Sept 4, 2015

The thing is, Googling the link (the Reocities version, not the original Geocities one) reveals that others have linked to it a few times before - that's how I originally found it - and I've also linked to it from here on HN; yet Google seems to refuse to acknowledge the existence of that page itself, as searching for specific quoted phrases in it shows. In other words this isn't "link rot"; it's more like "Google rot".

I think it's even more unfortunate that such gems of information are being lost not because the sites hosting them are gone, but because search engines are rendering them inaccessible despite the sites still existing. In fact with things like the Internet Archive, coming across a dead link is not so bad; not being able to know (from a simple search) that a page with such information actually exists, but just wasn't present in the results, is far worse.

This isn't the first time I've seen Google "disappear" pages still around and containing useful information, but it makes for a good example.

Edit: I didn't know Jason Scott was behind Reocities - I think he deserves another donation.

moyix · on Sept 3, 2015

If you're interested in reading more about the various kinds of runtime code patching used in Linux, there was a nice paper on it at last year's Malware Memory Forensics Workshop:

Slides:

https://www.acsac.org/2014/workshops/mmf/ThomasKittel-Code%2...

Paper:

https://www.acsac.org/2014/workshops/mmf/Thomas%20Kittel-%20...

legulere · on Sept 3, 2015

I still wonder why AMD didn't opt for a saner encoding for 64 bit mode while still mostly keeping assembly compatibility (32-bit binary code doesn't run in 64-bit mode anyway except some edgecases)

amluto · on Sept 3, 2015

The only one of those edge cases I can think of is in an exploit I wrote (http://www.openwall.com/lists/oss-security/2015/08/04/8). This part need to work when interpreted as 32-bit or 64-bit code:

  1: .byte 0xff, 0xca /* decl %edx */
     jnz 1b
     mov %%ss, %%eax  /* grab SS to display */
  
     /* Did we enter CPL0? */
     mov %%cs, %%dx
     testw $3, %%dx
     jnz 2f

     /* this part knows it's 64-bit */

  2:
     /* this part knows it's 32-bit */

The .byte thing is because 32-bit x86 allows two encodings for decl %edx, but the one-byte encoding got recycled for REX on 64-bit.

ant6n · on Sept 4, 2015

Probably this way they could reuse most of the instruction decoder between 32bit and 64bit mode, giving equal performance while saving transistors. I doubt it's related to compiling tools - instruction encoding dealt with by the assembler and those are easy to write. The actual changes that would have to be done to the compiler because of switching to 64bit are independent of the instruction encoding.

userbinator · on Sept 4, 2015

I also wondered about that when AMD64 first came out, and my feeling was "because they wanted to make it harder for Intel too". At the time, Intel was likely already developing a 64-bit extension of x86.

On the other hand, I think the 16 to 32-bit extension (with the 386) was done quite well. 16 and 32-bit code can coexist, and it's even possible to use the 32-bit registers in 16-bit mode; not so with AMD64. It's not too difficult to figure out how to add 64-bit support in a more non-disruptive way, without having to do silly things like removing instructions and features that they had to reintroduce later [1][2].

[1] http://www.pagetable.com/?p=25 [2] https://en.wikipedia.org/wiki/X86-64#Differences_between_AMD...

ithkuil · on Sept 3, 2015

perhaps it has something to do with the fact that it was easier to port existing compilers and perhaps even designing the first microarchitecture to support it and give decent performances without having to wait too much and loose competitive advantage.

I guess that 32bit mode could use most of the same microarchitecture and thus guarantee that during the transition period people would still buy those new chips.

Itanium was a good example of such a strategy that failed. However there might have been other reasons as well.

Dylan16807 · on Sept 3, 2015

I wouldn't call Itanium 'such a strategy', because it wasn't even similar to x86. 64 bit mode could have rearranged the instruction coding while keeping everything from ASM up roughly the same. Few single byte opcodes and bigger ranges for prefixes to improve density. Keeping all parts of register specifiers together to improve sanity. Etc.

caf · on Sept 4, 2015

Possibly to allow parts of the instruction decoder to be shared between modes?

earlz · on Sept 3, 2015

heh this reminds me of when I did some runtime patching in my own hobby OS kernel. I used it specifically for interrupt routines so that I didn't have to repeat this monstrosity header and footer for every interrupt handler.

Also, another thing begging for runtime modification is the `int` instruction (used to create an interrupt). There is literally no way to choose a random number of an interrupt to call. You're only option for that is to either do runtime modification to construct the 2 byte opcode on the fly (with the second byte being which interrupt to call), or to make a big table like `int 1; ret; int 2; ret; int 3; ret` and call into that manually. It is quite infuriating

blt · on Sept 4, 2015

The instruction format sure is interesting, to put it politely. I tried to write a toy JIT once for compiling math expressions in a scripting language... Didn't get very far.