A deep dive into how linkers work (2008)

Apple released a new linker that's in the same ballpark as mold - see following link for previous discussion - https://news.ycombinator.com/item?id=36218330

haberman · 2024-08-22T16:41:47 1724344907

I always take this as inspiration that the performance quest is rarely over. LLD was specifically designed to be fast, as was Gold which came before it. But Mold blew them both away.

skywal_l · 2024-08-22T05:29:58 1724304598

[2008]

But these articles are gold (no pun intended) so always good to see a refresh on HN front page.

ForOldHack · 2024-08-22T21:21:41 1724361701

Yes. I had heard about someone who had fixed a bug in a linker, and then I thought just how hard can it be, and then there is this.

*GREAT* explanation.

weinzierl · 2024-08-22T07:48:43 1724312923

This is one of my most favourite article series and had so many eye openers for me. I also think there is no other resource, neither internet nor elsewhere, that has all this information in one place. I really wished Ian made a book out of it.

karma_fountain · 2024-08-22T08:16:15 1724314575

The book Linkers and Loaders by John R. Levine is pretty good.

kibwen · 2024-08-22T11:12:39 1724325159

Is there anything in this article series that's not in Linkers And Loaders?

KerrAvon · 2024-08-22T23:03:43 1724367823

This is much closer to the metal of a modern ELF implementation. Linkers and Loaders is excellent for background and covers a wider spectrum, but was something like a decade old at the time this was written.

boffinAudio · 2024-08-22T09:27:02 1724318822

I've printed-to-PDF all 20 chapters and have my own book of it now. My only desire is that Ian made a single-page-with-all-chapters view of it available...

InDubioProRubio · 2024-08-22T13:25:32 1724333132

https://www.airs.com/blog/archives/51

Pattermatching on assembler code and rearranging and reusage of sequences..

kuharich · 2024-08-22T15:12:08 1724339528

Past comments: https://news.ycombinator.com/item?id=27445981

xyst · 2024-08-22T16:49:27 1724345367

I can see why linkers were created, especially in a time of constrained memory. But given an abundance of memory in modern systems, are linkers even necessary anymore?

As a matter of fact, aren’t these shared libraries a supply chain attack vector (ie, xz attack that was thwarted earlier this year)?

deckard1 · 2024-08-22T17:11:58 1724346718

I know it's fashionable to use flatpak, Docker, etc. but I'd still rather not have 30 instances of Gtk running for every GUI app I decide to run. Consider that we still run on Raspberry Pi, etc.

> aren’t these shared libraries a supply chain attack vector

Not any more than the apps themselves. If you're downloading a static binary you don't know what's in it. I don't know why anyone trusts half the Docker images that we all download and use. But we do it anyway.

akvadrako · 2024-08-22T21:43:23 1724363003

I think what you mean when you say instance of Gtk is a copy of the Gtk library in memory?

That's not how flatpak works; identical libraries will share the same file on disk and will only be loaded once, just like non-flatpak apps. And because Gtk is usually part of the runtime most apps will use one of a few versions.

compiler-guy · 2024-08-22T19:58:03 1724356683

Somehow the compiler needs to either have the whole program in one single go--every last source file at the same time, all with exactly the same build options--or there needs to be a way to combine the results of multiple compilation steps.

Even with modern LTO, the compiler doesn't typically see _all_ files in the program at the source code level. Just many. Usually the C-library and C++ library are different.

So as long as various languages don't build the entire program in a single compilation and assembly step, we will need something that combines the results.

That's the linker.

Even building everything statically doesn't eliminate the need for the runtime linker, unless one hard-codes the exact address where a program can run. That runs counter to security measures like ASLR.

account42 · 2024-08-23T10:07:37 1724407657

> Even building everything statically doesn't eliminate the need for the runtime linker, unless one hard-codes the exact address where a program can run. That runs counter to security measures like ASLR.

You could have the program be position independent (use only relative adressing) and do without a linker for that limited use case.

citrin_ru · 2024-08-22T17:37:33 1724348253

1. static linking is still linking and you still need linkers to combine multiple object files into a single executable 2. mindset that memory and CPU are in abundance IMHO one of the reasons that user experience is not visibly improving over the years despite orders of magnitude faster hardware

ignoramous · 2024-08-22T17:44:15 1724348655

> But given an abundance of memory in modern systems, are linkers even necessary anymore?

This sudden abundance in memory has been adequately matched by sandboxes, packagers, libraries, and frameworks.

jcranmer · 2024-08-22T22:48:08 1724366888

Do you want builds to change a single line of code to complete in less than half an hour? If yes, then you need something in the vein of a linker to handle less-than-whole-program units of compiled code.