They don't. You get twice the transistors, twice the power, and more than twice the cost (because the bonding itself costs money). See GPUs where power is increasing from 700 W to 1500 W. Moore's Law without Dennard scaling is kind of meh. You do save on networking because you need fewer servers.
I think the current version of their 3D cache does now extend over the cores and not just the cache of the underlying die.
The other big factor is that their cache chiplets are built with a fab process and standard cells that cannot tolerate high voltages, so the cores (which are on the same power rail) are constrained to not operate at the extreme end of the voltage/frequency curve where high-end desktop processors sacrifice everything to win a benchmark.
A higher component count often lets you run the same perf at a lower clock (Nvidia 2000-series mobile chips had more CUDA cores and a lower clock than the desktop parts and could match them at a lower TDP).
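A rough back-of-the-envelope sketch of why that works, assuming dynamic power scales roughly as C·V²·f and that usable voltage has to rise with frequency; the unit counts, clocks, and voltages below are made up for illustration, not real 2000-series specs:

    # Back-of-the-envelope: why more units at a lower clock can match
    # throughput in a smaller power budget. Dynamic power per unit is
    # roughly C * V^2 * f, and usable voltage rises with frequency.

    def throughput(units, freq_ghz):
        # work per second, assuming the workload scales across units
        return units * freq_ghz

    def dynamic_power(units, freq_ghz, volts):
        # arbitrary units; capacitance/activity factor folded into 1.0
        return units * volts**2 * freq_ghz

    # made-up numbers, loosely "desktop-ish" vs "mobile-ish"
    desktop = dict(units=2304, freq_ghz=1.7, volts=1.00)
    mobile  = dict(units=2944, freq_ghz=1.3, volts=0.85)

    for name, cfg in (("desktop-ish", desktop), ("mobile-ish", mobile)):
        print(name,
              "throughput:", round(throughput(cfg["units"], cfg["freq_ghz"])),
              "power:", round(dynamic_power(**cfg)))

Similar throughput, noticeably less switching power for the wider, slower configuration.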
You are right that geometry suggests a volume (heat generation) to surface area (heat dissipation) issue arising. You clearly don't want to build a sphere of pure compute layers.
But with the ability to stack compute layers AND non-compute layers of varying thickness, you can now have almost any 3D shape you like. Spacing things out will of course bring its own adverse effects from the added distance.
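To put a number on the geometry intuition: assuming heat generation scales with volume and dissipation with exposed surface, a thin slab of the same total volume has a far better surface-to-volume ratio than a compact cube (toy numbers below):

    # Toy comparison: surface-to-volume ratio (a crude proxy for how easy a
    # shape is to cool) for a cube of compute vs. a thin slab of equal volume.

    def cube_ratio(side):
        return (6 * side**2) / side**3          # = 6/side, worsens as it grows

    def slab_ratio(side, thickness):
        surface = 2 * side * side + 4 * side * thickness
        return surface / (side * side * thickness)

    volume = 8.0                                 # arbitrary units of "compute"
    cube_side = volume ** (1 / 3)
    slab_side = (volume / 0.1) ** 0.5            # 0.1-thick slab, same volume
    print("cube surface/volume:", round(cube_ratio(cube_side), 2))
    print("slab surface/volume:", round(slab_ratio(slab_side, 0.1), 2))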
Maybe stack a bunch of compute rings to form a hollow tube with liquid cooling through the middle and outside?
Or reverse that and have multiple compute sticks hanging off a baseboard dipping into your cooling vat like some reactor homage.
I think liquid cooling will become more commonplace.
Early days yet and brighter minds than mine will make things work but I am optimistic for the future!
anyway, just daydreaming with this, but I wonder if that would be feasible if every layer had fluid channels baked in? maybe oriented so convection does most of the work and submerged in subzero fluid?
would have to use a transparent chamber and really complicated looking connections ofc. need to maximize that cool factor
The compute layers are incredibly thin. Even dead-weight cooling layers transporting fluid between the compute layers would need to be kept as thin as possible.
This would require some fancy liquid management. Flow would be important and the slightest hint of a blockage most detrimental.
In similar vein to 3D printers being used to print parts to upgrade themselves:
"This generation of computing is fantastic for fluid modelling the cooling required for the next generation of computing."
These connections are big compared to other features, which means more capacitance (though far less than pins and PCB traces). So you're not going to build designs where a single design block spreads across layers into 3D; it's more for connecting logical blocks together (a CPU's L1 to L2/L3, or to memory), the sorts of places where you can spend a clock to move data between layers.
As others have mentioned, dealing with heat is an issue too: all those insulating layers don't conduct heat well either, so 3D chips tend toward the "hairy smoking golfball" scenario where getting rid of heat becomes your biggest problem.
This whole tech is mostly about solving the problem of die defects. As die size increases, yield drops sharply. One (very frequently used) solution is to build the design from a few separate blocks on one physical die and then disable the defective parts with jumpers, reconfiguring the working parts.
For example, many DDR3 parts have 6 chips on one die, but only 2–4 are enabled after final testing.
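The yield effect is easy to see with the simple Poisson yield model, Y = exp(-D·A), where D is defect density and A is die area; the defect density below is just an assumed round number:

    import math

    def poisson_yield(area_mm2, defects_per_mm2=0.001):
        # fraction of dies with zero defects under a simple Poisson model
        return math.exp(-defects_per_mm2 * area_mm2)

    # Bigger dies lose yield faster than linearly, and you also get fewer
    # dies per wafer -- hence all the binning and disabling of blocks.
    for area in (100, 200, 400, 800):
        print(f"{area:4d} mm2 die -> {poisson_yield(area):.1%} good dies")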
But if one could cheaply construct 2.5D packages, placing tested chiplets on a silicon interposer, that would create whole new fabrication opportunities.
For example, the newest Intel chips could combine one high-performance core (i7) + a few Atom cores + high-speed HBM DRAM, and even a high-power current switch or an analog die.
Looks like in the next decade we will see single-chip smartphones, etc.
To solve the heat issue let's limit ourselves to 2.5D chips, since the bottom layer barely heats.
Die size: 350 mm², at millions of connections per mm², means ~350 million connections; at 1 GHz that's 350,000 Tbit/s of memory bandwidth. The realistic number will probably be much lower, but still, even 2.5D could solve the memory wall.
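Spelling out that back-of-the-envelope figure, assuming one bit per connection per clock cycle (optimistic, and ignoring signalling overhead):

    # Back-of-the-envelope for the interposer bandwidth claim above,
    # assuming 1 bit per connection per clock cycle.
    die_area_mm2 = 350
    connections_per_mm2 = 1_000_000     # "millions of connections per mm2"
    clock_hz = 1e9                      # 1 GHz

    connections = die_area_mm2 * connections_per_mm2   # 350 million
    bits_per_sec = connections * clock_hz
    print(f"{connections/1e6:.0f}M connections -> {bits_per_sec/1e12:,.0f} Tbit/s")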
Since the interposer traffic is mostly vertical, you could add horizontal tracks of metal to conduct some heat out. Limited, but better than nothing.
I wonder how small a Stirling engine can be and power microfluidic cooling inside one such interposer.
What I really like about this is that the buses can be very wide, so even if you spend one clock cycle to do something, that something can be quite a lot of small things.
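For a sense of scale (bus widths and clocks below are made-up round numbers, not any specific product), a very wide, slow bus still moves far more per second than a narrow, fast one:

    # "Wide but slow" vs "narrow but fast": widths/clocks are illustrative only.
    def bandwidth_gb_s(bus_bits, clock_ghz):
        return bus_bits / 8 * clock_ghz     # bytes per cycle * Gcycles/s = GB/s

    print("narrow & fast:", bandwidth_gb_s(bus_bits=64, clock_ghz=4.0), "GB/s")
    print("wide & slow  :", bandwidth_gb_s(bus_bits=4096, clock_ghz=1.0), "GB/s")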
Could it be that by enabling dense, efficient connections between stacked chips, we are not only sustaining Moore’s Law but also paving the way for new chip architectures? What could integrating exotic materials and functions within a single package mean for the future of HPC?
really? the font size seems normal to me. normal for a professional publication at least. I'm on mobile atm and can't inspect, but it seems like 14pt maybe?
When I copy and paste it into a Google doc, it says it's 18pt.
Looks big to me in Safari on an iPad and in Chrome on Windows. Here are three websites (Engadget, NYTimes, and IEEE) at 100% scaling in Chrome for me (all three images are 1200x800):