The German Tank Problem

dooglius · on June 30, 2019

This is only the toy version of the actual problems solved by the Allies, which were more nuanced, and involved reasoning about the tank manufacturing pipeline. The write-up [0] doesn't go into the math but makes an interesting read.

[0] https://sci-hub.tw/10.2307/2280189

cortesoft · on June 30, 2019

Yeah, I can't imagine the assumption that tanks captured were "randomly uniformly distributed" is a good one. I can imagine all sorts of reasons that wouldn't be the case.

Causality1 · on June 30, 2019

How accurate did the allies' model turn out to be when compared to the real number?

dooglius · on June 30, 2019

Quite well, see pg. 86 for a plot of all predictions

walrus01 · on June 30, 2019

I recall something about targeted bombing of ball bearing factories.

jabl · on June 30, 2019

There were the infamous raids on the ball bearing factories in Schweinfurt. (At the time the allies didn't have escort fighters with sufficient range, and the bombers suffered heavily.)

But AFAIK those targets were selected based on pre-war "traditional" intelligence what the likely bottleneck resources would be, not statistical analysis of captured equipment.

sandworm101 · on July 1, 2019

Except that the data on the number of tanks, thier increased weight and number of wheels, pointed to a likely increase in the need for bearings. (And all the other wheeled non-tank things too.)

tzury · on June 30, 2019

More about Frequentist and Bayesian analysis can be found here:

https://en.wikipedia.org/wiki/German_tank_problem

Matter of fact...

    According to conventional Allied intelligence estimates, the Germans 
    were producing around 1,400 tanks a month between June 1940 and September 1942. 

    Applying the formula below to the serial numbers of captured tanks, the number 
    was calculated to be 246 a month. After the war, captured German production 
    figures from the ministry of Albert Speer showed the actual number to be 245.

debbiedowner · on July 1, 2019

I was actually surprised that MVUEs and the fellow point estimators are called frequentist (though it makes sense). In school we always referred to them as non-Bayesian, at the same time frequentist always seemed like a dirty word to us students so maybe that's why

jackfoxy · on June 30, 2019

How ironic that the nation that led the world in the frontiers of maths in the 19th century completely missed the boat in the applied math of signals intelligence in WWII. I'm referring to the tank serial numbers and the lack of care in Enigma codes, except by the Kriegsmarine, but even they eventually lost a code book to the allies, which they apparently considered an impossibility.

PhasmaFelis · on June 30, 2019

They had a lot of opsec problems. There's a great story about how using "cool" codenames instead of random ones bit them in the ass.

It's nearly impossible for a bomber to navigate long distances in the dark over a blacked-out country, so the Germans came up with a radio navigation system involving beams transmitted from the mainland to intersect over the target, which the British figured out how to jam; the Germans came up with another nav system, and the Brits eventually jammed that one too.

The British knew the Germans would be trying to find yet another way. They'd learned from Enigma decrypts about a new device called Wotan. One researcher looked up the word, learned that it was the name of a one-eyed god, and concluded that the new system would use a single transmitter with a rangefinding transponder aboard the bomber, instead of multiple beams like the previous ones. Starting from there, they had a countermeasure online and ready to go before the Germans even deployed Wotan. When the Nazis realized they'd been outmaneuvered from the start, they gave up on radio-guided bombing completely, at least against Britain.

noir_lord · on June 30, 2019

We also caught every single German spy and turned them all (iirc it was all) but didn’t know we’d got them all till after the wars conclusion.

British intelligence was pretty impressive during WWII.

neaden · on July 1, 2019

To be fair, some of the German spies were pretty bad at their jobs. Josef Jakobs stands out as a man who was just not a good spy: https://en.wikipedia.org/wiki/Josef_Jakobs

chiph · on June 30, 2019

Dr. R.V. Jones had significant involvement in the War of the Beams, and after the war wrote a book about British Scientific Intelligence efforts during the war.

https://www.amazon.com/Most-Secret-Penguin-World-Collection-...

hef19898 · on July 1, 2019

Slightly off topic, but that mindset, ignoring expertise in field that could help in another, is still quite common in Germany if you ask me.

And yes, the military intelligence of the Germans sucked in WW2. Didn't help neither that the culture, military and political, was highly idiological. When truth cannot be spoken and power won't listen facts are ignored. It cannot be what's not allowed to be. And then reality bites your ass ultimately.

anoncake · on June 30, 2019

It wasn't all incompetence. The head of the Abwehr was part of the resistance.

https://en.wikipedia.org/wiki/Wilhelm_Canaris

hef19898 · on July 1, 2019

True. Still, from what I know, he saw himself as a patriot. Which was the reason why he opposed the Nazis but the reason why he didn't defect or betray the Germans.

anoncake · on July 1, 2019

If by "the Germans" you mean the German government, he did betray them, at least when it to war crimes and the Holocaust. I don't know if he also sabotaged the war but that would also be consistent with patriotism. A patriot loves his country, that doesn't mean they don't care about others at all. That would be some sort of combination of patriotism and psychopathy.

A patriot could also have decide d to sabotage the war effort to end the war faster, to get rid of Hitler or to somewhat save the reputation of Germany.

He didn't openly defect but that doesn't mean much for a spy.

hef19898 · on July 1, 2019

From what I remember of the documentation I saw about him back the day he never actively sabotaged the war effort or did things that sis put German troops in danger. He opposed the Nazi regime.

Yes, I agree it is quite a feat of mental acrobatics. My impression was that he somehow seperated Nazis and the German nation. And that the war was a German and not really a Nazi thing. Maybe he just didn't want to see that Germany and the Nazis were the same thing at the time, maybe he also wanted a round two after WW1 or maybe he wasn't able to shake decades of upbringing and training.

Either way, he was one of the few "good" Germans, even if not on Schindler levels, and definitely a very interesting person. Just look up his WW1 adventures.

Notable, so, is that even in WW1 and after he was not necessarily a trained spy intel guy, AFAIK.

anoncake · on July 1, 2019

I didn't read much about Canaris but I don't think his stance required much mental acrobatics. Considering a nation and its government somewhat separate entities is quite normal.[1] Of course many Germans used this as an excuse after the war but this does not apply to Canaris.

Considering wars of aggression acceptable wasn't all that unusual either.

Sabotaging the war effort would have meant helping the Allies fight Germany. Sabotaging war crimes and the Holocaust meant trying to stop Germany from something evil and stupid (at least if he considered German Jews German). While there were reasons for a patriot to sabotage the war effort, only sabotaging the crimes was also a consistent position.

[1] Especially when it isn't democratically elected. The last multi-party elections in 1933 weren't free. The communist Reichstag members were jailed, many others were intimidated to make them support the enabling act.

hef19898 · on July 2, 2019

Summing it up pretty well. And yes, that is how I understood Canaris. And yes, from his perspective it seems a logical stance to take. Hindsight makes a lot of things easier, doesn't it? Also true that he saw what the Nazis really were and did something about itt. A rare feat during these days.

bnegreve · on July 1, 2019

I suspect there is a strong winner bias here. Success stories of allies tends to be reported more often.

michaelt · on July 1, 2019

If you were a Nazi codebreaker whose successes in the war were classified, would you publish detailed memoirs? Or would you destroy the evidence, which was probably what your orders said to do anyway?

tsss · on June 30, 2019

It certainly didn't help that they killed all the academics and free-thinkers.

dmos62 · on June 30, 2019

Quote? I'm vague on what went on in Germany in the first half of the century.

baobrien · on June 30, 2019

In 1933, the Nazi regime passed a law[1] banning anyone they considered Jewish from holding any civil service job, including positions at universities. A large proportion of German academia was considered Jewish.

[1] - https://en.wikipedia.org/wiki/Law_for_the_Restoration_of_the...

laGrenouille · on June 30, 2019

Interesting article, though I think it incorrectly leaves the reader thinking that there is some interesting informating hidden in the average spacing of the numbers. In fact, all you need to know is that maximum observation and the number of observations. Once you simplify the average spacing goes away.

If M is the maximum serial number of N is the total number of observations, using the formula in the post:

    M + (avg. spacing) = M + M / N - 1 = (N + 1) / N * M

To me that gives a more clear picture of what the unbiased estimator is doing: inflate the maximum value by a factor that limits towards one as the sample size grows.

comicjk · on July 1, 2019

If you just assume that the sample mean = the population mean, then you get the right answer, at least for this example. I don't see why the article fools around with the maximum at all - isn't the maximum a much more noisy statistic than the mean?

skosch · on July 1, 2019

The range matters – had they found 10 serial numbers between 100000 and 101000, would the mean still be a meaningful estimate of the production rate? In this case, the author just tacitly assumes the minimum to be zero.

ptero · on June 30, 2019

To be the devils advocate: what you say is true if you know the distribution. If spacing looks weird (e.g. clustered) it might indicate that the number is, for example a pairing of model and serial numbers, etc.

popotamonga · on July 1, 2019

Distribution of manufacturing date or distribution of rate of tank capture?

Or does it make a difference?

mhh__ · on June 30, 2019

For anyone else interested in WW2 reverse engineering and design etc., https://www.youtube.com/watch?v=GJCF-Ufapu8 "The secret war" is a huge documentary covering british efforts to counter german electronic warfare and V-weapons.

spectramax · on June 30, 2019

Why didn't they use randomized and scrambled serial numbers? Sort of like what Amazon does to their order numbers. I know it can still be cracked but serially numbering military equipment is not very smart. I was setting up a Shopify store the other day and it doesn't allow for a lookup table to be used for order numbers. I don't want competitors to know that I've sold so many X items. Same thing with Squarespace and Square e-commerce stores. It blows my mind that a multi-billion dollar ecom giant has not implemented despite of forum posts and requests from users.

Nitramp · on June 30, 2019

World War II was (at least one of) the first industrialized war. So the whole situation was genuinely novel to most participants.

Additionally, the German army command didn't think that way. Where the US relied on overpowering by materiel dominance, and the Soviets fought and won through unimaginable human sacrifice, the considerable initial success of the German army was based on better, smarter tactics, individual leadership, bravery, ruthlessness, etc. The leadership assumed they'd be able to win the war that way, even when the war had turned into a much more industrial operation.

You can see that in operations such as the Battle of the Bulge, the war in Normandy, and most importantly in the the Russian campaign.

This is of course over-generalizing, but I believe the general mode of thinking was there, and that'd explain the lack of attention on such details.

greedo · on June 30, 2019

Don't underestimate Soviet industrial capabilities during the war. The Soviets produced over 58K T-34 type tanks compared to Germany producing 37K (PzIII through Pz6).

Nitramp · on June 30, 2019

You're correct, the Soviets outproduced Germany as well as being willing to run much higher losses. E.g. in the Battle of Kursk, the Soviets outnumbered the Germans by x2 in just about everything (tanks, planes, men). They won, but lost x2-x4 in tanks, planes, men.

In either case, terrible times that we should be thankful not to have been born into.

ptaipale · on June 30, 2019

Though a substantial part of the steel used to make those tanks came from USA. As did much of the trucks and other equipment used by Soviets.

I recall seeing actual numbers (proportion of American steel in Soviet production), but couldn't find them, does someone have a source?

Anyway, e.g. this article talks about it:

https://www.rbth.com/defence/2016/03/14/lend-lease-how-ameri...

adventured · on July 1, 2019

At the point of German-Soviet conflict, the US had about six times the steel production of the USSR, six times the iron production, eight times the oil production, and three to four times the coal production. It definitely wouldn't be surprising if US steel was a large share of Soviet figures.

The scale of resources delivered to prop up the Soviets was extraordinary, including what the British sent them.

In just 3 1/2 years the British sent them[1]:

3,000+ Hurricanes aircraft, 4,000+ other aircraft, 27 naval vessels, 5,218 tanks, 5,000+ anti-tank guns, 4,020 ambulances and trucks, 323 machinery trucks, 1,212 Universal Carriers and Loyd Carriers, 1,721 motorcycles, £1.15bn worth of aircraft engines, 1,474 radar sets, 4,338 radio sets, 600 naval radar and sonar sets

And the US sent them:

427,284 trucks, 13,303 combat vehicles, 35,170 motorcycles, 2,328 ordnance service vehicles, 2,670,371 tons of petroleum products (gasoline and oil) or 57.8 percent of the High-octane aviation fuel,[32] 4,478,116 tons of foodstuffs (canned meats, sugar, flour, salt, etc.), 1,911 steam locomotives, 66 Diesel locomotives, 9,920 flat cars, 1,000 dump cars, 120 tank cars, and 35 heavy machinery cars. Provided ordnance goods (ammunition, artillery shells, mines, assorted explosives) amounted to 53 percent of total domestic production

Beyond Russia also notes:

"The USSR received a total of 44,000 American jeeps, 375,883 cargo trucks, 8,071 tractors and 12,700 tanks. Additionally, 1,541,590 blankets, 331,066 liters of alcohol, 15,417,000 pairs of army boots, 106,893 tons of cotton, 2,670,000 tons of petroleum products and 4,478,000 tons of food supplies"

The notion of sending a country 375,000 trucks and 1,900 locomotives in just three years, is incredible to think of today.

[1] https://en.wikipedia.org/wiki/Lend-Lease

olegious · on July 1, 2019

The Soviets repaid the debt with unimaginable human losses: 25-30 MILLION war dead.

80 percent of all German military casualties occurred on the Eastern Front.

Just think about those numbers.

neaden · on July 1, 2019

While the Soviet Union lost a huge number of people, the 25-30 million dead is an inflated estimate because it counts people living in areas that were conquered by the Soviet Union. Polish people by and large do not appreciate being lumped in with the people who had invaded them the year before.

ptaipale · on July 1, 2019

And quite many of the dead of course died with a bullet from behind, or in slave labor camps run by Soviets themselves.

gumby · on June 30, 2019

First, they probably didn't consider that serial numbers might be an information leak.

Second, all calculations were done by hand in those days (and documents that weren't printed in bulk had tp be retyped by hand) so sequential numbers were not only easier to issue but to track (e.g. if you have a production problem you can say "let's check all tanks with S/Ns between A and B" rather than having to maintain a list mapping production dates to serial numbers that might be in a file cabinet somewhere distant from where you are.

dragonwriter · on June 30, 2019

> Why didn't they use randomized and scrambled serial numbers?

Because there weren't well-known examples of the risk of not doing that, and not doing it is the easy and obvious thing if you have no clear reason to do it, and makes lots of things you might use those numbers for yourself easier (and if it wasn't for your own use, you wouldn't issue the numbers at all.)

jcranmer · on June 30, 2019

Because supply chain logistics. The Germans in WWII were world leaders in manufacturing (perhaps bested only by the US), and one of the elements of that manufacturing quality is the ability to trace individual parts back to the exact manufacturing batch to figure out why particular batches go wrong.

The Germans did (eventually) make some effort to obscure details of their supply chain--they forced manufacturers to use three-letter codes instead of their normal trademarks--but that still suffered from poor operational security which allowed the codes to be quickly matched up to manufacturers. It didn't help that the British analysts meticulously kept track of everything, allowing them to identify the manufacturer of one unlabelled part by the inspector's number.

lostlogin · on July 1, 2019

They might have had good equipment sometimes, but they had nowhere near enough of it. They were outproduced by Britain “alone” (counting the colonies) in most areas, most the time, and often by considerable margins.

The German army was not particularly mechanised or well equipped as a whole, relying on a lot of horse draw vehicles for the entire war.

When you look at the war from a manufacturing perspective, the question is more about how Germany survived for so long again it’s such huge manufacturing nations. For a seemingly dry subject, David Edgerton’s book on this is very readable. https://www.theguardian.com/books/2011/mar/27/britains-war-m...

jcranmer · on July 1, 2019

I should say that Germany was a leader in manufacturing quality, not so much quantity, although I believe they did comparatively well there considering that they were facing down the gargantuan industrialized economies of the US, UK, and USSR.

It's also worth point out that Germany suffered from a severe lack of resources, particularly oil and rubber (although everyone in WWII was short on rubber). While they did have synthetic fuel and rubber plants that they made excellent use of (part of the reason for German superiority in the chemical industry was their need for it), these synthetic routes are not really sustainable for a massive war effort, and Germany ran out of their stockpiled reserves by 1942. Case Blue, the second offensive in the USSR, had obtaining the Baku oil fields as its main objective.

hef19898 · on July 1, 2019

Even quality I'm not so sure. Because quality largely depends on the intended use. And if that use is to shoot at things an be shot at in return in abysmal operating conditions the traditional German quality standard is just over the top and unsuited. And while quality beets quantity on a per unit basis, globally there is inly so much quantity difference quality can make up. Something Germans seem incapable of culturally understanding even today the concept of "good enough" is beyond the understanding for a lot of my fellow country men.

greedo · on July 1, 2019

German quality was largely a myth. Examine tanks; they need to be survivable, reliable, and potent. Without all three, they're useless. German tanks were rarely the best on the battlefield when measured this way.

Aircraft (especially fighters) have the same three requirements: until the ME-262 was deployed, Germany was only on par with the allies.

Artillery? Other than the feared 88mm, its artillery was clearly second fiddle to the Allies.

What enabled Germany to have any success was the initial training of its NCOs and officer corp. This allowed them to exploit opportunities faster than their opponents (think of Boyd's OODA cycle).

But all the oft-touted German "super-weapons" were usually over-engineered stuff that didn't work reliably. Note that the ME-109 flew until the end of the war since it was reliable, and effective against bombers until they were escorted by Mustangs and Thunderbolts.

Guthur · on July 1, 2019

It wasn't about lack of industrial know how, it was the policy of Autarky. The Nazi regime literally starved itself of essential manufacturing and war time resources.

They knew for example that they only had enough oil, with the limited mechanized forces they had, for operational effectiveness until autumn 1941. After that Germany would never again have the resources for grand operations on the strategic level of operation barbarossa. They needed to get to the oil fields of the Caucasus region which they did not even get close to due to some screwed up leadership decisions.

Fall blau was a pale comparison to the earlier operations and Germany's logistics system and resources were beyond tipping point.

And then when it came to Kursk all they could really manage was a single limited scope battle.

mattkrause · on June 30, 2019

They did, sort of, starting in the 1940s. The tank make/model were replaced by arbitrary codes, but it was done sloppily and many of the tanks could be re-identified. (See page 80 of the paper @dooglius posted above).

Still, the idea that this could leak valuable information is probably more obvious in hindsight, and sequential serial numbers do have some upsides. If there's a design flaw in one version of the gearboxes, you can just pull everything with a serial number between XXXX and YYYY. With randomized numbers, you'd have to maintain some master database, which is a lot harder when most of logging is done with pen-and-ink ledgers, carbon copies, and maybe punchcards.

tyingq · on June 30, 2019

They could just randomly skip some numbers. That would maintain the XXX to YYY advantage.

Johnny555 · on June 30, 2019

Does that resolve the problem, or just add another variable to the analysis?

mattkrause · on June 30, 2019

Naively, it would inflate the absolute values, but you might be able to correct for it in various ways. Still, knowing that tank production fell 50% is probably useful, even if you don't know what exactly it fell from....

FrojoS · on July 1, 2019

You could increase the average spacing when production slows down.

dooglius · on June 30, 2019

It's interesting to see other commenters saying that manipulating IDs wouldn't occur to the Germans, it reminded me of an interesting anecdote from Hitler's rise to power: "It was in January 1920 when a numeration was issued for the first time and listed in alphabetical order Hitler received the number 555. In reality, he had been the 55th member, but the counting started at the number 501 in order to make the party appear larger." [0]

I think it's more likely that many were aware of the security issues, but it wasn't worth the coordination of coming up with a scheme, giving it to all spare parts suppliers in a secure way, etc. potentially slowing down the war effort. I bet the Allies used a lot of serial numbers too, despite this work.

[0] https://en.wikipedia.org/wiki/German_Workers%27_Party#Adolf_...

terramex · on June 30, 2019

>Why didn't they use randomized and scrambled serial numbers?

Because it happened 80 years ago, when German army (or any other) did not understand statistics as well as they do today. It was a groundbreaking achievement by allies.

spectramax · on June 30, 2019

They did know about encryption and developed the Enigma machine.

I don't think you need deep statistics knowledge to know that if the enemy captured Serial # 0020, 0120, 0439, 1293 and 1356; they would at least have some hint that the lower bound is 1356 tanks.

mattkrause · on June 30, 2019

Sure, but as the article points out, that's not a great bound—and you really care about the expected number of tanks (or the upper bound), rather than the lower. The Allies had drastically overestimated this, by about fivefold.

Furthermore, the interesting part isn't just the number of tanks or planes—though that has obvious strategic uses too— but the insight it gives you into their industrial production. What's the limiting factor in getting a tank to the front--machining the parts? assembly? fuel? Which of our raids affected that?

notinversed · on June 30, 2019

Because they were Germans.

srean · on June 30, 2019

The job interview version: If you are being interviewed for a position by engineers who have their employee ids (serially allocated) on their badge find the number of employees from those ids assuming all engineers are equally likely to be on the panel of 8.

chiph · on June 30, 2019

I have looked at my payroll check numbers from contracting firms to see how they're doing as a business. If the interval between check numbers drops in a month, I have a good idea that there aren't as many people working there anymore.

feintruled · on July 1, 2019

I worked for a big company and we used to put bug numbers in our change releases. We were told to stop doing this, as some customers would see that their bug would appear to have been given a lower priority when they saw lower numbers coming in first.

carlmr · on June 30, 2019

I would guess that the likelihood of older employees should be higher. Although I've never seen a panel of 8 at a job interview.

rootw0rm · on July 1, 2019

when i first started a grey-market research chemical company some years ago, i added like 31 or something to each invoice number to make it seem like i did more business.

breakingcups · on July 1, 2019

Invoice numbers have to be sequential where I live.

kgwgk · on July 2, 2019

But, in some places at least, you can choose to start at 32.

HeWhoLurksLate · on June 30, 2019

I read your comment before I read the article, and my head started spinning really hard.

Congratulations on your nerd snipe!

pieterr · on June 30, 2019

wcoenen · on June 30, 2019