He's saying there should have been a control group. We can't directly compare startups that YC funded with startups of equal quality that YC did not fund, because the latter do not exist (assuming YC is correctly selecting for quality). To isolate YC's influence, you'd need YC to pick a set twice as large and fund only half of them at random.
/Insert Einstein's quote about god being dicey
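
For what it's worth, here's a toy simulation of that design. Everything in it is made up for illustration: the normal "quality" distribution, the 0.5 funding effect, the noise level, and the assumption that the selector ranks applicants perfectly. It's a sketch of the reasoning, not a model of YC. It compares two estimates of the funding effect: funded vs. rejected (no real control group) and funded vs. the randomly unfunded half of the picks.

```python
import random
import statistics


def simulate(n_applicants=20000, n_finalists=2000, true_effect=0.5, seed=0):
    """Toy model: return (naive_estimate, randomized_estimate) of the funding effect."""
    rng = random.Random(seed)

    # Hypothetical latent quality of each applicant.
    quality = [rng.gauss(0, 1) for _ in range(n_applicants)]

    # Assumption: the selector ranks perfectly and picks the top n_finalists,
    # i.e. a set twice as large as the number it will actually fund.
    ranked = sorted(quality, reverse=True)
    finalists, rejected = ranked[:n_finalists], ranked[n_finalists:]

    # Fund only half of the picked set, chosen at random.
    rng.shuffle(finalists)
    half = n_finalists // 2
    funded, control = finalists[:half], finalists[half:]

    def outcome(q, is_funded):
        # Outcome = quality + funding effect (if funded) + noise.
        return q + (true_effect if is_funded else 0.0) + rng.gauss(0, 1)

    funded_out = [outcome(q, True) for q in funded]
    control_out = [outcome(q, False) for q in control]
    rejected_out = [outcome(q, False) for q in rejected]

    # Naive: funded vs. everyone who was not picked (confounded by quality).
    naive = statistics.mean(funded_out) - statistics.mean(rejected_out)
    # Randomized: funded vs. equally good picks that were left unfunded.
    randomized = statistics.mean(funded_out) - statistics.mean(control_out)
    return naive, randomized


if __name__ == "__main__":
    naive, randomized = simulate()
    print(f"naive estimate:      {naive:.2f}")
    print(f"randomized estimate: {randomized:.2f}")
```

With these made-up parameters the naive estimate should come out several times larger than the true 0.5, because it mixes the funding effect with the quality gap, while the randomized estimate should land close to 0.5.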