You haven’t addressed the chief, highest rated comment in the original thread: what about the fact this dataset was generated by scraping LinkedIn in violation of that site’s ToS, and the generates a liability?
To be frank, we and others interpreted that entire thread as resolved and didn't need addressing. Scraping is clearly a grey area of the law. This is all public data.
Did you read your own links? The HiQ labs decision in favor of scraping was vacated by the Supreme Court and then settled. Not a clear cut case law, but definitely ended on LinkedIn’s terms.