>Domain knowledge, while useful is not a necessity - I'm fine with contacting a specific domain expert for help and even pay for his or her services(with my own money at that),
Nah, it's the other way around. Specifically in this domain, innovation is definitely driven by domain experts (researchers, medical equipment manufacturers), and they contract out or hire technical expertise as they need it.
>To put it as question: do I really need to explain how open source works in theory and practice on a place like hackernews?
Your definition of open source is much too narrow and does not include what these researchers meant by it.
>Especially given that most of the world's infrastructure in practice runs thanks to the collaboration of millions who have built most of what we use in their "spare time" as you call it.
Certainly not in medicine.
>Same applies to DNN's(if not more so) - take any large DNN with the papers, data set and code to the author and ask them to give an explanation as to why it works as well as it does while it performs terribly on a different data set, even a similar one. "Well yeah, it's curve fitting which works here but doesn't work there". Why? ¯\_(ツ)_/¯
You've identified the precise reason why all this "machine learning" stuff has been such a dud when it comes to medicine. That is not good enough and never will be, particularly the "works so well on the dataset you overfitted on while performing so terribly on other datasets".
>My rant concerns a different problem - if you have such data and you want to share it, just go ahead and do. 1 of every 10000 might do something useful with it but we aren't talking about nuclear experiments where something can blow up, are we? Worst case scenario someone's cpu or gpu might overheat, big deal. Just ditch the entire bureaucracy crap, we have enough of that as it is in our daily lives.
They spell out the reasons for requiring a data sharing agreement in the data sharing agreement itself. They want you to cite it, they don't want you to sell it, they want you to confirm that you understand you're getting a dataset with no warranties, FDA approvals, etc.
And as I said before, the chances of you doing something useful with it may be 1/10000, but the chances of you doing something useful with it while not being motivated enough to sign a simple agreement and wait for the download link is essentially 0.
Nah, it's the other way around. Specifically in this domain, innovation is definitely driven by domain experts (researchers, medical equipment manufacturers), and they contract out or hire technical expertise as they need it.
>To put it as question: do I really need to explain how open source works in theory and practice on a place like hackernews?
Your definition of open source is much too narrow and does not include what these researchers meant by it.
>Especially given that most of the world's infrastructure in practice runs thanks to the collaboration of millions who have built most of what we use in their "spare time" as you call it.
Certainly not in medicine.
>Same applies to DNN's(if not more so) - take any large DNN with the papers, data set and code to the author and ask them to give an explanation as to why it works as well as it does while it performs terribly on a different data set, even a similar one. "Well yeah, it's curve fitting which works here but doesn't work there". Why? ¯\_(ツ)_/¯
You've identified the precise reason why all this "machine learning" stuff has been such a dud when it comes to medicine. That is not good enough and never will be, particularly the "works so well on the dataset you overfitted on while performing so terribly on other datasets".
>My rant concerns a different problem - if you have such data and you want to share it, just go ahead and do. 1 of every 10000 might do something useful with it but we aren't talking about nuclear experiments where something can blow up, are we? Worst case scenario someone's cpu or gpu might overheat, big deal. Just ditch the entire bureaucracy crap, we have enough of that as it is in our daily lives.
They spell out the reasons for requiring a data sharing agreement in the data sharing agreement itself. They want you to cite it, they don't want you to sell it, they want you to confirm that you understand you're getting a dataset with no warranties, FDA approvals, etc.
And as I said before, the chances of you doing something useful with it may be 1/10000, but the chances of you doing something useful with it while not being motivated enough to sign a simple agreement and wait for the download link is essentially 0.