It's deep in the 'just statistics' territory of ML/AI by the sounds of it, yeah. This is just a classification problem (cancer, not cancer) with a bunch of available patient data spanning different dimensions, four of them known risk factors, and regression analysis to find others (well, re-discovering/confirming those four too) that correlate.
I imagine at least the title here is a groan for the paper authors.
I imagine at least the title here is a groan for the paper authors.