Hacker News
Deep Learning with Electronic Health Record (EHR) Systems (practicalai.me)
97 points by practicalAI on Sept 26, 2019 | 20 comments



That would be a terrible idea.

Here is a simple example why:

Finasteride is a compound that is used in two drugs. Proscar is used for prostate enlargement. It is old, out-of-patent, and has cheap generics. Propecia is used for hair loss. It is newer and was (at the time) very expensive. The only difference is that Propecia is a lower-dose formulation.

What people did was to ask their doctors to prescribe generic Proscar, and then break the pills up to take for hair loss. Doctors would then justify the prescription by "diagnosing" enlarged prostate. This would enter the patient's health records.

If you apply deep learning without being aware of this "trick", you would learn that a lot of young men have enlarged prostates, and that Proscar is an effective, well-tolerated treatment for it.

Health records are often political-economic documents rather than medical.


This is a really dumb way for doctors to prescribe finasteride, by the way.

If we assume that the reason patients get finasteride this way is to get insurance to cover it, and the doctor has no evidence to suggest the patient actually has an enlarged prostate, and further doesn't document a physical exam confirming the enlarged prostate, it's basically insurance fraud.

If this is simply a way to get the patient pills via prescription regardless of the insurance coverage (you said yourself that it's cheap, and hair loss is cosmetic and therefore shouldn't even be covered by insurance), this is even more stupid, because doctors are well within their rights to prescribe or administer medications that are FDA approved for one condition "off label".


Different countries have different health systems and doctors face different constraints. This was in Israel in the early 2000s.

In this case it was done just to save the doctor the hassle of arguing with the higher-ups. It's been a while, but if I recall correctly, Merck was aggressively enforcing its patent rights. It even raided compounding pharmacies that sold lower-dose formulations of finasteride instead of the brand name Propecia.

https://news.walla.co.il/item/932056 (Google Translate does an almost reasonable job)


I think this article actually agrees with you. The very first NOTE is this:

""" Note: Be cautious about using data that was primarily created for insurance purposes. Often, it's not truly reflective of patient's condition but rather encompassing for billing / profit. Luckily, there are clinical reports, like radiology, diagnostic imaging, pathology reports, etc., that are intended for physician use and are more reflective of true patient conditions. Unfortunately, most of this data is not readily available in APIs because it's largely unstructured. This is a ripe space for ML to take raw, unstructured data and produce structured, computable data. """


Insurance is just one part of the problem.

Large practices have treatment standards on which physicians are evaluated. Reporting side-effects might be politically inconvenient in some cases. Medicine is also, like other human endeavors, subject to fashions and fads.

At best, applying machine learning to health records will generate a hypothesis that must be checked in a properly controlled trial.


So it’s not a “terrible idea”?


Many organizations have been attempting to use AI to extract coded concepts for years. Even the best systems have a relatively high error rate, so if you want to use the output for anything important you still need a trained human medical coder to fix the errors. But that AI rough draft does have value since correcting its errors is still faster than entering all the codes manually.
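The human-in-the-loop workflow described above can be sketched as a simple confidence triage. This is a hypothetical illustration, not any vendor's actual system: the function name, the threshold, and the example ICD-10 codes are all assumptions.

```python
# Hypothetical sketch of an AI "rough draft" coding workflow: the model
# proposes (code, confidence) pairs, confident suggestions pass through,
# and everything else is queued for a trained human medical coder.
# The 0.9 cutoff is an illustrative assumption, not a real-world value.

REVIEW_THRESHOLD = 0.9

def triage_codes(ai_suggestions):
    """Split AI-suggested (code, confidence) pairs into auto-accepted
    drafts and a queue for human review."""
    accepted, needs_review = [], []
    for code, confidence in ai_suggestions:
        if confidence >= REVIEW_THRESHOLD:
            accepted.append(code)
        else:
            needs_review.append(code)
    return accepted, needs_review

# Two confident suggestions pass through; the uncertain one goes to a coder.
accepted, needs_review = triage_codes(
    [("I10", 0.97), ("E11.9", 0.95), ("N40.0", 0.41)]
)
```

The value claimed in the comment comes from the split itself: correcting the `needs_review` queue is faster than entering every code manually from scratch.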


Doctors don't have to make fake diagnoses to justify off-label prescriptions. It would place them at liability, and it's not required. Do you have any reasonable citations to support this?


Agreed, would love to see some cites.

I work with insurance claims and post-marketing adverse event reporting data for pharma drugs, and I don't recall seeing anything resembling what OP described when looking at off-label usage, but it's been a while. (note: this is not EHR data, but it relates)

Anecdotally I've had a few off label RX's that weren't attached to any DX code in my EHR, same with my wife.

* edit: said 'you' vs. OP


OK, I looked into it more. It's more about reducing time spent by doctors appealing health insurance non-payment (i.e., if the insurance doesn't allow prescribing medicine X for insurance code Y, the doctor would have to write up some appeals form).


>Health records are often political-economic documents rather than medical.

Wow!!! Great observation.

One thing, though: if ML/AI could detect when records are political-economic vs. when they are medical, then maybe you could still apply ML/AI techniques to the medical ones.


There is no ultimate source of truth so I don't see how AI/ML could make that distinction.


Yeah, I agree. This is another example of our own biases/errors being injected into the data, thus poisoning any models we try to build with it. On the surface it seems unlikely there's much of a way to compensate for that, and if there was, it would probably have to be tailored to every specific type of bias/error.


Wrote this last year before jumping back into clinical ML, but never got around to sharing it. Added some 2019 updates (clinical BERT and an approach for industry applications).

I updated the appendix with a few papers from 2019 but it felt like there were 10X more papers compared to 2018 (which is fantastic!). But, instead of making the appendix even longer, I highly recommend just following http://www.arxiv-sanity.com/search?q=health to stay up-to-date.

Even if you’re not in the health space, you’ll find ingenious interpretability techniques and tips leveraged by researchers out of the necessity of working in the clinical space. I conclude with a realist note on the challenges that lie ahead for safely transitioning research to the clinical setting.


Great work! I've got a rather wild ML EHR story to share.

So I wrote my thesis on the design of EHR systems (http://barnett.surge.sh/), and I interviewed this guy who said he was working on a crazy revolutionary ML-powered EHR system. It apparently had features such as voice recognition for writing notes instead of typing, but it also featured taking unstructured medical notes and, with the great AI, structuring them. So suddenly large-scale medical record analysis would be easy - a rather incredible idea. However, it sounded like they were at a very early stage and had nothing to show - so I thought 'yeah, good luck getting that working' and trucked along with the thesis.

A few months after finishing my thesis, a friend of mine sends me this news article (https://thespinoff.co.nz/the-best-of/31-12-2018/summer-reiss...).

Turns out a journalist had heard of this medical AI, and dug way deeper. This guy (Albi) had convinced the GP I talked to that he had a working medical AI system. He had actually gone so far as to have the GP email 'the ai' and have it reply. However the whole thing was a fucking sham - it didn't exist, and the person replying was honestly probably just Albi. They were trying to raise funding to get it further, based on this claim of a 'functional ai'.

I don't think the GP I interviewed was in on the con; I think he was being taken for a ride. However, once the article was published, there was heaps of attention, it all kind of fizzled out, and Albi recently went bankrupt.

But, I'm glad to see somebody is working on this problem - who isn't a conman haha.


Every major EHR vendor and some other related organizations have been working on that problem for years. It's seen as a "holy grail" of EHR functionality.


Aren't there privacy concerns?


Good question. I imagine you would train models using the deidentified datasets cited on the page and design your prediction pipeline in the same vein as any other HIPAA-data related application.
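As a toy illustration of the "deidentify before training" step mentioned above (field names are made up; HIPAA's Safe Harbor method enumerates 18 identifier categories, and a real pipeline would also handle free text, dates, and rare values):

```python
# Minimal sketch, assuming flat dict-shaped records with invented field
# names. Drops direct identifiers and coarsens date of birth to a year
# before the record ever reaches a training pipeline.

DIRECT_IDENTIFIERS = {"name", "address", "phone", "ssn", "email", "mrn"}

def deidentify(record):
    """Return a copy of the record with direct identifiers removed and
    the date of birth reduced to a birth year."""
    clean = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}
    if "dob" in clean:
        clean["birth_year"] = clean.pop("dob")[:4]  # assumes ISO "YYYY-MM-DD"
    return clean

record = {"name": "Jane Doe", "mrn": "12345", "dob": "1980-07-01",
          "dx_codes": ["N40.0"], "rx": ["finasteride 5mg"]}
clean = deidentify(record)
```

This only removes direct identifiers; protecting against re-identification from quasi-identifiers (rare diagnoses, zip codes, admission dates) is a much harder problem and the reason curated deidentified datasets exist.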


What's the goal of this project? What are you actually trying to learn from the EHR records?


I think it's meant to be a good review of the field and what's possible. They're also covering what you shouldn't do with EHR data and some of the dangers.



