Show HN: Using GPT-3 and Whisper to save doctors’ time

117 点作者 ar7hur大约 2 年前

Hey HN,We're Alex, Martin and Laurent. We previously founded Wit.ai (W14), which we sold to Facebook in 2015. Since 2019, we've been working on Nabla (<a href="https://nabla.com" rel="nofollow">https://nabla.com</a>), an intelligent assistant for health practitioners.When GPT-3 was released in 2020, we investigated it's usage in a medical context[0], to mixed results.Since then we’ve kept exploring opportunities at the intersection of healthcare and AI, and noticed that doctors spend am awful lot of time on medical documentation (writing clinical notes, updating their EHR, etc.).Today, we're releasing Nabla Copilot, a Chrome extension generating clinical notes from video consultations, to address this problem.You can try it out, without installation nor sign up, on our demo page: <a href="https://nabla.com/copilot-demo/" rel="nofollow">https://nabla.com/copilot-demo/</a>Here’s how it works under the hood:- When a doctor starts a video consultation, our Chrome extension auto-starts itself and listens to the active tab as well as the doctor’s microphone.- We then transcribe the consultation using a fine-tuned version of Whisper. We've trained Whisper with tens of thousands of hours of medical consultation and medical terms recordings, and we have now reached an error rate which is 3× lower than Google's Speech-To-Text.- Once we have the transcript, we feed it to a heavily trained GPT-3, which generates a clinical note.- We finally return the clinical note to the doctor through our Chrome extension, the doctor can copy it to their EHR, and send a version to the patient.This allows doctors to be fully focused on their consultation, and saves them a lot time.Next, we want to make this work for in-person consultation.We also want to extract structured data (in the FHIR standard) from the clinical note, and feed it to the doctor’s EHR so that it is automatically added to the patient's record.Happy to further discuss technical details in comments!---[0]: <a href="https://nabla.com/blog/gpt-3/" rel="nofollow">https://nabla.com/blog/gpt-3/</a>

37 条评论

swatcoder大约 2 年前

I don't spend much time worrying about AI ethics, but bringing AI close to patient interactions and record-keeping in healthcare seems grossly premature and irresponsible. There are countless brilliant applications of AI and the last one we need is a mediocre automated transcriptionist that distances a doctor from their responsibility to engage with the patient and can bear no accountability for error.This is a task that perhaps can be supported with AI some day, but there are fields that deserve the application of a mature technology, not the gold rush rush game of integrating today's hottest thing.

评论 #35154143 未加载

评论 #35155841 未加载

评论 #35158110 未加载

JohnFen大约 2 年前

As a patient, this is an application that I would be extremely wary of, personally. I don't want the details of my conversations with my doctor to be sent to a third party in the first place, and I wouldn't trust the results of the transcription to be correct.My doctor could vet it for accuracy, I suppose, but why? He's already putting his notes in my records anyway.

评论 #35157690 未加载

评论 #35156611 未加载

评论 #35163623 未加载

评论 #35162162 未加载

v4dok大约 2 年前

I like the idea, but I would like it even more if it was on-prem. The doctors (at least in EU) will be very wary of having their client meeting essentially recorded by a third party. With this as a cloud SaaS, patient confidentiality is essentially broken since the raw data is available to you while you transcribe it. I understand that you compete with "google speech-to-text" but this is not a feature meant to be used by doctors (even if they "illegally" do).Obviously the business model is harder with on-prem, but cloud-first for doctor notes is in the long run much harder.

评论 #35152627 未加载

评论 #35155715 未加载

priyanmuthu大约 2 年前

My worry is verification of facts. What if the model summarized incorrect facts, and it gets added to the medical history? How can doctors easily verify and fix things?

评论 #35152760 未加载

评论 #35153044 未加载

评论 #35160674 未加载

评论 #35152824 未加载

avgDev大约 2 年前

I feel uncomfortable knowing doctor would use anything like this.How are 2 party consent states handled?Is this HIPPA compliant?

评论 #35157089 未加载

评论 #35152765 未加载

评论 #35152850 未加载

NorthOf33rd大约 2 年前

This is the scariest real life attempt to use the bullshit machine I’ve seen so far.Nothing like “subtly wrong” clinical notes to affect positive medical outcomes. And time pressed medical professionals are certainly well suited to vigilance tasks like correcting machine generated errors. /s.I sincerely hope this never sees the light of day.

评论 #35163712 未加载

urduntupu大约 2 年前

Technically interesting but solving the wrong problem.Doctor's already today spent too little time with their patients to understand diseases holistically enough.Adding technically between these 2 will make treatments in most of the cases worse, not better.

评论 #35152731 未加载

评论 #35157634 未加载

评论 #35152732 未加载

评论 #35152669 未加载

orcajerk大约 2 年前

Last time I went to the doctor, he charged me $300 to google symptoms on a computer. Now you're saying I get to pay $300 for him to ask ChatGPT? No thanks. We all know that is where this is going - doctor decision support system, i.e. the AI has access to information locked behind expensive journals that the patient does not.

评论 #35152701 未加载

评论 #35152800 未加载

评论 #35152656 未加载

评论 #35152633 未加载

评论 #35152812 未加载

评论 #35152762 未加载

评论 #35152702 未加载

评论 #35152727 未加载

评论 #35152726 未加载

D13Fd大约 2 年前

The "feed it to a heavily trained GPT-3" part is backwards. The transcribed notes should be stored and retained. The LLM should be applied later, in real time, when someone is referring back to the notes.That way you still get the benefit of summaries, without polluting the medical records with hallucinated AI nonsense.

Terretta大约 2 年前

Ignore the nay-sayers in this thread. I doubt the ones commenting have spent many days personally observing or personally doing the problem scenario you are solving for.This is spectacular for telemedicine in the post-covid regulation environment.The new model is an onsite medical assistant taking patient vitals and setting up the Zoom, the patient then talks to an APRN (Advanced Practice Registered Nurse) in the patient's field (e.g. subacute care, gerontology, etc.).The APRNs spend 30 minutes reading the file, tele-consult with a patient for a half hour. Medicare pays for the half hour. Then the APRN has to spend ~60 minutes turning that session into the EHR notes. Many just don't, they make an ultra short note, and hope Medicare doesn't challenge it. Patient loses. If too strapped for time, they also may not pre-read, because they only get paid for the video conf time. Patient loses.Downside I see with this is the APRNs doing telemedicine are making about 1/4 of the rate they should be, and can't afford an expensive tool. The telehealth provider is not going to pay for it, they get paid for the 1/2 hour, the wasted time is not out of their pocket, so they won't spend to make it up. This is going to be on the nurse, who likely can't afford it.On the plus side: time saved with this tool would go directly to ability to pre-read, or ability to see a patient an hour instead of one per two or three hours. It would also permit a higher caliber of expertise nurse to do tele-medicine visits in off days, because the time saved could bring comp back more in line with the expertise.Go for it!

908B64B197大约 2 年前

> This allows doctors to be fully focused on their consultationHere's the thing: rigorous medical documentation is not a burden on the practitioner, but rather an integral part of treating someone. You need detailed historical data to diagnose some tricky conditions and treat them correctly.What's the tool doing to prevent a practitioner from simply blindly copy-pasting the output without verifying it?> and saves them a lot time.Which means more billable. Have you considered charging per medical act performed?

评论 #35152996 未加载

norgie大约 2 年前

Is Whisper really up to this? When I've used it, it's...fine. But it also randomly dreams up weird transcriptions that have nothing to do with what's actually being said, and with perfect audio too. Like the audio will be a presentation at a tech conference, and Whisper will predict "check out our site at randomlink.com", when no one said anything close to that.I'd be pretty concerned if my doctor relied on this to be honest.

评论 #35156690 未加载

ysavir大约 2 年前

I'd prefer to have 0 AI interpreting my session with a doctor. Will there be any way for me to identify which physicians use this service so that I can avoid using them?

评论 #35158220 未加载

rubatuga大约 2 年前

I did some work at a startup that tried to solve a similar problem and currently finishing medical school. Nobody here is discussing the paperwork and burnout problem that exists in medicine that will make these tools valuable. Fortunately, language models used for summarization based on a transcript are much less likely to hallucinate compared to upscaling/denoising image algorithms. Recent LMs are scoring top marks on LSAT, so they are maintaining accuracy as they get larger. If doctors can save time, this will be useful for the patient in the long-run.On-prem stuff is more tricky, but useful for privacy. Nvidia is supporting the space with their health AGX series.

评论 #35159007 未加载

hartleybrody大约 2 年前

I used to work in a hospital and one thing that was already tough was when doctors would copy massive (multiple page) notes that were mostly boilerplate, probably stored as some template in the EHR. After seeing the same note a few times, you started to learn which page to scroll to in order to view the relevant section for the patient. I would hope this system only generates clinically-relevant text for that patient, usually only 2-3 paragraphs for a standard consultation.

lukah大约 2 年前

This is really interesting work, congratulations! A lot of negativity in the comments seems to stem from cynicism around doctors and their billing, but in a country like the UK I know removing essentially "admin tasks" from the GP would be very welcome and give time back to the clinician, allowing them to spend more time focussing on the patient and outcomes.

cinntaile大约 2 年前

> We then transcribe the consultation using a fine-tuned version of Whisper. We've trained Whisper with tens of thousands of hours of medical consultation and medical terms recordings, and we have now reached an error rate which is 3× lower than Google's Speech-To-Text.I think the general idea is interesting but this is the wrong benchmark to convince people imo. What is the actual error rate of Google's Speech-to-Text on this your data? How convinced are you that it's properly labeled? That it's 3x lower isn't necessarily impressive. Then a follow up question... What I as a patient would want to know is something else, how does it compare against medical secretaries that usually transcribe documents like these?

CiceroCiceronis大约 2 年前

I'm a medical student and one of my main takeaways from spending time in clinical environments is that a lot of doctors and other staff seem to either not realise or not care about how much time is wasted due to slow, inefficient processes and technology. I was wondering to myself about what it might take to train a model to recognise medical jargon to help doctors rapidly transcribe audio recordings of their consults/spoken recounts of their notes. Not surprised that somebody is looking to make a commercial product of it, there's definitely space for it.

tomasyany大约 2 年前

It looks pretty neat! Just wondering, what other languages are supported? English seems pretty natural as it is where GPT-3 thrives, but have you been able to support other languages as well?

评论 #35152682 未加载

retox大约 2 年前

How tied are you to the name, it's almost Nambla.

评论 #35160690 未加载

gigel82大约 2 年前

What is OpenAI's privacy policy? How do you ensure the transcription doesn't get stored by OpenAI or used in training their future LLM versions?I'd recommend looking at a fully self-hosted LLM alternative or ensuring the GPT-3 endpoint you're using is compliant (I've heard Azure has a GPT-3 API that is supposedly more "compliant").

ar9av大约 2 年前

Maybe the future of the first line of healthcare will start to use AI with the assistance of a specialized nurse to follow your health overtime and only fall back to a real doctor when a real problem occurs.This could elevate a lot of problems we have with our public healthcare(Canada).

tzm大约 2 年前

Love this and have had discussions w/ doctors and practitioners about this exact problem. I think extracting to FHIR is key, along with making it searchable as a longitudinal data store for a more robust personal record. Clear need. Would love to help.

stopachka大约 2 年前

This is really cool! What surprised you guys about fine-tuning whisper + gpt-3?

评论 #35152631 未加载

phren0logy大约 2 年前

Side-note: your email field to sign up for your newsletter is very wonky with both Safari and Firefox (at least for me). It only lets me type one letter at a time, for some reason.

评论 #35153142 未加载

mitchellpkt大约 2 年前

Just read the blog post, very interesting. I thought sections about changing the temperature and repetition penalty were particularly neat, thanks for including examples.

Banjor大约 2 年前

One thing that a lot of folks still don't get is that some problems are bad, while others are not-so bad.

SkyPuncher大约 2 年前

Pretext: My wife is a psychiatrist. Via her, I'm am close friends with many doctors. Also, I know it's super easy to be critical, but I'm going to save you an insane amount of effort and heartache.This is actually a solved problem in medicine, already. Maybe not solved, but absolutely "good enough". M-Modal, Dragon, and Telephone services (yes, call someone and have them translate) all exist. They all translate notes well enough. For most doctors, it's a huge improvement over typing.The problem is primarily access. Most institutions simply do not want to pay for this. They don't view it as necessary. Further complicating the issue. Unless a physician is in private practice, they have little to no control over their ability to use a tool like this. HIPAA is far too high of a barrier.You're trying to implement a technical solution to what's ultimately a politics/human problem.----I've spent a bunch of time in the space, both as a MedSpouse and the CTO of a HealthTech company. I wish you the best of luck as I'd love to see life be easier for my wife, but this is _extremely_ far from being MVP viable.* HIPAA - You absolutely, 100%, without a doubt need to be HIPAA compliant. You likely want to be HITRUST compliant as well. You're very, very clearly a Business Associate (HIPAA term) and processing PHI. Without that, you're dead in the water.* Chrome extension - This is not going to cut it. At baseline, it requires copy and pasting between Chrome and the EHR system. Bad UX.<pre><code> * More tactically, most EHR access occurs via remote access, like Citrix. Often from a thin-terminal on the hospital floor or at a med station. These are locked down environments that have a high likely hood of not having Chrome installed _and_ not being allow to install _any_ software (including Chrome extensions). </code></pre> * Doctors need real-time feedback and the ability to correct. Real-time feedback is critical because carrying a patient takes mental load. Part of the purpose of notes is to create a written record that "closes" a patient for the time being. An async service isn't really helpful since it generates an interrupt after the physician has already moved on.* Accuracy is important. Accuracy is even more important with _extremely_ similar and _extremely_ uncommon words. For example, can this system differential between prednisone and prednilisone? In fact, can it even identify those words? Can it do it with an accent?* FHIR is only as much of a standard as tables are to relational databases. The hard part is the thousands of custom fields that are different between health systems.* Structured data really isn't used or useful in medicine. You might have a medication list that can be structured as data, but actual notes include a lot of prose. I believe part of this is necessary to capture the nuance/minutiae of some healthcare. Part of it stems from the lack of any standard.* Again, this is all possible already. Tele-notes have been done since the introduction of EHR systems.* Nobody is going to use a Chrome extension.

评论 #35170470 未加载

ouid大约 2 年前

it sounds like you're removing a checklist from doctors.

BCM43大约 2 年前

Will you sign a BAA for HIPAA covered entities?

评论 #35152635 未加载

评论 #35152841 未加载

rubenstern大约 2 年前

Looks very interesting! Are you recruiting ?

评论 #35155735 未加载

kgiddens1大约 2 年前

Congrats on the launch Nabla!

mjdowney大约 2 年前

That's super cool!

futureisvintage大约 2 年前

Amazing idea!!

avipars大约 2 年前

how can this be compliant with HIPAA?

评论 #35158071 未加载

评论 #35158139 未加载

ismosoft大约 2 年前

this is amazing. great great

ismosoft大约 2 年前

amazing. you're ROCKKKKKKKKKK

37 条评论

swatcoder大约 2 年前

评论 #35154143 未加载

评论 #35155841 未加载

评论 #35158110 未加载

JohnFen大约 2 年前

评论 #35157690 未加载

评论 #35156611 未加载

评论 #35163623 未加载

评论 #35162162 未加载

v4dok大约 2 年前

评论 #35152627 未加载

评论 #35155715 未加载

priyanmuthu大约 2 年前

My worry is verification of facts. What if the model summarized incorrect facts, and it gets added to the medical history? How can doctors easily verify and fix things?

评论 #35152760 未加载

评论 #35153044 未加载

评论 #35160674 未加载

评论 #35152824 未加载

avgDev大约 2 年前

I feel uncomfortable knowing doctor would use anything like this.How are 2 party consent states handled?Is this HIPPA compliant?

评论 #35157089 未加载

评论 #35152765 未加载

评论 #35152850 未加载

NorthOf33rd大约 2 年前

评论 #35163712 未加载

urduntupu大约 2 年前

评论 #35152731 未加载

评论 #35157634 未加载

评论 #35152732 未加载

评论 #35152669 未加载

orcajerk大约 2 年前

评论 #35152701 未加载

评论 #35152800 未加载

评论 #35152656 未加载

评论 #35152633 未加载

评论 #35152812 未加载

评论 #35152762 未加载

评论 #35152702 未加载

评论 #35152727 未加载

评论 #35152726 未加载

D13Fd大约 2 年前

Terretta大约 2 年前

908B64B197大约 2 年前

评论 #35152996 未加载

norgie大约 2 年前

评论 #35156690 未加载

ysavir大约 2 年前

I'd prefer to have 0 AI interpreting my session with a doctor. Will there be any way for me to identify which physicians use this service so that I can avoid using them?

评论 #35158220 未加载

rubatuga大约 2 年前

评论 #35159007 未加载

hartleybrody大约 2 年前

lukah大约 2 年前

cinntaile大约 2 年前

CiceroCiceronis大约 2 年前

tomasyany大约 2 年前

It looks pretty neat! Just wondering, what other languages are supported? English seems pretty natural as it is where GPT-3 thrives, but have you been able to support other languages as well?

评论 #35152682 未加载

retox大约 2 年前

How tied are you to the name, it's almost Nambla.

评论 #35160690 未加载

gigel82大约 2 年前

ar9av大约 2 年前

tzm大约 2 年前

stopachka大约 2 年前

This is really cool! What surprised you guys about fine-tuning whisper + gpt-3?

评论 #35152631 未加载

phren0logy大约 2 年前

Side-note: your email field to sign up for your newsletter is very wonky with both Safari and Firefox (at least for me). It only lets me type one letter at a time, for some reason.

评论 #35153142 未加载

mitchellpkt大约 2 年前

Just read the blog post, very interesting. I thought sections about changing the temperature and repetition penalty were particularly neat, thanks for including examples.

Banjor大约 2 年前

One thing that a lot of folks still don't get is that some problems are bad, while others are not-so bad.

SkyPuncher大约 2 年前

评论 #35170470 未加载

ouid大约 2 年前

it sounds like you're removing a checklist from doctors.

BCM43大约 2 年前

Will you sign a BAA for HIPAA covered entities?

评论 #35152635 未加载

评论 #35152841 未加载

rubenstern大约 2 年前

Looks very interesting! Are you recruiting ?

评论 #35155735 未加载

kgiddens1大约 2 年前

Congrats on the launch Nabla!

mjdowney大约 2 年前

That's super cool!

futureisvintage大约 2 年前

Amazing idea!!

avipars大约 2 年前

how can this be compliant with HIPAA?

评论 #35158071 未加载

评论 #35158139 未加载

ismosoft大约 2 年前

this is amazing. great great

ismosoft大约 2 年前

amazing. you're ROCKKKKKKKKKK