TechEcho
ChatGPT Has Passed the United States Medical Licensing Examination (USMLE)

45 points by zekrioca over 2 years ago

8 comments

tokai over 2 years ago
The HN title is really pushing way beyond what the study can shoulder. The title from the results section sums it up better: "ChatGPT yields moderate accuracy approaching passing performance on USMLE"
The study was done on publicly available questions that have been used in USMLE exams. They used two physician reviewers to judge ChatGPT's textual answers. While really interesting, no exam was actually passed, and the authors never claim that.
YeGoblynQueenne over 2 years ago
The HN title is misleading. The study is titled:
Performance of ChatGPT on USMLE: Potential for AI-Assisted Medical Education Using Large Language Models
The study did not attempt to get ChatGPT to pass any exam. Rather, ChatGPT was questioned on material from past exams. The authors are enthusiastic about ChatGPT's performance but nowhere do they claim it "passed the US Medical Licensing Exam" as the title claims.
Indeed, the study concludes that ChatGPT's performance is not enough to pass the exam:
"ChatGPT yields moderate accuracy approaching passing performance on USMLE" (line 287 of the manuscript).
@zekrioca, please correct the title so that it more closely reflects the results of the linked study.
eig over 2 years ago
Medical student here. I’m impressed by ChatGPT’s ability to come close to passing Step 2/3. I tried plugging in my own question banks that I used for studying and it got many right. Even when it was wrong, it was “almost right” and its answer would not be disparaged on rounds.
However, one thing to keep in mind: a bot passing these exams alone will not disrupt medicine. There are plenty of doctors today who could ace these exams. I don’t think that the ability to outsource medical decision making to a bot will change much, especially since 90% of medical decision making is already outsourced to a flowchart or an “expert attending” physician. Much of the value of doctors today comes from procedures, patient interaction, legal liability, and patient trust, which cannot yet be done by a bot.
Slighted over 2 years ago
Just because it can pass the test doesn't mean that its behavior will be consistently correct. Trusting ChatGPT in medical applications is the same as trusting it in programming applications: it will work for simple to intermediate tasks and may even look as though it's explaining itself properly, but for specialized tasks, sufficiently complicated ones, and even in simple tasks, it can't be expected to produce correct results consistently.
xiphias2 over 2 years ago
"The most recent iteration of the GPT LLM (GPT3) achieved 46% accuracy with zero prompting [12], which marginally improved to 50% with further model training. Previous models, merely months prior, performed at 36.7% [13]. In this present study, ChatGPT performed at >50% accuracy across all examinations, exceeding 60% in most analyses"
I have heard many times that ChatGPT is basically GPT3, but somehow it feels like it gives more correct answers. Now there's data to show it.
jedberg over 2 years ago
I'm not sure if this is more of a pro in the ChatGPT column or an indictment of the USMLE. Although I don't think the USMLE is known as an easy exam, so I guess ChatGPT really is that good now?
transitivebs over 2 years ago
Does anyone know the methodology? E.g., what were the inputs and context windows given to ChatGPT?
simonhaidarjh over 2 years ago
Heart palpitations