A recent study published in Science reveals that large language models, particularly OpenAI's o1 and 4o, can provide more accurate emergency room diagnoses than human physicians. Conducted by a research team from Harvard Medical School and Beth Israel Deaconess Medical Center, the study involved analyzing 76 patients' cases where the AI models' diagnoses were compared to those of two attending physicians. The results indicated that the o1 model achieved a correct diagnosis 67% of the time during initial triage, surpassing one physician's 55% and another's 50%. The researchers emphasized the need for further prospective trials to assess AI technologies in real-world medical settings and cautioned about the lack of accountability frameworks for AI in healthcare. They also noted the limitations of current models in reasoning over nontext inputs.
Harvard Study Highlights AI's Diagnostic Accuracy in Emergency Rooms Compared to Human Doctors
More Articles From This Day
OpenAI Unveils Scalable Low-Latency Voice AI Powered by Rebuilt WebRTC Stack
OpenAI has announced the reconstruction of its WebRTC stack, enabling the delivery of low-latency voice AI solutions at a global scale. This advancement facilitates seamless conversational turn-taking, enhancing the user experience in real-time voice interactions. The initiative aims to improve the efficiency and responsiveness of voice AI applications across various platforms.
