OpenAI releases HealthBench, a medical AI evaluation benchmark

05/13/2025, 03:19 PM

PANews reported on May 13 that OpenAI announced the launch of HealthBench, a new evaluation benchmark for AI medical systems, which was designed by 262 doctors from 60 countries and covers 5,000 real simulated conversations. HealthBench uses the scoring criteria set by doctors to test the accuracy, completeness and clinical practicality of the model's response. The code and data set are now open.

In addition, OpenAI announced this morning that all Plus, Team and Pro users can now export in-depth research reports as well-formatted PDF files, including tables, pictures, citations and source links. This feature applies to both new and old reports, and will be available to Enterprise and Edu version users later.

Original link

Share to:

Author: PA一线

This content is provided for informational purposes only and does not constitute investment advice.

depth data OpenAI AI user Report

Follow PANews official accounts, let's navigate bull and bear markets together

PANews WeChat Group

Telegram Discussion Group

Telegram Info Channel

@PANewsCN

Recommended Reading

PA一线5 hour ago

In the past 24 hours, the total network contract liquidation was 188 million US dollars, mainly short orders

PA一线7 hour ago

Tornado Cash co-founder Roman Storm puts pressure on the court on the eve of trial, demanding the disclosure of FinCEN-related materials

PA一线8 hour ago

Bybit suspected of Apple ID login vulnerability, user accounts were tampered with, causing withdrawals to be blocked

PA一线8 hour ago

Data: PYTH, ZKJ, PIXEL and other tokens will be unlocked in large amounts next week, of which PYTH unlocks about $338 million

一周预告8 hour ago

Weekly preview | US lawmakers to hold final vote on stablecoin GENIUS Act; Trump to attend TRUMP dinner on May 22

PA一线11 hour ago

The author of "Bitcoin Standard" provides funding to support developers to combat garbage data on the BTC chain

Curated Series

Pioneer's View: Crypto Celebrity Interviews

Exclusive interviews with crypto celebrities, sharing unique observations and insights

PAData: Web3 in Data

Data analysis and visualization reporting of industry hot spots

Memecoin Supercycle: The hype around attention tokenization

From joke culture to the trillion-dollar race, Memecoin has become an integral part of the crypto market. In this Memecoin super cycle, how can we seize the opportunity?

AI Agent: A Journey to Web3

The AI Agen innovation wave is sweeping the world. How will it take root in Web3? Let’s embark on this adventure together!