OpenAI highlights GPT-5 scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, 46.2% on HealthBench Hard (Carl Franzen/VentureBeat)

August 08, 2025 Leave a Reply Tags: Techmeme

Carl Franzen / VentureBeat:
OpenAI highlights GPT-5 scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, 46.2% on HealthBench Hard — After literally years of hype and speculation, OpenAI has officially launched a new lineup of large language models (LLMs) …

from Techmeme https://ift.tt/4asQR1i

Share Me

Tweet
Share
Share
Share
Share

Sky-News

0 comments:

Please do not enter any spam in the comment box!