OpenAI claims GPT-5 performance nears human experts across key industries

OpenAI's initial GDPval-v0 test, which assesses performance across 44 occupations in nine major industries (like healthcare, finance, and manufacturing), suggests that advanced models are "already approaching the quality of work produced by industry experts."

By Storyboard18| Sep 26, 2025 10:44 AM

OpenAI has introduced a new metric, GDPval, to measure how closely its latest AI models, including GPT-5, are performing compared to human professionals in tasks tied to America's economy. The benchmark is an early step toward fulfilling the company's mission to develop Artificial General Intelligence (AGI) capable of economically valuable work.

OpenAI's initial GDPval-v0 test, which assesses performance across 44 occupations in nine major industries (like healthcare, finance, and manufacturing), suggests that advanced models are "already approaching the quality of work produced by industry experts."

GPT-5-high: A souped-up version of GPT-5 was ranked as better than or on par with industry experts 40.6% of the time in the tasks tested.

Anthropic's Claude Opus 4.1: This competing model performed slightly better, winning or tying against human reports in 49% of tasks. OpenAI attributes this high score partly to Claude's ability to produce pleasing graphics, which may have swayed human professional evaluators.

The test asks experienced professionals to compare AI-generated reports (e.g., a competitor landscape from an investment banker) with those created by other humans, then select the best one.

OpenAI acknowledges that GDPval-v0 is currently limited, as it only tests the creation of research reports, which is just a small component of any professional's actual job. However, the progress shown is significant:

OpenAI's previous model, GPT-4o, scored just 13.7% (wins and ties) approximately 15 months ago.

The nearly threefold increase in performance with GPT-5 encourages OpenAI's evaluations lead, Tejal Patwardhan, who expects the rapid improvement to continue.

OpenAI's chief economist, Dr. Aaron Chatterji, suggests these results mean that people in these occupations can use the increasingly capable AI models to "offload some of their work and do potentially higher value things."

Benchmarks like GDPval are becoming crucial as existing, academic AI tests, such as AIME 2025 (math) and GPQA Diamond (science), are nearing saturation. GDPval aims to provide a more real-world assessment of AI's proficiency, a critical step as the industry attempts to definitively measure AI's value across various sectors.

SPOTLIGHT

Digital From Clutter to Clarity: How Video is transforming B2B storytelling

According to LinkedIn’s research with over 1,700 B2B tech buyers, video storytelling has emerged as the most trusted, engaging, and effective format for B2B marketers. But what’s driving this shift towards video in B2B? (Image Source: Unsplash)

Explained: Standing Committee’s draft report on India’s fight against Fake News

India’s parliamentary panel warns fake news threatens democracy, markets and media credibility, urging stronger regulation, fact-checking, AI oversight and global cooperation.

OpenAI claims GPT-5 performance nears human experts across key industries

OpenAI's initial GDPval-v0 test, which assesses performance across 44 occupations in nine major industries (like healthcare, finance, and manufacturing), suggests that advanced models are "already approaching the quality of work produced by industry experts."

SPOTLIGHT

Explained: Standing Committee’s draft report on India’s fight against Fake News

POPULAR

More from Storyboard18

How it Works

Foxconn under fire as report alleges harsh conditions at iPhone 17 factories in China

Social Media

Meta launches 'Vibes': A new AI-generated video feed challenging TikTok and Reels

How it Works

FSSAI launches festive season drive to ensure food safety, curb adulteration in sweets and dairy products

Digital

Google launches AI tool Mixboard; takes on Pinterest and Nano Banana integration

How it Works

How to spot AI-generated images (Nano Banana) - Tips & Tools

Agency News

Breaking: Madison–Havas deal reaches signing stage, clearance expected by early 2026

How it Works

How Google’s Nano Banana is changing social media in India

Digital

Google launches AI tool Mixboard; takes on Pinterest and Nano Banana integration