OpenAI employees publicly accused Grok3 of misleading benchmark results | PANews

OpenAI employees publicly accused Grok3 of misleading benchmark results

PANews reported on February 23 that according to Jinshi, recently, an employee of OpenAI publicly accused Elon Musk's xAI company, saying that the benchmark results of its latest AI model Grok3 were misleading. In response, Igor Babushkin, co-founder of xAI, insisted that the company was not wrong. xAI's chart shows that two versions of Grok3 - Grok3 Reasoning Beta and Grok3 mini Reasoning - outperformed OpenAI's current strongest available model o3-mini-high on AIME 2025. However, OpenAI employees quickly pointed out on the X platform that xAI's chart did not include the AIME 2025 score of o3-mini-high under "cons@64" conditions. Babushkin argued on the X platform that OpenAI had released similar misleading benchmark charts in the past. Although these charts are used to compare the performance of its own models.

Share to:

Author: PA一线

This content is for market information only and is not investment advice.

Follow PANews official accounts, navigate bull and bear markets together

PANews WeChat Group

Telegram Discussion Group

Telegram News Channel

Recommended Reading

PA一线

07/14/2025, 12:34 AM

SpaceX has agreed to invest $2 billion in xAI

PA一线

07/08/2025, 09:56 AM

Musk: Grok 4 will be released live at 11 am this Thursday

PA一线

06/27/2025, 11:39 AM

Musk: Grok 4 will be released after July 4th

PA一线

05/28/2025, 12:26 PM

Telegram and xAI reach a cooperation, Grok AI will be integrated into the Telegram application

PA一线

04/26/2025, 01:58 AM

Musk's xAI plans to raise $20 billion, which is expected to become the second largest startup financing in history

PA一线

04/05/2025, 03:35 AM

Musk v. OpenAI jury trial to begin next spring

Trending:Bitcoin

Popular Articles

In the past 24 hours, a total of $366 million in contract liquidations occurred across the entire network, primarily involving long positions.

U.S. stocks opened lower, with the S&P 500 down 0.15%.

Spot gold falls below $4,800.

US stocks closed lower across the board, while crypto stocks generally rose.

"Big Short" Michael Burry: The current Bitcoin decline is similar to the 2022 bear market.

Industry News

Market Trends

Curated Readings

PANews App

24/7 blockchain news tracking and in-depth analysis.

Download PANews App

App Store Google Play

All three major U.S. stock indexes closed lower, with COIN falling more than 7.59%.

PANews Newsflash2 hours ago