When researchers at Tsinghua University and other institutions built MMMU-Pro, they designed it to be nearly impossible to game. Every question pairs an image with text, and any item a model can ...
OpenAI Group PBC today launched a new large language model that is significantly better than its predecessors at solving math problems and writing code. GPT-5.5 is rolling out a week after rival ...
OpenAI claims its reasoning model disproved a geometry conjecture unsolved since 1946 — and this time, the mathematicians who ...
OpenAI’s GPT-5.2 Pro does better at solving sophisticated math problems than older versions of the company’s top large language model, according to a new study by Epoch AI, a non-profit research ...
GPT-5.5 delivers polished, useful answers across tasks. Strong performance across writing, coding, and reasoning tasks. Overeagerness hurts accuracy and instruction following. OpenAI has released ...
OpenAI announced GPT-5.5, its latest AI model, which is better at coding, using computers, and conducting deeper research. The launch comes just weeks after Anthropic unveiled Claude Mythos ...
GPT-5.4's 83% score suggests AI rivals expert professionals. Tests span nine industries and 44 real-world occupations. New capabilities boost coding, tools, and computer control. It seems like only ...
On Thursday, OpenAI released GPT-5.4, a new foundation model billed as “our most capable and efficient frontier model for professional work.” In addition to the standard version, GPT-5.4 is also ...
Katelyn is a reporter with CNET covering artificial intelligence, including chatbots and image and video generators. Her work explores how new AI technology is infiltrating our lives, shaping the content ...
An exclusive conversation with Kevin Weil, head of OpenAI for Science, a new in-house team that wants to make scientists more productive. In the three years since ChatGPT’s explosive debut, OpenAI’s ...