Grok 4 by xAI was released on July 9, and it's surged ahead of competitors like DeepSeek and Claude at LMArena, a leaderboard for ranking generative AI models. However, these types of AI rankings ...
Overfitting to benchmarks is a common problem for all AI companies. xAI's Grok 4 has some problems with prompt adherence. At xAI, the overfitting could have resulted from the reinforcement ...
Elon Musk-owned xAI is testing Grok 4.20, a new model update to Grok 4, which already competes with GPT-5 in some benchmarks, such as ARC-AGI 2. GPT-5 is one of the best models for coding, and it ...
A new OmniCalculator report finds xAI’s Grok 4.2 outperforming both Claude and ChatGPT in logic and problem-solving, cutting answer instability to nearly half of legacy models. Claude 4.6 still leads ...
A mathematician with early access to xAI's Grok 4.20 found a new Bellman function for one of the problems he had been working on with his student N. Alpay. Not an Erdős problem, but original research.
The launch of Grok 4.3 represents a calculated bet by xAI that the market wants specialized brilliance and extreme cost ...
Is Grok 4.2 the most intelligent coding model we’ve seen yet? With its release in January 2026, this AI powerhouse has already sparked conversations across the tech world. In this comparison, World of ...
Elon Musk's frontier generative AI startup xAI formally opened developer access to its Grok 4.1 Fast models last night and introduced a new Agent Tools API—but the technical milestones were ...
Elon Musk-funded xAI is skipping Grok 3.5 and releasing Grok 4 after Independence Day in the United States, and it could be the best model from the company. Grok 3.5 was originally supposed to be a ...
What if the future of AI could not only dream up stunning web designs but also code them into reality with unmatched precision? In this overview, Universe of AI explores how Grok 4.2, codenamed ...