OpenAI released the o3 and o4-mini reasoning models, which can use web browsing, code execution, and file analysis tools as part of their thinking process.
Meta's Llama 4 model family introduces Scout (17B active parameters, 10M-token context) and Maverick (17B active parameters, 128 experts). Both are mixture-of-experts models released with open weights.
Anthropic released Claude 3.7 Sonnet, which it describes as the first hybrid reasoning model on the market. A single model can give fast responses or switch to an extended thinking mode for deeper reasoning.
xAI released Grok 3, trained on the Colossus supercomputer with 200,000 GPUs. It claims the top spot on several AI benchmarks.
Chinese AI startup DeepSeek released R1, a reasoning model rivaling OpenAI's o1. Reportedly trained for a fraction of the cost of comparable models, it sent shockwaves through global markets.
OpenAI's new o1 model is trained to work through a chain of thought before answering, building "chain-of-thought reasoning" into the model itself. OpenAI reports PhD-level results on math and science benchmarks from this step-by-step thinking.
Meta released Llama 3.1 405B, which it describes as the world's largest openly available language model. Meta says it rivals GPT-4 and Claude 3.5 Sonnet in performance.
Anthropic's Claude 3.5 Sonnet surpasses both GPT-4o and Claude 3 Opus on many benchmarks while running twice as fast as Opus at one-fifth the price (80% cheaper).
OpenAI's new model GPT-4o natively processes any combination of text, audio, and image inputs. It can respond to audio input in as little as 232 milliseconds.
Meta's Llama 3, released in 8B and 70B parameter versions, sets new standards in the open-source AI world. The 405B model is also on the way.
Anthropic's Claude 3 model family offers three models (Haiku, Sonnet, and Opus) at different price and performance tiers. Claude 3 Opus surpasses GPT-4 in several benchmarks.
Google has renamed its AI chatbot Bard to Gemini. A Gemini Advanced subscription and mobile app were also announced.