Reasoning
-
Blog
A reasoning model to rival GPT-4 at 0.5% the cost – Computerworld
“The entire reinforcement learning phase used only 512 H800s for three weeks, with a rental cost of just $534,700,” the company explained. “This is an order of magnitude less than initially anticipated.” However, industry analysts urge caution. “MiniMax’s debut reasoning model, M1, has generated justified excitement with its claim of reducing computational demands by up to 70% compared to peers…
-
Blog
OpenAI Launches o3-pro, Its Most Capable Reasoning Model Yet
On Wednesday, OpenAI released o3-pro, its most capable reasoning model on ChatGPT. In April 2025, the company released the standalone o3 model along with o4-mini. The o3-pro model is built on the same underlying o3 model but runs in a high-compute mode, which uses more computing power and extended thinking time to solve harder problems. OpenAI says o3-pro offers better…
-
Blog
‘A complete accuracy collapse’: Apple throws cold water on the potential of AI reasoning – and it’s a huge blow for the likes of OpenAI, Google, and Anthropic
Apple has suggested that AI reasoning models have clear limits when it comes to solving complex problems, undermining developer arguments that they are useful for tasks that a human would traditionally solve. Reasoning models can solve more complex problems than standard large language models (LLMs) by breaking them down into a series of smaller problems which are solved one by…
-
Blog
DeepSeek releases new version of its R1 reasoning AI model – Computerworld
Chinese AI startup DeepSeek has released an update to the R1 reasoning AI model that took the tech world by storm when it was launched at the beginning of the year. The open-source model sent shockwaves through the AI industry as its efficient use of compute and memory resources helped it match leading US models’ performance at a fraction of…
-
Blog
Which Two AI Models Are ‘Unfaithful’ at Least 25% of the Time About Their ‘Reasoning’?
Anthropic released a new study on April 3 examining how AI models process information and the limitations of tracing their decision-making from prompt to output. The researchers found Claude 3.7 Sonnet isn’t always “faithful” in disclosing how it generates responses. Anthropic is known for publicizing…
-
Blog
Introducing an Enhanced AI Reasoning Technique
Researchers from AI company DeepSeek and Tsinghua University have introduced a new technique to enhance “reasoning” in large language models (LLMs). Reasoning capabilities have emerged as a critical benchmark in the race to build top-performing generative AI systems. China and the U.S. are actively competing to develop the most powerful and practical models. According to a Stanford University…
-
Blog
Microsoft 365 Copilot’s ‘First-of-Their-Kind Reasoning Agents’ — Here’s What They Do
Microsoft is adding two new AI reasoning agents to its Microsoft 365 Copilot suite: Researcher and Analyst. These AI agents are designed to streamline workflows by handling multi-step processes that typically require significant time and expertise. The tech giant introduced these reasoning agents, which Microsoft says are “first-of-their-kind,” as part of its continued push to make AI a core part…
-
Blog
Microsoft adds ‘deep reasoning’ Copilot AI for research and data analysis
After Google and OpenAI offered up AI news on Tuesday, Microsoft has followed with announcements of its own, including details of two “deep reasoning” agents for Microsoft 365 Copilot that it claims are the first of their kind, dubbed Researcher and Analyst, as well as new capabilities for custom AI agents. Researcher relies on OpenAI’s deep research AI model to…
-
Blog
Google says its new ‘reasoning’ Gemini AI models are the best ones yet
After delivering a new “open” AI model with better performance on a single GPU, Google has now introduced an update to the AI models for its products with Gemini 2.5, which combines “a significantly enhanced base model with improved post-training” for better overall performance. It’s claiming that the first release, Gemini 2.5 Pro experimental, leads competition from OpenAI, Anthropic, xAI,…
-
Blog
OpenAI unleashes o3-mini reasoning model – Computerworld
The model also offers new features for developers who incorporate OpenAI models in their software, including function calling, developer messages, and structured outputs. They can also choose one of three reasoning effort options — low, medium, and high — to adjust power and latency to suit the use case. However, unlike OpenAI o1, it does not support vision capabilities. The…