Reasoning
-
Blog
Which Two AI Models Are ‘Unfaithful’ at Least 25% of the Time About Their ‘Reasoning’?
Anthropic’s Claude 3.7 Sonnet. Image: Anthropic/YouTube Anthropic released a new study on April 3 examining how AI models process information and the limitations of tracing their decision-making from prompt to output. The researchers found Claude 3.7 Sonnet isn’t always “faithful” in disclosing how it generates responses. Anthropic probes how closely AI output reflects internal reasoning Anthropic is known for publicizing…
Read More » -
Blog
Introducing an Enhanced AI Reasoning Technique
Image: Envato/DC_Studio Researchers from AI company DeepSeek and Tsinghua University have introduced a new technique to enhance “reasoning” in large language models (LLMs). Reasoning capabilities have emerged as a critical benchmark in the race to build top-performing generative AI systems. China and the U.S. are actively competing to develop the most powerful and practical models. According to a Stanford University…
Read More » -
Blog
Microsoft 365 Copilot’s ‘First-of-Their-Kind Reasoning Agents’ — Here’s What They Do
Microsoft is adding two new AI reasoning agents to its Microsoft 365 Copilot suite: Researcher and Analyst. These AI agents are designed to streamline workflows by handling multi-step processes that typically require significant time and expertise. The tech giant introduced these reasoning agents, which Microsoft says are “first-of-their-kind,” as part of its continued push to make AI a core part…
Read More » -
Blog
Microsoft adds ‘deep reasoning’ Copilot AI for research and data analysis
After Google and OpenAI offered up AI news on Tuesday, Microsoft has followed with announcements of its own, including details of two “deep reasoning” agents for Microsoft 365 Copilot that it claims are the first of their kind, dubbed Researcher and Analyst, as well as new capabilities for custom AI agents. Researcher relies on OpenAI’s deep research AI model to…
Read More » -
Blog
Google says its new ‘reasoning’ Gemini AI models are the best ones yet
After delivering a new “open” AI model with better performance on a single GPU, Google has now introduced an update to the AI models for its products with Gemini 2.5, which combines “a significantly enhanced base model with improved post-training” for better overall performance. It’s claiming that the first release, Gemini 2.5 Pro experimental, leads competition from OpenAI, Anthropic, xAI,…
Read More » -
Blog
OpenAI unleashes o3-mini reasoning model – Computerworld
The model also offers new features for developers who incorporate OpenAI models in their software, including function calling, developer messages, and structured outputs. They can also choose one of three reasoning effort options — low, medium, and high — to adjust power and latency to suit the use case. However, unlike OpenAI o1, it does not support vision capabilities. The…
Read More » -
Blog
Google Drops Its First “Reasoning” Model to Take On OpenAI o1
After OpenAI introduced its o1 reasoning model that takes some time to “think” before responding, Google has now finally released its own version of the thinking model. The new AI model is “Gemini 2.0 Flash Thinking” aka gemini-2.0-flash-thinking-exp-1219. It’s an experimental preview model, and already available on AI Studio for testing and feedback. The Gemini 2.0 Flash Thinking model follows…
Read More » -
Blog
Microsoft introduces Phi-4, an AI model for advanced reasoning tasks – Computerworld
“The goal with Phi-4 is to explore the efficiency of smaller models while maintaining accuracy,” Microsoft researchers noted in the technical documentation. Microsoft’s Phi-4 competes directly with models such as OpenAI’s GPT-4o Mini, Anthropic’s Claude 3 Haiku, and Google’s Gemini 1.5 Flash, each catering to specific applications in the small language model landscape. While GPT-4o Mini is designed for cost-efficient…
Read More » -
Blog
You Can Now Pay $200 a Month for a ‘Reasoning’ ChatGPT
Would you pay $200 a month for unlimited ChatGPT? What if it’s able to “reason”? OpenAI thinks you just might. As part of its “12 days of Shipmas,” where the company is announcing new features for 12 days straight, OpenAI is finally bringing its first reasoning model out of preview, as well as adding unlimited access to it and all…
Read More »