benchmarks
-
Blog
ChatGPT 4.1 early benchmarks compared against Google Gemini
ChatGPT 4.1 is now rolling out, and it’s a significant leap from GPT 4o, but it fails to beat the benchmark set by Google Gemini. Yesterday, OpenAI confirmed that developers with API access can try as many as three new models: GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano. According to the benchmarks, these models are far better than the existing GPT‑4o and…
Read More » -
Blog
Meta gets caught gaming AI benchmarks with Llama 4
Over the weekend, Meta dropped two new Llama 4 models: a smaller model named Scout, and Maverick, a mid-size model that the company claims can beat GPT-4o and Gemini 2.0 Flash “across a broad range of widely reported benchmarks.” Maverick quickly secured the number-two spot on LMArena, the AI benchmark site where humans compare outputs from different systems and vote…
Read More » -
Blog
Benchmarks Find ‘DeepSeek-V3-0324 Is More Vulnerable Than Qwen2.5-Max’
With the latest stable release dated January 28, 2025, Qwen2.5-Max is classified as a Mixture-of-Experts (MoE) language model developed by Alibaba. Like other language models, Qwen2.5-Max is capable of generating text, understanding different languages, and performing advanced logic. According to recent benchmarks, it is also more secure than DeepSeek-V3-0324. Using Recon to scan for vulnerabilities A team of analysts with…
Read More » -
Blog
Apple A18 Benchmarks: Geekbench, 3DMark, AnTuTu & More
Apple’s latest iPhone 16 and 16 Plus models feature the latest A18 chipset, which looks like a binned-down version of the more powerful, A18 Pro. We have already benchmarked the A18 Pro chipset, so in this post, we have run various benchmark tests on the Apple A18, including Geekbench, 3DMark, AnTuTu, and more. On that note, let’s go ahead and…
Read More » -
Blog
Intel Arrow Lake specs and early benchmarks just leaked ahead of October launch
Intel officially unveiled their next laptop processor line, Lunar Lake, in June, though we’d known about the next-gen CPUs for a few months. The company has yet to make any announcements about the desktop version. A new leak on X spotted by GSMArena claims that desktop processors will be dubbed Arrow Lake and are expected to be revealed on October…
Read More »