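2 hours ago
Google's Gemini 3 flips the table late at night: perfect scores in math and science, vision that crushes GPT-5, programmers' ...
The real slaughter took place on a leaderboard called MathArena Apex. This is the "hell mode" of math competitions, with problems full of intricate traps and extremely obscure logic. On this leaderboard, every top model, including GPT-5.1, scored around 1%, which suggests they were basically guessing blindly.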