See tests where Gemini 3 Pro tops GPT 5.1 and Claude, builds dashboards from prompts, and previews Agent mode, with notes on what still needs ...