On HMMT Feb 25, a rigorous reasoning benchmark, Qwen3-Max-Thinking scored 98.0, edging out Gemini 3 Pro (97.5) and ...
Dozens of doctors and therapists said chatbots had led their patients to psychosis, isolation and unhealthy habits.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results