We tested state-of-the-art LLMs under clinical-scale workloads using two designs: a single agent handling all tasks and a multi-agent orchestrator assigning each task to a dedicated worker. Across ...