Tag: FF-STACK
All the articles with the tag "FF-STACK".
-
51.85% on Humanity's Last Exam: How a Solo Researcher Built a Multi-Agent HLE Submission
1,119 out of 2,158 on canonical HLE. Single workstation, no GPU cluster, no fine-tuning. The architecture, the numbers, and what's next.