It's a fun way to compare real-world performance across models, one that's more practical and actionable.
Excited to test this.
> "build a Linux-style desktop environment as a web application"
They claim "50 applications from scratch", but "Browser" and a bunch of the other apps are likely just <iframe> elements. We all know that building a spec-compliant browser alone is a herculean task.
For short-term bugfixing and tweaks though, it does about what I'd expect from Sonnet for a pretty low price.
[[you guys, please don't post like this to HN - it will just irritate the community and get you flamed]]