Discussion about this post

Neural Foundry

Nice to see MiniMax continuing to push open-source boundaries. The 74.0 SWE-bench Verified score is impressive given how quickly they're iterating. I've been tracking how VIBE-Web differs from traditional text benchmarks, and it's a much better signal for actual web app generation tasks because it tests end-to-end functional outcomes rather than just isolated completions. Curious how M2.1 handles state management in more complex React apps compared to the previous version.
