LongCat Breakthrough: Open-Source AI Hits New Highs in Verifiable Mathematical Reasoning

arXiv AI March 24, 2026

A new 560-billion-parameter open-source model is closing the gap on human logic by mastering formal mathematical proofs with nearly perfect accuracy. This shift from 'guessing' to 'proving' marks a critical milestone for AI applications in high-stakes environments like financial modeling and software verification where errors aren't an option.

Key Intelligence

•Apparently, this new model achieved a 97.1% success rate on elite high school math competition problems, setting a new record for open-source AI.
•Did you hear that it’s mastering Lean4? That’s a specialized language used to prove theorems, meaning the AI isn't just hallucinating answers—it's mathematically proving them.
•It successfully solved over 41% of problems from the Putnam Competition, which is widely considered one of the toughest university math exams in the world.
•The model uses a 'Mixture-of-Experts' architecture with 560 billion parameters, proving that open-source models are now competing directly with the biggest proprietary labs.
•Researchers used a new 'agentic reasoning' framework that allows the AI to sketch out a plan and double-check its logic before committing to a final proof.
•Experts say this level of 'formal reasoning' is the 'holy grail' for industries like aerospace and cryptography where code must be 100% bug-free.
•Notably, it achieved these results with high 'sample efficiency,' meaning it finds the right answer much faster and with less computing power than previous models.

Read Full Source