formal

DeepSeek-Prover-V2: Bridging the Gap Between Informal and Formal Mathematical Reasoning

While DeepSeek-R1 has significantly advanced AI’s capabilities in informal reasoning, formal mathematical reasoning has remained a difficult task for AI. That is primarily because producing verifiable mathematical proof requires each deep conceptual understanding and...

LM Arena, a preferred benchmark of human preferences, established a proper company

LM Arena, a brand new standard for benchmark by measuring human preference, established the corporate and began a full -scale business. LM Arena announced the establishment of the corporate through X (Twitter) on the seventeenth...

Solving (some) formal math olympiad problems

We built a neural theorem prover for Lean that learned to unravel a wide range of difficult high-school olympiad problems, including problems from the AMC12 and AIME competitions, in addition to two problems adapted from the IMO.

Recent posts

Popular categories

ASK ANA