    AI models are starting to crack high-level math problems 

    By admin | January 14, 2026 | 4 min read

    Over the weekend, Neel Somani, a software engineer, former quant researcher, and startup founder, was testing the math skills of OpenAI’s new model when he made an unexpected discovery. After pasting a problem into ChatGPT and letting it think for 15 minutes, he came back to a full solution. He evaluated the proof and formalized it with a tool from Harmonic, and it all checked out.

    “I was curious to establish a baseline for when LLMs are effectively able to solve open math problems compared to where they struggle,” Somani said. The surprise was that, using the latest model, the frontier started to push forward a bit. 

    ChatGPT’s chain of thought is even more impressive, rattling off mathematical results like Legendre’s formula, Bertrand’s postulate, and the Star of David theorem. Eventually, the model found a MathOverflow post from 2013, where Harvard mathematician Noam Elkies had given an elegant solution to a similar problem. But ChatGPT’s final proof differed from Elkies’ work in important ways, and gave a more complete solution to a version of the problem posed by legendary mathematician Paul Erdős, whose vast collection of unsolved problems has become a proving ground for AI.
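
    For readers unfamiliar with two of the results named above, here is a minimal Python sketch of what they state. It is not taken from ChatGPT's proof or from Elkies' post, and the function names are hypothetical, chosen only for illustration. Legendre's formula gives the exponent of a prime p in n! as the sum of floor(n / p^i) over i ≥ 1, and Bertrand's postulate says that for every n > 1 there is a prime p with n < p < 2n.

```python
from math import factorial


def legendre_exponent(n: int, p: int) -> int:
    """Exponent of the prime p in n!, via Legendre's formula."""
    total = 0
    power = p
    while power <= n:
        total += n // power  # floor(n / p^i)
        power *= p
    return total


def exponent_by_division(m: int, p: int) -> int:
    """Exponent of p in m by repeated division (brute-force cross-check)."""
    count = 0
    while m % p == 0:
        m //= p
        count += 1
    return count


def is_prime(k: int) -> bool:
    """Trial-division primality test, adequate for small inputs."""
    if k < 2:
        return False
    i = 2
    while i * i <= k:
        if k % i == 0:
            return False
        i += 1
    return True


def bertrand_prime(n: int) -> int:
    """Smallest prime p with n < p < 2n; Bertrand's postulate says one exists for n > 1."""
    for candidate in range(n + 1, 2 * n):
        if is_prime(candidate):
            return candidate
    raise ValueError("no prime found; n must be greater than 1")


if __name__ == "__main__":
    # The formula and the brute-force count should agree on 10!.
    print(legendre_exponent(10, 2), exponent_by_division(factorial(10), 2))
    print(bertrand_prime(10))
```

    A numerical check like this only illustrates the statements on small inputs. It is not a proof, which is where formal verification comes in.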

    For anyone skeptical of machine intelligence, it’s a surprising result — and it’s not the only one. AI tools have become ubiquitous in mathematics, from formalization-oriented LLMs like Harmonic’s Aristotle to literature review tools like OpenAI’s deep research. But since the release of GPT 5.2 — which Somani describes as “anecdotally more skilled at mathematical reasoning than previous iterations” — the sheer volume of solved problems has become difficult to ignore, raising new questions about large language models’ ability to push the frontiers of human knowledge.  

    Somani was looking at the Erdős problems, a set of over one thousand conjectures by the Hungarian mathematician that are maintained online. The problems, which vary significantly in both subject matter and difficulty, have become a tempting target for AI-driven mathematics. The first batch of autonomous solutions came in November from a Gemini-powered model called AlphaEvolve, but more recently, Somani and others have found GPT 5.2 to be remarkably adept with high-level math.

    Since Christmas, 15 problems have been moved from “open” to “solved” on the Erdős website — and 11 of the solutions have specifically credited AI models as involved in the process. 

    The revered mathematician Terence Tao offers a more nuanced look at the progress on his GitHub page, counting eight Erdős problems where AI models made meaningful autonomous progress, with six other cases where progress was made by locating and building on previous research. It’s a long way from AI systems being able to do math without human intervention, but it’s clear that there’s an important role for large models to play.

    On Mastodon, Tao conjectured that the scalable nature of AI systems makes them “better suited for being systematically applied to the ‘long tail’ of obscure Erdős problems, many of which actually have straightforward solutions.”

    “As such, many of these easier Erdős problems are now more likely to be solved by purely AI-based methods than by human or hybrid means,” Tao continued.

    Another driving force is a recent shift towards formalization, a labor-intensive task that makes mathematical reasoning easier to verify and extend. Formalization doesn’t require the use of AI or even computers, but a new crop of automated tools has made the process far easier. The open-source “proof assistant” Lean, which was developed at Microsoft Research in 2013, has become widely used within the field as a way of formalizing proofs, and AI tools like Harmonic’s Aristotle promise to automate much of the work of formalization.
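
    To give a sense of what formalization looks like in practice, here is a toy sketch in Lean 4. It assumes only Lean's core library, and the theorem names are hypothetical. It has nothing to do with the Erdős proofs discussed here, which are far longer. It only shows the basic idea: each statement is written in a precise language, and the proof assistant accepts it only if the supplied proof actually checks.

```lean
-- A concrete arithmetic fact, verified by direct computation.
example : 2 + 2 = 4 := rfl

-- n + 0 = n holds by the definition of addition on natural numbers.
theorem add_zero_example (n : Nat) : n + 0 = n := rfl

-- 0 + n = n is not true by definition alone; here we reuse a lemma from Lean's core library.
theorem zero_add_example (n : Nat) : 0 + n = n := Nat.zero_add n
```

    If any of these proofs were wrong, Lean would reject the file. That is the property that makes a formalized proof easier to trust and build on than an informal one.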

    For Harmonic founder Tudor Achim, the sudden jump in solved Erdős problems is less important than the fact that the world’s greatest mathematicians are starting to take those tools seriously. “I care more about the fact that math and computer science professors are using [AI tools],” Achim said. “These people have reputations to protect, so when they’re saying they use Aristotle or they use ChatGPT, that’s real evidence.” 
