• I’m curious about auto-regressive token prediction vs planning. The article just very briefly mentions “planning” and then never explains what it is. As someone who’s not in this area, what’s the definition/mechanism of “planning” here?

    • It’s extremely hard to separate out the actual technical terms from the hyperventilating booster lingo in this space. Unfortunately a lot of it is because there’s overlap. “Hallucination” is a well-defined technical term now, but it is also a booster term because it implies a consciousness that doesn’t exist.

      However this:

      machine that is able to reason about mathematics

      Is absolute bullshit.