• BradleyUffner@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    9 hours ago

    The models themselves are actually entirely deterministic. The non-determinism you see is actually artificially introduced at the application layer to make the output seem more human. It’s usually controlled by a setting called “heat”, which when set to 0 will give completely reproducible results.

    • SlimePirate@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      3
      ·
      7 hours ago

      This is correct, I suppose you’re talking about the final softmax layer? When I said they are bad at determinism, I was talking about reasoning on deterministic rules not having deterministic output. For example, LLMs make logical deduction errors, calculation errors etc.