• boonhet@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    1
    ·
    25 days ago

    AI doesn’t see individual characters, it sees tokens, with most tokens being a word or part of a word. That’s why per-character questions have such a high failure rate.

    • PunnyName@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      5
      ·
      edit-2
      24 days ago

      If it doesn’t understand the simple concept of the number of letters and spaces, it needs to be reprogrammed.

      ETA: sorry folks, not gonna change my view and simp for shit A.I., continue with the downvotes.

      • boonhet@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        9
        ·
        25 days ago

        It doesn’t understand anything though? It never will. It’s a probability machine. If you choose to believe its output, that’s on you. I use it as a coding assistant to get boring things done faster. Fire a prompt at claude code, grab a coffee, check out the diff. But that last step is crucial. Can’t trust AI output blindly.

        • dream_weasel@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          2
          ·
          25 days ago

          The embedding layer post tokenization is not just a probability machine the way you’re suggesting it. You can argue that it is probabilistic with inferred sentiment, but too many people think it works like how text prediction on your phone does and that is just factually inaccurate.

          Verify output of course, but saying “it doesn’t understand anything” and “probability machine” is a borderline erroneous short sell. At the level of tokens it “understands” relationships, and those relationships are not probabilistic, though they are fundamentally approximated based on a training corpus.