• Admiral Patrick
    English
    87
    1 year ago

    As junk web pages written by AI proliferate, the models that rely on that data will suffer.

    Good.

  • Madrigal
    English
    79
    1 year ago

    “On two occasions I have been asked, ‘Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?’ I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.” - Charles Babbage

    • @bionicjoey@lemmy.ca
      English
      15
      1 year ago

      The business people adopting AI: “who cares what it’s trained on? It’s intelligent right? It’ll just sort through the garbage and magically come up with the right answers to everything”

      • @lennivelkant@discuss.tchncs.de
        English
        1
        1 year ago

        I believe Robustness was the term I learned years ago: the ability of a system to gracefully handle user error, make it easy to recover from or fix, clearly communicate what was wrong etc.

        Of course, nothing is ever perfect and humans are very creative at fucking up, and a lot of companies don’t seem to take UX too seriously. Particularly when the devs get tunnel vision and forget about user error being a thing…

  • Lvxferre [he/him]
    English
    40
    1 year ago

    Model degeneration is an already well-known phenomenon. The article already explains well what’s going on so I won’t go into details, but note how this happens because the model does not understand what it is outputting - it’s looking for patterns, not for the meaning conveyed by said patterns.

    Frankly at this rate might as well go with a neuro-symbolic approach.
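
    To make the degeneration point concrete, here is a toy sketch, not an actual LLM and not taken from the article. It assumes a "model" is just a table of token frequencies, and that each generation is trained only on text sampled from the previous one. The function name `simulate_collapse` and all parameters are made up for illustration. Rare tokens that happen not to be sampled get zero probability and can never come back, so the vocabulary the model can emit only shrinks.

```python
import random
from collections import Counter


def simulate_collapse(vocab_size=50, sample_size=200, generations=10, seed=0):
    """Return how many distinct tokens the model can still emit at each generation."""
    rng = random.Random(seed)
    tokens = list(range(vocab_size))
    # Generation 0: a Zipf-like "human" distribution with a long tail of rare tokens.
    weights = [1.0 / (rank + 1) for rank in range(vocab_size)]
    support_sizes = [vocab_size]
    for _ in range(generations):
        # "Publish" text by sampling from the current model...
        corpus = rng.choices(tokens, weights=weights, k=sample_size)
        # ...then "train" the next model on that text alone (plain frequency counts).
        counts = Counter(corpus)
        weights = [counts.get(tok, 0) for tok in tokens]
        # Count the tokens that still have non-zero probability.
        support_sizes.append(sum(1 for w in weights if w > 0))
    return support_sizes


if __name__ == "__main__":
    print(simulate_collapse())
```

    The exact numbers depend on the seed, but the list can never increase from one generation to the next: the counting step has no way to recover a token it did not see.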

    • @CeeBee_Eh@lemmy.world
      English
      2
      1 year ago

      The issue with your assertion is that people don’t actually work a similar way. Have you ever met someone who was clearly taught “garbage”?

      • Lvxferre [he/him]
        English
        12
        1 year ago

        The issue with your assertion is that people don’t actually work a similar way.

        I’m talking about LLMs, not about people.

        • @CeeBee_Eh@lemmy.world
          English
          2
          1 year ago

          I know you are, but the argument that an LLM doesn’t understand context is incorrect. It’s not human level understanding, but it’s been demonstrated that they do have a level of understanding.

          And to be clear, I’m not talking about consciousness or sapience.

          • Lvxferre [he/him]
            English
            10
            1 year ago

            I know you are, but the argument that an LLM doesn’t understand context is incorrect

            Emphasis mine. I am talking about the textual output. I am not talking about context.

            It’s not human level understanding

            Additionally, your obnoxiously insistent comparison between LLMs and human beings boils down to a red herring.

            Not wasting my time further with you.

            [For others who might be reading this: sorry for the blatantly rude tone but I got little to no patience towards people who distort what others say, like the one above.]

            • @CeeBee_Eh@lemmy.world
              English
              1
              1 year ago

              I got little to no patience towards people who distort what others say,

              My original reply was meant to be tongue-in-cheek, but I guess I forgot about Poe’s law. I’m not a layman, for the record. I’ve worked with AI for over a decade.

              Not wasting my time further with you.

              Ditto. Have a nice day.

              • @CileTheSane@lemmy.ca
                English
                4
                edit-2
                1 year ago

                A better mathematical system of storing words does not mean the LLM understands any of them. It just has a model that represents the relation between words that it uses.

                If I put 10 minus 8 into my calculator I get 2. The calculator doesn’t actually understand what 2 means, or what subtracting represents; it just runs the commands that give the appropriate output.

                • @CeeBee_Eh@lemmy.world
                  English
                  2
                  edit-2
                  1 year ago

                  That’s a bad analogy, because the calculator wasn’t trained using an artificial neural network literally designed by studying biological brains (aka biological neural networks).

                  And “understand” doesn’t equate to consciousness or sapience. For example, it is entirely and factually correct to state that an LLM is capable of reasoning. That’s not even up for debate. The accuracy of an LLM’s reasoning capability is one of the fundamental benchmarks used for evaluating its quality.

                  But that doesn’t mean it’s “thinking” in the way most people consider.

                  Edit: anyone upvoting this CileTheSane clown is in the same boat of not comprehending how LLMs work.

      • @PenisDuckCuck9001@lemmynsfw.com
        English
        8
        edit-2
        1 year ago

        I’m autistic and sometimes I feel like an AI bot spewing out garbage in social situations. If I do what people normally do and make it sound believable, maybe no one will notice.

  • @tal@lemmy.today
    English
    27
    1 year ago

    Well, you’ve got a timestamped copy of much of the Web that existed up until latent-diffusion models at archive.org. That may not give you access to newer information, but it’s a pretty whopping big chunk of data to work with.

    • palordrolap
      19
      1 year ago

      Hopefully archive.org have measures in place to stop people from yanking all their data too quickly. At least not without a hefty donation or something. As a user it can chug a bit, and I’m hoping that’s the rate-limiting I’m talking about and not that they’re swamped.

  • @Anarki_@lemmy.blahaj.zone
    English
    16
    1 year ago

    ⢀⣠⣾⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⠀⠀⠀⠀⣠⣤⣶⣶ ⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⠀⠀⠀⢰⣿⣿⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣧⣀⣀⣾⣿⣿⣿⣿ ⣿⣿⣿⣿⣿⡏⠉⠛⢿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⡿⣿ ⣿⣿⣿⣿⣿⣿⠀⠀⠀⠈⠛⢿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⠿⠛⠉⠁⠀⣿ ⣿⣿⣿⣿⣿⣿⣧⡀⠀⠀⠀⠀⠙⠿⠿⠿⠻⠿⠿⠟⠿⠛⠉⠀⠀⠀⠀⠀⣸⣿ ⣿⣿⣿⣿⣿⣿⣿⣷⣄⠀⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⣴⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⣿⠏⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠠⣴⣿⣿⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⡟⠀⠀⢰⣹⡆⠀⠀⠀⠀⠀⠀⣭⣷⠀⠀⠀⠸⣿⣿⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⠃⠀⠀⠈⠉⠀⠀⠤⠄⠀⠀⠀⠉⠁⠀⠀⠀⠀⢿⣿⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⢾⣿⣷⠀⠀⠀⠀⡠⠤⢄⠀⠀⠀⠠⣿⣿⣷⠀⢸⣿⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⡀⠉⠀⠀⠀⠀⠀⢄⠀⢀⠀⠀⠀⠀⠉⠉⠁⠀⠀⣿⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⣧⠀⠀⠀⠀⠀⠀⠀⠈⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢹⣿⣿ ⣿⣿⣿⣿⣿⣿⣿⣿⣿⠃⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢸⣿⣿

  • Sundray
    English
    3
    1 year ago

    AI writing, scraped by AI, producing more AI writing…

    So not “gray goo” exactly, but “gray slop”?