As AI-generated content fills the Internet, it’s corrupting the training data for models to come. What happens when AI eats itself?

  • blivet
    link
    fedilink
    62 years ago

    So in order for data to be useful to AIs, AI-generated content will have to be flagged as such. Sounds good to me.

    • admiralteal
      link
      fedilink
      42 years ago

      But malicious actors don’t want their generated data to be recognizable to LLMs. They want it to be impersonating real people in order to promote advertising/misinformation goals.

      Which means that even if they started flagging LLM generated content as LLM generated, that would just mean only the most malicious and vile LLM contents will be out there training models in the future.

      I don’t see any solution to this on the horizon. Pandora is out of the box.

      • blivet
        link
        fedilink
        62 years ago

        If the quality of AI-generated content degrades to the point where it’s useless that is also fine with me.

        • RoboRay
          link
          fedilink
          52 years ago

          Some would argue that this is the starting position.

      • Machinist3359
        link
        fedilink
        12 years ago

        To flip it, this means that only AI which responsibly manages it’s initial data set will be successful. Can’t simply scrape and pray, need to have some level of vetting with input.

        More labor intensive? Sure, but AI companies aren’t entitled to quick and easy solutions they started with…

        • admiralteal
          link
          fedilink
          12 years ago

          That doesn’t follow.

          It means the AI companies that don’t behave responsibly will have a huge advantage over the ones that do.

  • curiosityLynx
    link
    fedilink
    62 years ago

    Cannibalism always has increased risk of brain disease. Seems fitting that this applies to AI too.

    • grahamsz
      link
      fedilink
      52 years ago

      I like the term from Jathan Sadowski that it should be called Habsburg AI

  • TimeSquirrel
    link
    fedilink
    52 years ago

    It’s basically RE:RE:RE:RE:RE:RE and corrupted jpegs that have been reposted and compressed a thousand times, but for AI.

  • admiralteal
    link
    fedilink
    32 years ago

    Dead internet theory seems like a completely inevitable future place that we’re all racing to. I don’t see any way to avoid it. It’s a tragedy of the commons in a place where there is no organizing body that can step in and prevent private actors from destroying everything. Worse, we’re more concerned with those private actors being strong and competitive which is only accelerating us towards the doomed endgame.

  • anon2481
    link
    fedilink
    22 years ago

    Great. It’ll be easy to tell AI generated content apart when they’re just spewing jibberish.