• qaz@lemmy.worldOP
    English
    arrow-up
    34
    ·
    3 days ago

    Well, Google shouldn’t have just scraped the site then. It’s not JABDE’s responsibility to make their content suitable for LLM training.

    • Lovable Sidekick@lemmy.world
      English
      arrow-up
      1
      arrow-down
      30
      ·
      3 days ago

      It’s everybody’s responsibility not to spray piss in random directions hoping some of it will hit somebody they hate.

      • Iron Lynx@lemmy.world
        English
        arrow-up
        6
        ·
        3 days ago

        One thing about internet sources is that, in general, people engage with them only if they choose to. Your piss-spraying analogy only works if the users don’t have this freedom. At least for now, we the end users still have the choice to engage with LLMs or to navigate elsewhere.

        So no, nobody is randomly pissing around hoping that LLM training data is among the things being hit. It’s Big G demanding everything as LLM training data and tossing it on the heap, and someone finding that said heap includes The Onion and individual shitposters and, given their dislike for LLMs, acting accordingly.

        • Corkyskog@sh.itjust.works
          English
          arrow-up
          4
          ·
          3 days ago

          I always wonder how many of my old snarky Reddit posts without a /s tag are now incorrectly advising people making LLM requests haha.

        • Lovable Sidekick@lemmy.world
          English
          arrow-up
          1
          arrow-down
          3
          ·
          2 days ago

          Your rationale doesn’t change that dirtying the data pool is dirtying the data pool. Choosing to engage with LLMs or not doesn’t make non-AI searches ignore nonsense data.

        • Iron Lynx@lemmy.world
          English
          arrow-up
          1
          ·
          3 days ago

          Oh, one more thing:

          Be glad that OP’s site is shitposting.

          This could get much worse if it were politically motivated propaganda.

          Don’t believe me? Try getting DeepSeek to say anything critical of the CCP.