So, I’ve been on Lemmy since the great Reddit exodus a couple of years ago. Back on Reddit, there were discussions about data poisoning: since it was nigh unto impossible to keep your data private, people would poison their data with all sorts of extraneous information, so companies couldn’t tell what data was accurate and what wasn’t.

But, here on Lemmy, I haven’t seen any discussions about the topic. Are people still poisoning their data? Why haven’t I seen any discussions about it? Is it still discussed, just not on the instances I’ve seen?

  • LilB0kChoy@midwest.social · 52 up / 8 down · 3 months ago

    Way, way back in the early days of the internet, when it was still all just message boards and users from universities, there were discussions about data poisoning. The early forebears of users today had enough foresight to understand what the internet could be if it went the wrong way, and started a collaborative project to develop tools as technology developed to combat it. They managed to keep up the project; legacy developers would move on or die, but new ones sprang up in their place. It all fell apart way back in nineteen ninety eight when the undertaker threw mankind off hell in a cell and plummeted sixteen feet through an announcer’s table.

    • Angry_Autist@lemmy.world · 7 up / 7 down · edited · 3 months ago

      Hmmm, top comment in an asklemmy thread is a joke, and the mods do nothing after 17+ hours?

      We sure are speedrunning the Reddit irrelevance arc, aren’t we?

      As is tradition, both you and your instance are now blocked.

      • makingStuffForFun@lemmy.ml · 5 up / 1 down · 3 months ago

        I think the fact that their comment is the top comment indicates that the general populace of Lemmy likes it, and trying to censor or block it is just bizarre. It’s like telling the community what they can and can’t think.

        • Angry_Autist@lemmy.world · 1 up / 3 down · 3 months ago

          Yep, and asshats just like you made this exact same fucking argument every day as Reddit slid downhill, and look where they are now.

          Your idealism is naive.

          • Wugmeister@lemmy.dbzer0.com · 2 up · 3 months ago

            BTW, the only reason I decided not to block you is I did a quick flick-flick speed read down your recent posts and realized you seem to be a leftist of some sort, which to me means you’ve got a lot of reasons to be pissed constantly.

            Still, being this level of enraged 24/7 is not good for you and is frankly not productive at all. Have you considered therapy? You gotta cool off at some point. And I don’t know if you have some underlying issue other than the “enhanced sense of justice” symptom of autism.

        • Angry_Autist@lemmy.world · 1 up · 3 months ago

          I’m trying to figure out how to encapsulate the idea that ‘Being okay with those kinds of jokes is what turns a forum sour through adoption by idiots that don’t think it’s being ironic’ before I get bored and block you, and I failed.

  • Auth@lemmy.world · 30 up · 3 months ago

    I have no idea about Reddit, but I poison Copilot data daily at work, feeding it nonsense, incorrect answers and misusing the thumbs up and down feedback. Sometimes I just generate max-context nonsense text over and over to try and hit the API limit. We’re not paying for the licenses because Microsoft is trying to show us how awesome it is. But this week is my last week doing so, because my company has decided it’s disabling Copilot.

  • besselj@lemmy.ca · 26 up · 3 months ago

    Big AI companies pretty much exclusively sell LLMs that output unreliable data, so idk how much of a worry it is anymore.

    • GratefullyGodless@lemmy.world (OP) · 9 up · 3 months ago

      True. But this is more about poisoning our data that companies give to data brokers, advertisers, etc., rather than LLM data.

  • Showroom7561@lemmy.ca · 11 up · 3 months ago

    I wonder if someone can make a Firefox extension that auto fills user profiles in various accounts with nonsense… fake address, fake bio, fake job, etc. Make it easy for users to poison data.

    And the extension could add nonsense to various posts, like here on Lemmy. Not enough to ruin the content, but enough to taint any LLM data scraping.
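    For what it's worth, the "generate nonsense profile fields" half of that idea is easy to sketch. This is only a rough illustration in Python, not an actual Firefox extension: the word pools, the field names, and the `fake_profile` helper are all made up for the example, and the form-filling part is left out entirely.

```python
import random

# Hypothetical word pools (assumptions for this sketch); a real tool
# would want much larger and more varied lists.
FIRST_NAMES = ["Alex", "Sam", "Jordan", "Casey", "Riley"]
LAST_NAMES = ["Marsh", "Okafor", "Lindqvist", "Tanaka", "Doyle"]
STREETS = ["Maple St", "Harbor Rd", "Quarry Ln", "Elm Ave"]
JOBS = ["beekeeper", "elevator inspector", "cartographer", "piano tuner"]
HOBBIES = ["curling", "bonsai", "ham radio", "competitive whistling"]


def fake_profile(rng: random.Random) -> dict[str, str]:
    """Build one set of made-up profile fields from the pools above."""
    return {
        "name": f"{rng.choice(FIRST_NAMES)} {rng.choice(LAST_NAMES)}",
        "address": f"{rng.randint(1, 9999)} {rng.choice(STREETS)}",
        "job": rng.choice(JOBS),
        "bio": f"Really into {rng.choice(HOBBIES)} and {rng.choice(HOBBIES)}.",
    }


if __name__ == "__main__":
    # Print one throwaway profile; every run gives a different one.
    for field, value in fake_profile(random.Random()).items():
        print(f"{field}: {value}")
```

    Passing in the `random.Random` instance means you can seed it and get the same fake identity back, which could help if you want one consistent fake persona per site instead of fresh noise each time.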

    • LogicalDrivel@sopuli.xyz · 7 up · 3 months ago

      I forget the name, but there was/is an add-on that obfuscates your data by clicking on every ad and searching random things in the background. I’m sure something similar could be made for this.

  • Treczoks@lemmy.world · 2 up · edited · 3 months ago

    Given the shit that AI does, like deleting databases and lying about it, or telling people looking for support to kill themselves, why do you think data poisoning doesn’t work?

  • brucethemoose@lemmy.world · 2 up · 3 months ago

    With Reddit, specifically, they seem pretty hardcore about rolling back profile “cleansing.” I think the effort failed, sadly, as did a lot of Reddit uproar.

  • Sanctus@lemmy.world · 2 up · 3 months ago

    Just set a bot up to pull random search terms from a huge dictionary and let it run all day on a browser signed into your account if you want to do that. I think most people focus on blocking the tracking now.
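    A minimal sketch of the "random search terms from a dictionary" part, in Python. Everything named here is an assumption for illustration: `SEARCH_URL` is a placeholder endpoint, the word list is a tiny stand-in for a real dictionary, and the code only builds the URLs. It doesn't drive a browser or send any requests.

```python
import random
from urllib.parse import urlencode

# Placeholder endpoint (an assumption); swap in whatever search engine you use.
SEARCH_URL = "https://example.com/search"


def random_query(words: list[str], rng: random.Random, max_terms: int = 3) -> str:
    """Pick between one and max_terms distinct words and join them into a query."""
    count = rng.randint(1, min(max_terms, len(words)))
    return " ".join(rng.sample(words, count))


def decoy_urls(words: list[str], n: int, rng: random.Random) -> list[str]:
    """Build n search URLs with random queries. Nothing is sent over the network."""
    return [
        f"{SEARCH_URL}?{urlencode({'q': random_query(words, rng)})}"
        for _ in range(n)
    ]


if __name__ == "__main__":
    # Tiny stand-in for the "huge dictionary"; a real word list file
    # could be loaded here instead.
    words = ["lantern", "gravel", "oboe", "tundra", "pickle", "ledger"]
    for url in decoy_urls(words, 5, random.Random()):
        print(url)
```

    To actually run it all day in a signed-in browser you'd still need some browser automation plus random delays between searches, which is left out here.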