Its a bit old, but I just learned it via the retro-dodo article here: https://retrododo.com/google-is-killing-retro-dodo/
Is it just me or are 60 million a ridiculously small price for that whole dataset?
How quickly you forget that half of it is just “I also choose this guy’s wife” and “the narwhal bacon’s at midnight”
I’m personally curious whether Reddit actually has any ability to protect that database. I don’t remember Reddit TOS, but usually those things give them license to use and copy the data, maybe even to sell it, but not actually the copyright on it. So if someone made a Reddit scraper and copied the comments, wouldn’t only the actual commenter be able to sue?
$60M may be reflecting that, in that it’s more a convenience fee to shield Google against individual Redditors going after them than something that Reddit itself could actually sue over.
Considering it’s all full of Nazis and bots, and if you get to filter all of them out you’re left with reposts and low quality memes followed by comments that represent the hostile side of each of us… I’d say anything over $5 is a good deal for spez.
Now, I hope Google uses this data exclusively for detecting inappropriate answers. Can you imagine it giving answers based on the endless threads i of " I’m not your mate, bro; I’m not your bro, dude…".
It’s more than they were making from third party apps, hence the ridiculous API fees.
Can’t wait to see an AI chatbot in my Google searches that behaves like a typical redditor.
I mean one of the most popular search types on Google is <topic + Reddit> so not much would change
Just wait till the LLM starts “singing” randomly to you.
deleted by creator
– Hey Google/reddit, what does xxxxxx mean?
–Wtf is people so lazy, Google it yourself it’s only 5 seconds!
–But but, you are Google, are you not?
–Buahaha , haha!
Oh no, my thousands of identical messages!
You sir are a scholar and a gentleman.
I also choose this man’s wife.
This
Scrolled too far down to find this
And my axe
God, the taste their AI is about to garner for coconuts.
Like they don’t have the data prior to the overwrite… These tools need to update over time
Steve Huffman looks increasingly douchier and shittier with every passing photo.
What a damn chode. Fuck that guy.
AI be like “stfu regard”
AI be like there things are over they’re
This is the Way.
Can someone point me the way of that bot or whatever that changes all your old Reddit posts before deleting them? I thought I had it saved somewhere but I can’t find it now and have no idea what it’s called.
They keep copies of posts because people who mass edited their posts saw them reverted or have people reply still as if they were not edited.
I had read that with some people, is was a delay from their server instance between read/write and in the end the changes did end up sticking, but I don’t know if that was true. A lot of people were mass editing at the same time, and since editing isn’t something that happens super frequently, it might have less priority in the stack and caused backups.
They change it on their website but the data that’s collected and sold isn’t changed.
It still devalues their google search though but also makes it harder to scrap data for free and ups the value of what they are selling.
For sure. They definitely have change records for everything. It would be borderline negligent if they didn’t.
Plus they can easily just detect mass edits, and ship the state prior to that event.
I hope AI sais fuck Spez a lot:-)
You can be sure the little fucker hired people to filter out that sort of stuff from the data
I deleted my comment history after the API exodus. I’m sure they could dig it up if they wanted but at least they’ll have to click like 3 more buttons if they want to train AI on my nonsense.
Before:
SELECT * FROM `comments` WHERE is_deleted=0;
After:
SELECT * FROM `comments`;
And this is how Skynet was born.
That one Microsoft Twitter bot turned into a full blown Nazi in just one day.
I can’t even imagine how fucked up and depraved one trained on Reddit data will get.
They have a series of safeguards against that now. They’ve actually taken it in the extreme other direction now where it can’t give you anything without injecting diversity in there somewhere.
Here’s an example. This is what it produced when asking for an image of a German soldier in 1943.
They may be able to alter what it says, but they can’t alter what it thinks.
60m? Ms got a steal no wonder Reddit can’t monetize
As part of the deal, spez will personally train the AI Jailbait Model.
Grateful this is no longer my problem
is there a way to mass delete my old content? the service i used in the past doesn’t seem to have worked. i recently got a reply from a 6 year-old post from someone saying they got there on google.
My understanding is that the mass delete you did probably had worked, but reddit rolled back your deletions. I heard it happened to a lot of mass deleters after the lemmy exodus.
Can we still mass edit our previous comments with random stuff, a little bit at a time to avoid detection? Poison the data, yada yada.
Is worse nothing gotten really deleted admins admitted in like 2018 that they can see deleted posts. I think even some mods can. The access they give to Google is to the backend they can see EVERYTHING.
I think I’m gonna be sick. so all the stuff I wrote, it’s just THERE? what the fuck do i do? what about private info that I dont want on a public fucking search engine?? I’ve had that account since I was a kid, there’s a lot of shit I regret posting, what the FUCK!
Yes but they just reverse it. That ship has sailed.
Google cached it maybe?
That’s not great news when weighed against my desire to watch reddit crash and burn.
All we can do is make something better, reddit will do their thing and we will do ours.
Minimum royalty laws should exist.