Xanthrax ,

It already happened without their consent. You’ve been able to get it to produce “reddit text posts” for years. This is a bit harrowing, though.

BigTrout75 ,

So AI models are not farming the federation?

nightwatch_admin ,

They probably are, but not the personal/private info like chats/DMs, upvotes or downvotes, geolocation, etc., which I highly suspect Reddit did sell.

KairuByte ,

Just FYI, your voting is fully public on Lemmy. DMs are “private” but could be intercepted at the server level of any instances involved (yours and the receiver/sender) and of course your geolocation info is visible to the server.

Not saying that is happening, and not trying to spread FUD, but be aware that your info isn’t necessarily private just because a corpo isn’t directly involved.

nightwatch_admin ,

You are absolutely right, and I think people should be more aware of this.

FlyingSquid ,

Me too, maybe then assholes will stop whining about me downvoting them when I didn’t. As if it matters.

4grams ,

I am so glad I deleted all my posts. I’m sure they have backup history though :(

asymmetric ,

One of the original Reddit memes was quite prescient:

https://i.imgur.com/Fza1Cut.jpg

KairuByte ,

Their content?

unionagainstdhmo ,

That’s what I was thinking

SurRoulettes ,

I wouldn’t be surprised if comments become their intellectual property through some terms of services bullcrap

uis ,

Meanwhile I’m on Matrixstoemmy

NutWrench ,

Reddit is all bots, porn, ads and political shit posts. Good luck getting any useful training content out of that.

ladicius ,

Maybe that’s the point? Training the AI to produce the blabbering bullshit that’s preferred in social media?

HawlSera ,

I wish it would die, because honestly some of the porn was great and Lemmy seems to be the one place on the net that doesn’t specifically ban porn, yet has none of it anyway.

I miss bodyswap and part tf captions…

PoliticalAgitator ,

They don’t care if the AI produced is useful, they just want to milk as much money from their content as they can.

The API changes were almost certainly just the groundwork for this and I called it at the time. The ridiculous pricing model for API access is because it’s aimed at the hottest tech companies, not third party app developers.

The enshittification continues because it’s what neoliberalism demands. They’ll sell your content and the data they have about you and still show you ads, because that’s the most profitable. Ethics and product quality don’t even enter into it.

Ilgaz ,

A liberal market gives end users a choice. If they don’t choose, they get the consequences.

This is more like people choosing Trump-like types and then complaining. Alternatives exist; choose one.

PoliticalAgitator ,

“The free market can fix it” is just another neoliberal lie, pushed precisely because it doesn’t work. Rather than holding corporations accountable, it blames the population instead.

The reality is that boycotting businesses isn’t always an option and when it is, it’s usually a luxury. Very few products are domestically and/or ethically produced and when they are, they’re extremely expensive, especially for people being fucked out of every cent by their bosses, landlords and utilities.

It’s why the most hated companies in the world continue to bring in record profits.

Regulations are the real answer, which is why neoliberals oppose them.

Ilgaz , (edited )

I really don’t care about people who behave like they are living in North Korea or who want a North Korean world to live in.

Even Digg people could say “No, F you” to Digg superstar owners. It is just a damn URL to type.

tigerjerusalem , (edited )

Reddit is a trove of user built content under the guise of community. What Spez did was to say “thanks for all the free work, suckers!”, put a price sticker on it, and laughed all the way to the bank.

And this is why I’m not active on any Internet community anymore. Nevermind, I guess I just can’t help myself…

nodsocket ,

And this is why I’m not active on any Internet community anymore,

you typed.

Rascabin ,

You couldn’t see the sarcasm because it was set to “hidden”.

xorollo ,

Somebody asked ChatGPT to pose as a normal internet user and populate the comments section, manufacturing content for normal internet users to respond to so that they can keep building up their training models.

tigerjerusalem ,

Active as in “creating meaningful contributions and contributing to the overall knowledge base”. I still shit post from time to time.

Kolrami ,

This is going to be a really weird thing to argue, but I just casually read through a bunch of your comments and they seem like meaningful contributions.

nightwatch_admin ,

^ this comment right here, officer.

tigerjerusalem ,

Well, I guess I can’t help myself… I’ll shitpost more from now on 😅

Adulated_Aspersion ,

And that is another unintended example of why all of my post history was purged before migration.

DScratch ,

What are the odds that they kept it in a backup?

RootBeerGuy ,

Depends. If they were smart, they backed up all content that had a certain number of upvotes and/or a certain number of paragraphs and/or responses, just to weed out all the 2-3 word comments that no one interacted with. If OP wrote mostly those, then Reddit doesn’t give a shit about them deleting those.

Crack0n7uesday ,

Some 4chan users created a backup bot that auto saves every few hours, so if reddit didn’t do it already, 4chan has been doing it for a while. The bot was originally made for 4chan but repurposed for other websites, reddit included.

Dozzi92 ,

Yeah, it’s all too late. Shit, PRISM was 2007, so there’s a copy of everything somewhere. Obviously different ends.

Ilgaz ,

Spez-like people are even capable of leeching archive.org and then selling the data, which was archived with good intentions.

jjlinux ,

Welcome to the club.

redfox ,

Don’t cheat yourself just because there are douches that take advantage…

erAck ,

It will get trained on some comment posts.

Let reddit die. Join Lemmy or /kbin. join-lemmy.org kbin.pub

Evotech ,

And what’s to stop instance owners from selling their data?

bigMouthCommie ,

shame

Toneswirly ,

Mass user exodus to one of the many other identical instances. Also, data brokers prolly aren’t interested in going after each instance because no one instance has enough data to make it worthwhile. Yet again, the fediverse proves its resistance to enshittification.

werefreeatlast ,

Yes, it’s not worth running an instance! So let’s all run one! LOL. It’s so worth it. Fuck reddit.

Toneswirly ,

you OK bud?

JackbyDev ,

Lmao, if it gets as big as Reddit then it’s worth scraping. It’s not the fediverse making it less worthwhile, just the size.

nodsocket ,

The eggs are not all in one basket. Less data to sell.

meat_popsicle ,

Thanks to federation, the copies of the eggs are. You can’t stop one instance from selling data sourced from federated content until it’s too late.

drathvedro ,

You can’t put a price tag on it. Nothing is stopping anyone from scraping all of the data for free.

MostlyGibberish ,

The only thing stopping them is the fact that anyone who wants the data can just utilize the federation protocol to take any data they want, and there’s not a lot anyone can do about it. You can’t sell something that’s trivial to get for free.

If the question you’re really asking is “what’s stopping content on Lemmy/Mastodon/etc from being used to train an LLM?” the answer is, nothing.

Ilgaz ,

I wish they had evil lawyers looking after such stuff and sold strictly opt-in data to AI corps. Free for FOSS though.

Bobmighty ,

With Reddit’s severe bot problem, it’ll be like training on unfiltered sewage. Garbage in, garbage out.

captain_oni ,

Machines training machines? How perverse!

ohlaph ,

Gross

DozensOfDonner ,

Why does it sound like Reddit-trained AI will only get dumber?

jol ,

That would explain why GPT is often so confidently incorrect.

aidan ,

laughs villainously This is all going to plan, now there will be some chatbot spewing my insane beliefs

garibaldi_biscuit ,

This is what the change to 3rd-party API access was really all about.

When API access was allowed, all Reddit content was effectively free. They needed to ban 3rd-party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.

bier ,

Is it? Because when you build a bot and just scrape Reddit I don’t think you can just use the content to train AI, just like the New York Times. The API change was definitely to sell more ads and get a higher IPO, but I don’t think it was because of AI.

Empricorn ,

Am I crazy or are you arguing the same point? Scraping is not the same as API access. They closed off the API to everyone for dubious reasons so they can sell that content (both for ads and AI training)… Right??

bier , (edited )

No, you’re not; the post was edited. The original one said it was all because of AI, that the entire reason for the API change was to sell to AI companies.

Edit: now I’m in doubt, because if you edit a post, that is shown somehow, right?

Edit2, just to be clear my point is that Reddit content was never free, before and after the API change. It’s easier to get the content with a decent API, sure. But it was never free, just like the lawsuit the NY Times started.

Strayce ,

Considering how much of Reddit is already bots, I’m sure this will end fantastically.
