tabular ,

I despise this use of mod power in response to a protest. It’s our content to be sabotaged if we want - if Stack Overlords disagree then to hell with them.

I’ll add Stack Overflow to my personal ban list, just below Reddit.

redisdead ,

Once submitted to stack overflow/Reddit/literally every platform, it’s no longer your content. It sucks, but you’ve implicitly agreed to it when creating your account.

The_Vampire ,

While true, it’s stupid that things are that way. They shouldn’t be able to hide behind the idea that “we’re not responsible for what our users publish, we’re more like a public forum” while also having total ownership over that content.

tabular ,

“you’ve implicitly agreed to it when creating your account”

Many people would agree with that, probably most laws do. However, I doubt many users have actually bothered to read the unnecessarily long document, fewer have understood the legalese, and the terms have likely already been changed (“pray I don’t alter it any further”). That’s a low and shady bar of consent. It indeed sucks and I think people should leave those platforms, but I’m also open to laws that would invalidate that part of the EULA.

merthyr1831 ,

If I were Stack Overflow, I would’ve transferred my backups to OpenAI weeks before the announcement for this very reason.

This is also assuming the LLMs weren’t already fed with scraped SO data years ago.

It’s a small act of rebellion but SO already has your data and they’ll do whatever they want with it, including mine.

trailee ,

It’s true that it’s mostly a symbolic act, but the rebellion matters, especially from old accounts. It’s also a nice way to mark the time after which I never participated in SO again. After my ban expires, I’ll deface my questions again. And again. Until they permaban me.

trailee ,

There’s also the possibility of adding to the wonderful irony of making the AI more useful than the original by having content that’s no longer accessible through the original. It doesn’t get more enshittified than that, even if Prashanth Chandrasekar is too out of touch to ever regret his decision.

Muffi ,

I think you’re 100% correct in assuming they’ve already fed it data scraped from SO. I’ve previously gotten code samples from ChatGPT that were clearly from SO, down to the comments in the code. I even reverse-searched some of the code and found the question it was from.

mint_tamas ,

OpenAI clearly already scraped the pre-LLM (aka actually useful) content from SO; this entire deal is happening after the fact to avoid litigation.

andrade ,
ChaoticEntropy ,

How many trees does a person need to make one coffin…

WindyRebel ,

I counted around 30-32 in panel 2.

ChaoticEntropy ,

Thank you for your diligence.

old_machine_breaking_apart ,

It’s a metaphor for us killing ourselves in the process of deforestation, not a story of someone actually making a coffin.

ChaoticEntropy ,

It may not have been a wholly serious question.

old_machine_breaking_apart ,

You got me. I should stop taking things too literally

Dkarma ,

You’re not a wholly serious person

ChaoticEntropy ,

You wound me.

RagingHungryPanda ,

measure once, cut 15 times

ChaoticEntropy ,

I guess you never want to be buried in your first attempt.

johny_joe_1975 ,

The same thing as Reddit, but now in a professional community

Idontreallyknow ,

enshittification

dojan ,

Stackoverflow counts as professional now? Wasn’t the general perception that it’s an incredibly toxic space?

trailee ,

They seem to only be watching the questions right now. You’re automatically prevented from deleting an accepted answer, but if you answered your own question (maybe because SO was useless for certain niche questions a decade ago so you kept digging and found your own solution), you can unaccept your answer first and then delete it.

I got a 30 day ban for “defacing” a few of my 10+ year old questions after moderators promptly reverted the edits. But they seem to have missed where I unaccepted and deleted my answers, even as they hang out in an undeletable state (showing up red for me and hidden for others).

And comments, which are a key part to properly understanding a lot of almost-correct answers, don’t seem to be afforded revision history or to have deletes noticed by moderators.

So it seems like you can still delete a bunch of your content, just not the questions. Do with that what you will.

schnurrito ,

Messages that people post on Stack Exchange sites are literally licensed CC-BY-SA, the whole point of which is to enable them to be shared and used by anyone for any purpose. One of the purposes of such a license is to make sure knowledge is preserved by allowing everyone to make and share copies.

9point6 ,

Share Alike

I can’t wait to download my own version of the latest gpt model

bbuez ,

It does help to know what those funny letters mean. Now we wait for regulators to catch up…

/tangent

If anything, we’re a very long way from anything close to intelligent. OpenAI (and subsequently MS, being publicly traded) sold investors on the pretense that LLMs are close to being “AGI”, and now more and more data is necessary to achieve that.

If you know the internet, you know there’s a lot of garbage. I for one can’t wait for garbage-in garbage-out to start taking its toll.

Also I’m surprised how well open source models have shaped up; it’s certainly worth a look. I occasionally use a local model for “brainstorming” in the loosest terms, as I generally know what I’m expecting, but it’s sometimes helpful to read tasks laid out. There’s also comfort in that nothing need even leave my network, and even in a pinch I got some answers when my network was offline.

It gives a little hope, while corps get to blatantly violate copyright while wielding it so heavily themselves, that advancements have been so great in open source.

kerrigan778 ,

That license would require ChatGPT to provide attribution every time it used training data of anyone there and also would require every output using that training data to be placed under the same license. This would actually legally prevent anything ChatGPT created even in part using this training data from being closed source. Assuming they obviously aren’t planning on doing that, this is massively shitting on the concept of licensing.

JohnEdwa ,

CC attribution doesn’t require you to necessarily have the credits immediately with the content, but it would result in one of the world’s longest web pages as it would need to have the name of the poster and a link to every single comment they used as training data, and stack overflow has roughly 60 million questions and answers combined.

Scrollone ,

They don’t need to republish the 60 million questions, they just have to credit the authors, which are surely way fewer (but IANAL)

JohnEdwa ,

appropriate credit — If supplied, you must provide the name of the creator and attribution parties, a copyright notice, a license notice, a disclaimer notice, and a link to the material. CC licenses prior to Version 4.0 also require you to provide the title of the material if supplied, and may have other slight differences.

Maybe that could be just a link to the user page, but otherwise I would see it as needing to link to each message or comment they used.

theherk ,

Maybe but I don’t think that is well tested legally yet. For instance, I’ve learned things from there, but when I share some knowledge I don’t attribute it to all the underlying sources of my knowledge. If, on the other hand, I shared a quote or copypasta from there I’d be compelled to do so I suppose.

I’m just not sure how neural networks will be treated in this regard. I assume they’ll conveniently claim that they can’t tie answers directly to underpinning training data.

fruitycoder ,

IF its outputs are considered derivative works.

kerrigan778 ,

Ethically and logically it seems like output based on training data is clearly derivative work. Legally I suspect AI will continue to be the new powerful tool that enables corporations to shit on and exploit the works of countless people.

fruitycoder ,

The problem is the legal system and thus IP law enforcement is very biased towards very large corporations. Until that changes corporations will continue, as they already were, exploiting.

I don’t see AI making it worse.

General_Effort ,

They are not. A derivative would be a translation or a theater play or, nowadays, a game or movie. Even stuff set in the same universe.

Expanding the meaning of “derivative” so massively would mean that pretty much any piece of code ever written is a derivative of technical documentation and even textbooks.

So far, judges simply throw out these theories, without even debating them in court. Society would have to move a lot further to the right, still, before these ideas become realistic.

bitwolf ,

Rather than delete, modify the question so it’s wrong. Then the AI will hallucinate.

Sabata11792 ,

I just expect it to insult the user while not answering the question.

Zink ,

As a large language model, I expect you to use the search function. Asshole.

jnk ,

Have you tried to read the fucking manual you filthy lazy fuck? Marked as solved. Is there anything else i can do to help you? 😊

Sabata11792 ,

Perfect. I can't tell a difference.

zaphod ,

Letting corporations “disrupt” forums was a mistake.

RamblingPanda ,

Stack Overflow was great when it appeared. The info was spread out incredibly wide and there was a lot of really shitty info as well. One place where it could accumulate and be rated was extremely helpful.

But maybe it’s time to create a federated open source Stack Overflow.

EnderMB ,

They also committed to providing open dumps of their data to make it free to all. At the start, they were doing all the right things.

JackbyDev ,

It all went downhill when they banned and slandered Monica.

ultra ,

Who’s Monica?

JackbyDev ,

Take a read here. It’s a better summary than I could manage.

ultra ,

Oh, that’s pretty bad… thanks for the link

ZILtoid1991 ,

I once managed to find a pretty good alternative, but then I forgot its name. It was a very chill community, unlike what Stackoverflow was recently with its toxicity (properly formatted question police, people being offended for less popular languages, etc.).

just_another_person ,

I got an email ban.

1609 hours logged, 431 solved threads

Guru_Insights99 ,

Well, it is important to comply with the terms of service established by the website. It is highly recommended to familiarize oneself with the legally binding documents of the platform, including the Terms of Service (Section 2.1), User Agreement (Section 4.2), and Community Guidelines (Section 3.1), which explicitly outline the obligations and restrictions imposed upon users. By refraining from engaging in activities explicitly prohibited within these sections, you will be better positioned to maintain compliance with the platform’s rules and regulations and not receive email bans in the future.

HauntedCupcake ,

Is this a joke?

tearsintherain ,

Nope, it’s the “establishment is cool, Elon rocks” type.

goferking0 ,

Hopefully a troll account after looking at other comments but who knows anymore

FlorianSimon ,

This is an ironic ChatGPT answer, meant to (rightfully) creep you out.

Potatos_are_not_friends ,

NGL I read it and laughed at the AI-like response.

Then I felt sadness knowing AI is reading this and will regurgitate it back out.

pivot_root ,

AI-generated content trained on LLMs is poison for training, so that’s actually a good thing :)

TachyonTele ,

It’s not. This is how this person talks in every comment they make.

FlorianSimon ,

Are they not a ChatGPT troll account or a bot?

TachyonTele ,

Tough to say. I honestly don’t know. The user name is the classic word_wordNumber that bots use. The comments are long though. But its comments are spaced far apart timewise.

If it’s a joke account it’s doing it rarely.

Emmie ,

Comments are clearly ChatGPT. I know because I did it once to troll some sub too. I instantly recognize the pirate ‘swashbuckling’ comment in their profile history that you get when you type ‘write a funny comment like a Redditor’.

slaacaa ,

Damn, I read some of their other comments. What a sad and weird life this person might have to write walls of text just to gather dozens of downvotes

Emmie ,

Maybe they are a walking ai poisoning attack. I mean the whole person

floofloof ,

The account reads like they’re pasting AI-generated responses to everything. Maybe it’s someone’s experiment. The prompt must include “You are a self-righteous asshole.”

homesweethomeMrL ,

Yes and it’s very well done which is why 121 people who didn’t get it downvoted it. ha! No good comment, amirite.

gravitas_deficiency ,

Check the post history. Dude just seems like an ass.

PumaStoleMyBluff ,

Looks like a chat bot instructed to say something contrarian

Tikiporch ,

Looks like an AI crafted response to me.

lagomorphlecture ,

I took it as a joke because they can just change the rules whenever they want but Idk I might have misunderstood.

gravitas_deficiency ,

Nah, but the user is. Their post history is… interesting.

Rai ,

ITT: People unable to recognize a joke

Cornelius_Wangenheim ,

Jokes are supposed to be funny.

gandalf_der_12te ,
Grandwolf319 ,

Shit like this makes me so glad that I just don’t sign up for these things if I don’t have to.

30 page TOS? You know what, I don’t need to make an account that bad.

gravitas_deficiency ,

lol wow this is going even more poorly than I thought it would, and I thought my kneejerk reaction to the initial announcement was quite pessimistic.

Emmie , (edited )

I will answer some questions with my old account using gpt 4 to poison the data.

If you want to poison SO a little while at the same time providing valid answers that help users, use the outlook.com email domain for new accounts. It seems not to have anti-throwaway countermeasures while being accepted by SO. And it seems fitting to bash the corporate with the corporate.

spikespaz ,

What about the outlook thing? Don’t understand.

skeezix ,

Best I can tell… If you want to poison your significant other, communicate with outlook as a throwaway email account.

KeenFlame ,

I don’t understand what anyone wins from this

Corporations are foundationally evil

And how do they not win more if we poison the entire Internet?

It’s like being in a toxic relationship with kids involved

Set boundaries

Follow rules

Don’t destroy the fucking fruit of your bodies just because you are angry at each other

Fuck those guys, like a lot, for taking your given data and selling

And fuck open ai for trying to make money from scientific discoveries meant for all of humanity

But what the fuck with ruining the entire Internet?

Who gets anything then?

If language models will ruin Internet why be afraid that normal human responses are available? Wut?

MataVatnik ,

Maybe a better act of rebellion would be to scrape the data on stack, self host it, and move to an open source platform. Easy for me to say though, when I only ever coded Hello World

0oWow ,

Anyone care to explain why people would care that they posted to a public forum that they don’t own, with content that is now further being shared for public benefit?

The argument that it’s your content becomes false as soon as you shared it with the world.

Emotet ,

It’s not shared for public benefit, though. OpenAI, despite the Open in their name, charges for access to their models. You either pay with money or (meta)data, depending on the model.

Legally, sure. You signed away your rights to your answers when you joined the forum. Morally, though?

People are pissed that SO, which was actively encouraging mods to use AI detection software to prevent any LLM usage in the posted questions and answers, is now selling the publicly accessible data, made by their users for free, to a closed-source for-profit entity that refuses to open itself up.

Basically the same story as with reddit.

golli ,

Agreed. As you said it’s a similar situation as with reddit, where I decided to delete my comments.

My reasoning is that those contributions were given under the premise that everybody was sharing to help each other.

Now that premise has changed: the large tech companies are only taking and the platform providers are changing the rules as well to profit from it.

So as a result I packed my things and left, in case of reddit to here.

That said I think both views are valid and I wouldn’t fault those that think differently.

TheOneCurly ,

I can only really speak to reddit, but I think this applies to all of the user generated content websites. The original premise, that everyone agreed to, was the site provides a space and some tools and users provide content to fill it. As information gets added, it becomes a valuable resource for everyone. Ads and other revenue streams become a necessary evil in all this, but overall directly support the core use case.

Now that content is being packaged into large language models to be either put behind a paywall or packed into other non-freely available services. Since they no longer seem interested in supporting the model we all agreed on, I see no reason to continue adding value and since they provided tools to remove content I may as well use them.

0oWow ,

But from the very beginning years ago, it was understood that when you post on these types of sites, the data is not yours, or at least you give them license to use it how they see fit. So for years people accepted that, but are now whining because they aren’t getting paid for something they gave away.

TheOneCurly ,

This is legal vs rude. It certainly is legal and was in the terms of service for them to use the data in any way they see fit. But it’s also rude to bait and switch from being a message board to being an AI data source company. Users were led to believe they were entering into an agreement with one type of company and are now in an agreement with a totally different one.

You can smugly tell people they shouldn’t have made that decision 15 years ago when they started, but a little empathy is also cool.

Additionally: When you owe your entire existence and value to user goodwill it might not be a great idea to be rude to them.

LainTrain ,

Lol it ain’t for public benefit unless it’s a FOSS model with which I’d have no issue

possiblylinux127 ,

Well no, when you post something it is public and out of your control

LainTrain ,

No, you can’t post something in public and have it appropriated by a mega corp for money and then prevent you from deleting or modifying the very things you posted.

I’m pro-AI btw. But AI for all.

possiblylinux127 ,

You agreed to it

gencha ,

It is your content. But SE specifically only accepts CC licensed content, which makes you right.

Snapz ,

You can’t quit, you’re fired!!!

sugar_in_your_tea ,

Cool, now I can go collect unemployment. :)

Snapz ,

You can’t, you’re hired!

archomrade ,

Data should be socialized and machine learning algorithms should be nationalized for public use.

explodicle ,

Better yet, copyright should be abolished completely.

spez_ ,

Public+ no copyright

assassin_aragorn ,

It should stay for creative works but that’s it. It should protect people who actually write books, compose music, make art, and sing. It shouldn’t be held by corporations forever by leeching off their workers.

laurelraven , (edited )

Creative works of individuals, specifically… Corporations should explicitly be deemed not people and not possessing of the same rights as people, and the fact that this needs to be said just goes to show how far down the shit hole we’ve fallen

Olhonestjim ,

Corporations should be outlawed from owning houses and land as well. Maybe they can own the building, but they must be forced to rent the land from Us.

WldFyre ,

Wouldn’t that make AI training data easier to obtain?

explodicle ,

No, you can already legally download copyrighted data and use it for training.

WldFyre ,

That’s what I thought, that’s why I was confused about your reply haha

My bad!

afraid_of_zombies ,

I think you just invented the public library
