Stack Overflow mods go on strike

salarua ( @salarua@sopuli.xyz ) · 11 months ago

Hyperz ( @Hyperz@beehaw.org ) · 11 months ago

It seems to me like StackOverflow is really shooting themselves in the foot by allowing AI generated answers. Even if we assume that all AI generated answers are “correct”, doesn’t that completely destroy the purpose of the site? Like, if I were seeking an answer to some Python-related problem, why wouldn’t I go straight to ChatGPT or similar language models instead then? That way I also don’t have to deal with some of the other issues that plague StackOverflow such as “this question is a duplicate of <insert unrelated question> - closed!”.

OrangeSlice ( @14specks@lemmy.ml ) · 11 months ago

I think what sites have been running into is that it’s difficult to tell what is and is not AI-generated, so enforcement of a ban is difficult. Some would say that it’s better to have an AI-generated response out there in the open, which can then be verified and prioritized appropriately from user feedback. If there’s a human-generated response that’s higher quality, then that should win anyway, right? (Idk tbh)

Hyperz ( @Hyperz@beehaw.org ) · 11 months ago

Yeah that’s a good point. I have no idea how you’d go about solving that problem. Right now you can still sort of tell sometimes when something was AI generated. But if we extrapolate the past few years of advances in LLMs, say, 10 years into the future… There will be no telling what’s AI and what’s not. Where does that leave sites like StackOverflow, or indeed many other types of sites?

This then also makes me wonder how these models are going to be trained in the future. What happens when for example half of the training data is the output from previous models? How do you possibly steer/align future models and prevent compounding errors and bias? Strange times ahead.

OrangeSlice ( @14specks@lemmy.ml ) · 11 months ago

This then also makes me wonder how these models are going to be trained in the future. What happens when for example half of the training data is the output from previous models? How do you possibly steer/align future models and prevent compounding errors and bias? Strange times ahead.

Between this and the “deep fake” tech I’m kinda hoping for a light Butlerian jihad that gets everyone to log tf off and exist in the real world, but that’s kind of a hot take

Hyperz ( @Hyperz@beehaw.org ) · 11 months ago

But then they’d have to break up with their AI girlfriends/boyfriends 🤔.

I wish I was joking.

cavemeat ( @cavemeat@beehaw.org ) · 11 months ago

My guess is the internet is gonna go through a trial by fire regarding AI: some stuff is gonna be obscenely incorrect, or difficult to detect, before it all straightens out.

salarua ( @salarua@sopuli.xyz ) · 11 months ago

there are some pretty good AI-generated text detectors out there like GPTZero. i wouldn’t be surprised if mods used that to screen comments

OrangeSlice ( @14specks@lemmy.ml ) · 11 months ago

My understanding was that they’re very unreliable in their current state, but I’m definitely not up to speed.

Pigeon ( @Lowbird@beehaw.org ) · 11 months ago

I’ve been seeing so many stories about student work getting falsely flagged as AI generated. It really feels bad to be accused of that, I think. So I can see why it would be better to avoid trying to determine one way or the other if something is AI generated, for now.

All that matters for an answer to a question is whether it’s right, partly right, completely dead wrong, and so on, right? And that can still be judged regardless of whether it’s AI.

AI absolutely shouldn’t be outright invited either, though.

OrangeSlice ( @14specks@lemmy.ml ) · 11 months ago

Yeah that’s kinda where I’m at

Dear Stack Overflow, Inc.