OpenAI has a 99.9% accurate ChatGPT AI text detector, but won't release it.

rozodru , 1 month ago

a search bar for your DB doesn’t count guys.

vrighter , 1 month ago (edited 1 month ago)

it’s only 99.9% accurate because they haven’t released it. As soon as they do, it will quickly fall to 50% as usual. Because this type of thing is exactly what’s needed to develop tech to defeat itself.

aodhsishaj , 1 month ago

What?

Nighed , 1 month ago

Once you have an AI detector, you can use its results to train your AI to pass the detector.

drmoose , 1 month ago

Lots of misinformation in this thread. Yes, they have it, and it’s good, but it’s probably nowhere close to 99.9% accuracy.

The primary way to detect AI is to inject a fingerprint into AI generation in the first place. This means only the model creators can do that. We don’t know exactly how the fingerprint works, but it can be as simple as preferring one synonym over another. For example, preferring words like “illustrate”, “peer” etc. quickly adds up to a statistical fingerprint.

These techniques pre-date chatgpt itself and do work! However there are a lot of caveats:

The fingerprint has to be trained for each model meaning each model version performs slightly differently and only owners know the fingerprint.

The fingerprint test can only work on longer bodies of text that are not modified further.

Extending the model with more complex instructions (like character or tone) or with RAG can significantly decrease the effectiveness.

The industry is understandably very secretive about it but your low effort chatgpt copy/paste can be detected by OpenAI and nobody else.

As for public release of the fingerprint: they can’t as it can be reverse engineered so it’s only valuable as an internal tool for now. Also if released it would serve no real purpose as detection can be easily defeated by remixing content to dilute the fingerprint.
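A minimal sketch of how a synonym-preference fingerprint could be scored, assuming a secret key decides which word is "preferred" after each previous word. The vocabulary, the key, and the function names are all made up for illustration; OpenAI's actual scheme is not public and may work quite differently.

```python
import hashlib
import math
import random

# Placeholder vocabulary; a real model would use its own token set.
VOCAB = [f"w{i}" for i in range(50)]


def is_preferred(prev_word, word, key="demo-key"):
    """Hypothetical rule: a keyed hash marks about half of all
    (previous word, word) pairs as 'preferred'."""
    digest = hashlib.sha256(f"{key}|{prev_word}|{word}".encode()).digest()
    return digest[0] % 2 == 0


def generate_marked(n_words, key="demo-key", seed=0):
    """Toy generator that picks a preferred next word whenever one exists."""
    rng = random.Random(seed)
    words = [rng.choice(VOCAB)]
    while len(words) < n_words:
        preferred = [w for w in VOCAB if is_preferred(words[-1], w, key)]
        words.append(rng.choice(preferred or VOCAB))
    return " ".join(words)


def fingerprint_z_score(text, key="demo-key"):
    """Standard deviations by which the count of preferred pairs exceeds
    what unmarked text would give (about half of all pairs)."""
    words = text.lower().split()
    pairs = list(zip(words, words[1:]))
    if not pairs:
        return 0.0
    hits = sum(is_preferred(a, b, key) for a, b in pairs)
    n = len(pairs)
    return (hits - 0.5 * n) / math.sqrt(0.25 * n)


marked = generate_marked(200)
print(fingerprint_z_score(marked))                   # scored with the right key
print(fingerprint_z_score(marked, key="other-key"))  # scored without the secret
```

With n word pairs the score can never exceed sqrt(n), which is one way to see why the test needs longer bodies of text. Scoring without the right key should hover near zero, and every rewritten word pulls a marked text back toward zero as well.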

EnderMB , 1 month ago

Agreed. Frankly, if someone were to say “we can detect with 99% accuracy” I imagine that someone would say “well, clearly your measurements are wrong, find the issue and come back to us when it’s fixed”.

conciselyverbose , 1 month ago

but your low effort chatgpt copy/paste can be detected by OpenAI and nobody else

Low effort copy pastes can absolutely be detected by people who aren’t OpenAI. The “advanced” vocabulary and excessively formal grammar, used correctly but with clear and significant comprehension gaps, are pretty damn consistent. You won’t get perfect reliability, but you’ll catch most of it and you won’t have a huge number of false positives.

Real people don’t sound like GPT.

drmoose , 1 month ago

No, that’s in no way a reliable way of catching anyone, and I hope people smarten up and avoid this snake oil entirely. I’m borderline jealous of how these “ai catchers” are making so much money from straight up snake oil.

conciselyverbose , 1 month ago (edited 1 month ago)

An algorithm can’t.

Plenty of humans absolutely can. LLM writing is genuinely fucking terrible. It has the slightly stilted over formality of most non-native speakers, without the intelligence being fluent in a second language implies.

Flawless grammar with a complete absence of any sign of intelligence is not something you get regularly from humans.

drmoose , 1 month ago

The “can” is irrelevant here. A checking tool has to be reliable to be useful. What’s the use of having a checker that maybe detects something sometimes somewhat successfully?

conciselyverbose , 1 month ago

There’s a massive gap between “you can’t make a tool” and “you can’t identify it”.

The problem with a tool is the exact same as the issue with LLMs to begin with. It does not resemble intelligence or comprehension in any way and cannot use it as an indicator.

But the use of LLMs is absolutely identifiable to moderately intelligent humans, because LLM output has raw language skills wildly inconsistent with every other skill that is part of writing.

drmoose , 1 month ago

What’s even the point of your argument? That a detective can figure out who used AI? Yes, detectives can figure out most stuff. This is completely irrelevant to the topic at hand my dude.

conciselyverbose , 1 month ago

What are you talking about “detectives”?

You said “nobody can identify LLM use” when any moderately intelligent human can identify LLM output pretty easily. It explodes off the page.

drmoose , 1 month ago

Whatever dude not playing these stupid games. You know exactly what I meant. Go away 👋

conciselyverbose , 1 month ago

It’s not a game.

Spreading the lie that LLMs are somehow indistinguishable from humans is incredibly harmful. It’s a big part of the reason the obscene waste of energy the entire “force chatbots into everything” space exists.

echodot , 1 month ago

Probably because it doesn’t work. It’s not difficult for OpenAI to see if any given conversation is one of their conversations. If I were them I would hash the results of each conversation and then store that hash in a database for quick searching.

That’s useless for actual AI detection
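A rough sketch of the hash-and-lookup idea, with made-up names (`OutputLog`, `content_hash`) and an in-memory set standing in for the database. Nothing here reflects how OpenAI actually stores conversations.

```python
import hashlib


def content_hash(text):
    """Hash the text after collapsing whitespace and case, so trivial
    formatting differences still map to the same entry."""
    normalized = " ".join(text.lower().split())
    return hashlib.sha256(normalized.encode()).hexdigest()


class OutputLog:
    """Hypothetical store of hashes for everything the model has emitted."""

    def __init__(self):
        self._seen = set()

    def record(self, text):
        self._seen.add(content_hash(text))

    def was_generated(self, text):
        return content_hash(text) in self._seen


log = OutputLog()
log.record("The quick brown fox jumps over the lazy dog.")

print(log.was_generated("the quick  brown fox jumps over the lazy dog."))  # True
print(log.was_generated("The quick brown fox leaps over the lazy dog."))   # False
```

Changing a single word produces a completely different hash, so an exact lookup like this only catches unedited copy/paste, and only of text that this one provider logged.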

x00z , 1 month ago

ALL conversations are logged and can be used however they want.

I’m almost certain this “detector” is a simple lookup in their database.

Etterra , 1 month ago

If they have one, and that’s IF, then of course they won’t release it. They’re still trying to find a use case for their stupid toy so that they can charge people for it. Releasing the counter agent would be completely contradictory to their business model. It’s like Umbrella Corp. but even dumber.

Pogogunner , 1 month ago

If you believe this, I have a bridge in Brooklyn to sell you

stardreamer , 1 month ago

A routine that just returns “yes” will also detect all AI. It would just have an abnormally high false positive rate.

BluesF , 1 month ago

My model has 100% recall and 50% precision, not bad eh?

But - that model would not have 99.9% accuracy.
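To spell out the arithmetic, here is a quick check, assuming a balanced test set of 500 AI texts and 500 human texts (the sizes are an assumption; they are what makes the precision come out at 50%) and a detector that answers “yes” to everything.

```python
def metrics(tp, fp, fn, tn):
    """Recall, precision and accuracy from confusion-matrix counts."""
    recall = tp / (tp + fn)
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return recall, precision, accuracy


ai_texts, human_texts = 500, 500  # assumed balanced test set

# An always-"yes" detector flags every AI text (true positives) and
# every human text (false positives); it never answers "no".
recall, precision, accuracy = metrics(tp=ai_texts, fp=human_texts, fn=0, tn=0)

print(recall, precision, accuracy)  # 1.0 0.5 0.5
```

Accuracy counts the wrongly flagged human texts as errors, so this detector lands at 50%, nowhere near 99.9%.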

stardreamer , 1 month ago

Agreed. Personally I think this whole thing is bs.

KeenFlame , 1 month ago

Ofc they just look in their database to see if this is something it has ever said, and to whom.

rozodru , 1 month ago

Would you have any oceanfront property for sale in Kansas, perchance?

Cyteseer , 1 month ago

If they aren’t willing to release it, then the situation is no different from them not having one at all. All these claims OpenAI makes about having whatever system but hiding it are just to try and increase hype to grab more investor money.

Naich , 1 month ago

Total coincidence that this “news” appears about a day after several articles saying the AI bubble is starting to burst.

Melvin_Ferd , 1 month ago

It is nuts. Who is paying for all these articles, and why are they hell bent on convincing everyone that AI is to the left what immigrants are to Republicans?

UnderpantsWeevil , 1 month ago

Lots of money in the AI hype game, as tech stocks are massively inflated from just this year alone.

doodledup , 1 month ago

Why does everything have to be about the USA these days? I’m tired of this joke of a wannabe democracy. Don’t want to hear it. Nobody cares. Just stop and leave it to yourself.

Saledovil , 1 month ago

Language models, in the end, are just statistics. And to make statistics more accurate, you need more data. Exponentially more data. At the same time, the marginal utility of precision decays exponentially. Exponentially increasing marginal costs are met with exponentially decaying marginal utility.

chiisana , 1 month ago

They’re keeping everything anyway, so what’s preventing them from doing a DB lookup to see if it (given a large enough passage of text) exists in their output history?

_edge , 1 month ago

I believe the actual detector is similar. They know what sentences are likely generated by chatgpt, since that’s literally in their model. They probably also have to some degree reverse engineered typical output from competing models.

circuitfarmer , 1 month ago

Doubt

Evil_Shrubbery , 1 month ago

She goes to another school
(for intelligent ificial art)

AmbiguousProps , 1 month ago

There is no way it’s that accurate, which is why they don’t want to release it.

RootBeerGuy , 1 month ago

“A 99.9% accurate ChatGPT AI text detector? At this time of year! At this time of day! In this part of the country! Localized entirely within your company?!?”

“Yes”

“May I see it?”

“No”

Loduz_247 , 1 month ago

This technology will not be published until the GPT-3 code is released.
