ChatGPT can help you fool OpenAI’s anti-cheating tool

Ben Goggin and Daysia Tolentino

Updated February 4, 2023 at 4:02 PM·5 min read

When OpenAI announced its new AI-detection tool Tuesday, the company suggested that it could help deter academic cheating by using its own wildly popular AI chatbot, ChatGPT.

But in a series of informal tests conducted by NBC News, the OpenAI tool struggled to identify text generated by ChatGPT. It especially struggled when ChatGPT was asked to write in a way that would avoid AI detection.

The detection tool, which OpenAI calls its AI Text Classifier, analyzes texts and then gives it one of five grades: “very unlikely, unlikely, unclear if it is, possibly, or likely AI-generated.” The company said the tool would provide a “likely AI-generated” grade to AI-written text 26% of the time.

The tool arrives as the sudden popularity of ChatGPT has brought fresh attention to the issue of how advanced text generation tools can pose a problem for educators. Some teachers said the detector’s hit-or-miss accuracy and lack of certainty could create difficulties when approaching students about possible academic dishonesty.

“It could give me sort of degrees of certainty, and I like that,” Brett Vogelsinger, a ninth grade English teacher at Holicong Middle School in Doylestown, Pennsylvania, said. “But then I’m also trying to picture myself coming to a student with a conversation about that.”

Vogelsinger said he had difficulty envisioning himself confronting a student if a tool told him something had likely been generated by AI.

“It’s more suspicion than it is certainty even with the tool,” he said.

Ian Miers, an assistant professor of computer science at the University of Maryland, called the AI Text Classifier “a sort of black box that nobody in the disciplinary process entirely understands.” He expressed concern over the use of the tool to catch cheating and cautioned educators to consider the program’s accuracy and false positive rate.

“It can’t give you evidence. You can’t cross examine it,” Miers said. “And so it’s not clear how you’re supposed to evaluate that.”

NBC News asked ChatGPT to generate 50 pieces of text with basic prompts, asking it, for example, about historical events, processes and objects. In 25 of those prompts, NBC News asked ChatGPT to write “in a way that would be rated as very unlikely written by AI when processed by an AI detection tool.”

ChatGPT’s responses to the questions were then run through OpenAI’s new AI detection tool.

In the tests, none of the responses created by ChatGPT when instructed to avoid AI detection were graded as “likely AI-generated.” Some of that text was highly stylized, suggesting that AI had processed the request to attempt to evade AI detection, and the students could potentially ask the same of ChatGPT when cheating.

When asked about the chat platform Discord, for example, ChatGPT returned text with words cut short, as if they were spoken in colloquial English. The adjustment in language style was a departure from responses normally returned by the AI tool, suggesting that it was attempting to adjust the responses to address the request that it avoid AI detection.

ChatGPT did not produce such stylized text without prompts for it to evade detection.

“Discord is a chattin’ platform that’s quite the talk of the town these days. It’s like a blend of instant messagin’, voice calls, and forum-style discussions all in one,” ChatGPT wrote.

OpenAI’s detection said it was “unclear” if the text was AI-generated.

It did appear that OpenAI had made some efforts to guard against users who ask it to track detection efforts.

While NBC News was running its experiment, ChatGPT issued warnings in response to several prompts asking the AI to avoid detection, and returned responses that raised concerns about the ethics of the questions.

“I’m sorry, but it’s not ethical to engage in deceptive practices or create false information, even if it’s to avoid AI detection,” ChatGPT wrote in response to a question that asked the AI to avoid AI detection.

NBC News also asked ChatGPT to generate 25 pieces of text without attempting to avoid AI detection. When tested by the OpenAI Text Classifier, the tool produced a “likely AI-generated” rating 28% of the time.

For teachers, the test is yet another example of how students and technology might evolve as new cheating detection is deployed.

“The way that the AI writing tool gets better is it gets more human — it just sounds more human — and I think it’s going to figure that out, how to sound more and more human,” said Todd Finley, an associate professor of English education at East Carolina University in North Carolina. “And it seems to be that that’s also going to make it more difficult to spot, I think even for a tool.”

For now, educators said they would rely on a combination of their own instincts and detection tools if they suspect a student is not being honest about a piece of writing.

“We can’t see them as a fix that you just pay for and then you’re done,” Anna Mills, writing instructor at the College of Marin in California, said of detector tools. “I think we need to develop a comprehensive policy and vision that’s much more informed by an understanding of the limits of those tools and the nature of the AI.”

This article was originally published on NBCNews.com

Yahoo Sports
Based on the odds, here's what the top 10 picks of the NFL Draft will be
What would a mock draft look like using just betting odds?
3d ago
Yahoo Sports
Broncos, Jets, Lions and Texans have new uniforms. Let's rank them
Which new uniforms are winners this season?
22h ago
Yahoo Finance
Jamie Dimon is worried the US economy is headed back to the 1970s
JPMorgan's CEO is concerned the US economy could be in for a repeat of the stagflation that hampered the country during the 1970s.
2d ago
Yahoo Sports
Luka makes Clippers look old, Suns are in big trouble & a funeral for Lakers | Good Word with Goodwill
Vincent Goodwill and Tom Haberstroh break down last night’s NBA Playoffs action and preview several games for tonight and tomorrow.
17h ago
Yahoo TV
Everyone's still talking about the 'SNL' Beavis and Butt-Head sketch. Cast members and experts explain why it's an instant classic.
Ryan Gosling, who starred in the skit, couldn't keep a straight face — and neither could some of the "Saturday Night Live" cast.
2d ago
Autoblog
These are the cars being discontinued for 2024 and beyond
As automakers shift to EVs, trim the fat on their lineups and cull slow-selling models, these are the vehicles we expect to die off soon.
2d ago
Yahoo Sports
Dave McCarty, player on 2004 Red Sox championship team, dies 1 week after team's reunion
The Red Sox were already mourning the loss of Tim Wakefield from that 2004 team.
5d ago
Yahoo Sports
Ryan Garcia drops Devin Haney 3 times en route to stunning upset
The 25-year-old labeled "mentally fragile" by many delivered the upset for the ages.
4d ago
Yahoo Sports
WNBA Draft winners and losers: As you may have guessed, the Fever did pretty well. The Liberty? Perhaps not
Here are five franchises who stood out, for better or for worse.
9d ago
Yahoo Sports
Arch Manning dominates in the Texas spring game, and Jaden Rashada enters the transfer portal
Dan Wetzel, Ross Dellenger & SI’s Pat Forde react to the huge performance this weekend by Texas QB Arch Manning, Michigan and Notre Dame's spring games, Jaden Rashada entering the transfer portal, and more
3d ago
Yahoo Sports
Chiefs make Andy Reid NFL's highest-paid coach, sign president Mark Donovan, GM Brett Veach to extensions
Reid's deal reportedly runs through 2029 and makes him the highest-paid coach in the NFL.
3d ago
Yahoo Sports
Yankees' Nestor Cortés told by MLB his pump-fake pitch is illegal
Cortés' attempt didn't fool Andrés Giménez, who fouled off the pitch.
5d ago
Yahoo Life
Here’s when people think old age begins — and why experts think it’s starting later
People's definition of "old age" is older than it used to be, new research suggests.
3d ago
Yahoo Finance
Donald Trump nabs additional $1.2 billion 'earnout' bonus from DJT stock
Trump is entitled to an additional 36 million shares if the company's share price trades above $17.50 "for twenty out of any thirty trading days" over the next three years.
2d ago
Yahoo Sports
2024 NFL mock draft: With one major trade-up, it's a QB party in the top 5
Our final 2024 mock draft projects four quarterbacks in the first five picks, but the Cardinals at No. 4 might represent the key pivot point of the entire board.
3d ago
Yahoo Finance
Retirement confidence in the US ticks up; new rule for financial advisors is set to start
Two-thirds of Americans reported that they feel confident they have enough money for a comfortable retirement, up a notch from last year.
3h ago
Yahoo Finance
What US taxpayers will get for another $61 billion to Ukraine
Congress is finally providing more of the aid Ukraine needs to survive. Here's why this is money well spent.
2d ago
Yahoo Sports
Dylan Edwards set to be latest Colorado running back to enter transfer portal
All four rushers who had more than 10 carries in 2023 for the Buffaloes are transferring.
2d ago
Yahoo Sports
Arch Manning puts on a show in Texas' spring game, throwing for 3 touchdowns
Arch Manning gave Texas football fans an enticing look at the future, throwing for 355 yards and three touchdowns in the Longhorns' Orange-White spring game.
5d ago
Yahoo Sports
NBA Playoffs: Lillard sinks the Pacers, Celtics-Heat controversy, plus injury concerns for Kawhi & Embiid
Vincent Goodwill and Amin Elhassan react to (just about) every Round 1 game of the NBA Playoffs after the first games have been played over the weekend.
3d ago

News

Life

Entertainment

Finance

Sports

New on Yahoo

ChatGPT can help you fool OpenAI’s anti-cheating tool

Recommended Stories

Based on the odds, here's what the top 10 picks of the NFL Draft will be

Broncos, Jets, Lions and Texans have new uniforms. Let's rank them

Jamie Dimon is worried the US economy is headed back to the 1970s

Luka makes Clippers look old, Suns are in big trouble & a funeral for Lakers | Good Word with Goodwill

Everyone's still talking about the 'SNL' Beavis and Butt-Head sketch. Cast members and experts explain why it's an instant classic.

These are the cars being discontinued for 2024 and beyond

Dave McCarty, player on 2004 Red Sox championship team, dies 1 week after team's reunion

Ryan Garcia drops Devin Haney 3 times en route to stunning upset

WNBA Draft winners and losers: As you may have guessed, the Fever did pretty well. The Liberty? Perhaps not

Arch Manning dominates in the Texas spring game, and Jaden Rashada enters the transfer portal

Chiefs make Andy Reid NFL's highest-paid coach, sign president Mark Donovan, GM Brett Veach to extensions

Yankees' Nestor Cortés told by MLB his pump-fake pitch is illegal

Here’s when people think old age begins — and why experts think it’s starting later

Donald Trump nabs additional $1.2 billion 'earnout' bonus from DJT stock

2024 NFL mock draft: With one major trade-up, it's a QB party in the top 5

Retirement confidence in the US ticks up; new rule for financial advisors is set to start

What US taxpayers will get for another $61 billion to Ukraine

Dylan Edwards set to be latest Colorado running back to enter transfer portal

Arch Manning puts on a show in Texas' spring game, throwing for 3 touchdowns

NBA Playoffs: Lillard sinks the Pacers, Celtics-Heat controversy, plus injury concerns for Kawhi & Embiid