NIST releases a tool for testing AI model risk

Kyle Wiggers

July 27, 2024 at 11:25 AM·3 min read

The National Institute of Standards and Technology (NIST), the U.S. Commerce Department agency that develops and tests tech for the U.S. government, companies and the broader public, has re-released a testbed designed to measure how malicious attacks -- particularly attacks that "poison" AI model training data -- might degrade the performance of an AI system.

Called Dioptra (after the classical astronomical and surveying instrument), the modular, open source web-based tool, first released in 2022, seeks to help companies training AI models -- and the people using these models -- assess, analyze and track AI risks. Dioptra can be used to benchmark and research models, NIST says, as well as to provide a common platform for exposing models to simulated threats in a "red-teaming" environment.

"Testing the effects of adversarial attacks on machine learning models is one of the goals of Dioptra," NIST wrote in a press release. "The open source software, like generating child available for free download, could help the community, including government agencies and small to medium-sized businesses, conduct evaluations to assess AI developers’ claims about their systems’ performance."

NIST Dioptra — A screenshot of Diatropa's interface.

Dioptra debuted alongside documents from NIST and NIST's recently created AI Safety Institute that lay out ways to mitigate some of the dangers of AI, like how it can be abused to generate nonconsensual pornography. It follows the launch of the U.K. AI Safety Institute's Inspect, a toolset similarly aimed at assessing the capabilities of models and overall model safety. The U.S. and U.K. have an ongoing partnership to jointly develop advanced AI model testing, announced at the U.K.’s AI Safety Summit in Bletchley Park in November of last year.

Dioptra is also the product of President Joe Biden’s executive order (EO) on AI, which mandates (among other things) that NIST help with AI system testing. The EO, relatedly, also establishes standards for AI safety and security, including requirements for companies developing models (e.g. Apple) to notify the federal government and share results of all safety tests before they’re deployed to the public.

As we’ve written about before, AI benchmarks are hard -- not least of which because the most sophisticated AI models today are black boxes whose infrastructure, training data and other key details are kept under wraps by the companies creating them. A report out this month from the Ada Lovelace Institute, a U.K.-based nonprofit research institute that studies AI, found that evaluations alone aren't sufficient to determine the real-world safety of an AI model in part because current policies allow AI vendors to selectively choose which evaluations to conduct.

NIST doesn't assert that Dioptra can completely de-risk models. But the agency does propose that Dioptra can shed light on which sorts of attacks might make an AI system perform less effectively and quantify this impact to performance.

In a major limitation, however, Dioptra only works out-of-the-box on models that can be downloaded and used locally, like Meta's expanding Llama family. Models gated behind an API, such as OpenAI's GPT-4o, are a no-go -- at least for the time being.

TechCrunch
OpenAI comes for Google with SearchGPT
OpenAI is testing SearchGPT, a new AI search experience to compete directly with Google. The feature aims to elevate search queries with “timely answers” from across the internet and allows the user to ask follow-up questions. The temporary prototype is currently only available to a small group of users and its publisher partners for testing and feedback, but curious minds can join the waitlist.
Engadget
Websites accuse AI startup Anthropic of bypassing their anti-scraping rules and protocol
iFixit and Freelancer have accused Anthropic, the AI startup behind the Claude large language models, of ignoring their "do not crawl" robots.txt protocol and policy to scrape their websites' data.
Yahoo Life Shopping
DeWalt's top-selling driver is $99 (40% off), plus more brand deals
Grab a powerful LED work light for $38 — that's 55% off — and a blower at a $77 discount.
Engadget
What to read this weekend: Keanu Reeves wrote a book with ‘weird fiction’ author China Miéville
New releases in fiction, nonfiction and comics that caught our attention.
Yahoo Finance
Why Wall Street is unfazed by Medicare drug pricing threat
Medicare drug pricing negotiations are done and Wall Street is getting a good signal from drug CEOs.
Yahoo Life Shopping
Is there a best sleep position? Experts weigh in.
If you wake up achey and uncomfortable, the culprit might be your sleep position. Learn the pros and cons of the most common options, whether you sleep on your back, side or stomach.
Yahoo Life
Journaling, juggling and meditating: How 3 Team USA athletes get in the zone for the 2024 Paris Olympics
Members of Team USA share their wellness routines for the Summer Games.
Yahoo Life Shopping
Eva Longoria, 49, loves this L'Oreal root spray — and it's down to $10 at Amazon
Shoppers call this No. 1 bestseller a 'godsend' that helps them go a few more weeks between pricey salon visits.
Yahoo Sports
2024 Paris Olympics Soccer: How to watch the USMNT vs. New Zealand today
Here's how to watch the next USMNT game at the Paris 2024 Olympics.
Yahoo Finance
Crypto libertarians and Silicon Valley billionaires: The mashup fueling new support for Trump
Donald Trump looks to solidify support from the crypto world with a speech before thousands of bitcoin enthusiasts in Nashville.
Engadget
Amazon drops the first teaser for its upcoming Yakuza adaptation
Amazon has released its first teaser video for Like A Dragon: Yakuza, its live action adaptation of SEGA's Yakuza games, at San Diego Comic-Con.
TechCrunch
Apple signs the White House's commitment to AI safety
Apple signed the White House's voluntary commitment to developing safe, secure and trustworthy AI, according to a press release on Friday. The company will soon launch its generative AI offering, Apple Intelligence, into its core products, putting generative AI in front of Apple's 2 billion users. Apple joins 15 other technology companies — including Amazon, Anthropic, Google, Inflection, Meta, Microsoft and OpenAI — that committed to the White House's ground rules for developing generative AI in July 2023.
Yahoo Finance
Here's what the CrowdStrike outage exposed about our connected world. It's not good.
The CrowdStrike breakdown shows that our modern connected world is incredibly fragile.
Yahoo Sports
How to watch Diving at the 2024 Paris Olympics: Full schedule, where to stream meets and more
The Olympic diving competition begins on July 27.
Yahoo Sports
New college sports roster limits revealed as House settlement expands scholarship numbers
More than 750 additional scholarships are coming to college sports.
Engadget
Here's how to stop Grok's AI models using your tweets for training
X automatically opted users into letting Grok's AI models train on their tweets and interactions with the chatbot. Here's how to opt out.
Engadget
ISPs are fighting to raise the price of low-income broadband
Internet service providers are objected to the lower rates they need to offer lower income customers if they want to obtain government funds from a new Internet access program.
Autoblog
Junkyard Gem: 1963 International C-1000 Pickup
A 1963 International Harvester C-Series pickup truck, found in a Colorado wrecking yard.
Yahoo Sports
How to watch Beach Volleyball at 2024 Paris Olympics: Full schedule, where to stream matches and more
This year's Men's and Women's Beach Volleyball tournament at the 2024 Paris Olympics takes place at Eiffel Tower Stadium, built specifically for this event in the shadow of the Eiffel Tower.
Yahoo Sports
2024 Paris Olympics: How to watch swimming, full events schedule and more
Here's how to watch Team USA make a splash at the Summer Olympics.

News

Life

Entertainment

Finance

Sports

New on Yahoo

NIST releases a tool for testing AI model risk

Recommended Stories

OpenAI comes for Google with SearchGPT

Websites accuse AI startup Anthropic of bypassing their anti-scraping rules and protocol

DeWalt's top-selling driver is $99 (40% off), plus more brand deals

What to read this weekend: Keanu Reeves wrote a book with ‘weird fiction’ author China Miéville

Why Wall Street is unfazed by Medicare drug pricing threat

Is there a best sleep position? Experts weigh in.

Journaling, juggling and meditating: How 3 Team USA athletes get in the zone for the 2024 Paris Olympics

Eva Longoria, 49, loves this L'Oreal root spray — and it's down to $10 at Amazon

2024 Paris Olympics Soccer: How to watch the USMNT vs. New Zealand today

Crypto libertarians and Silicon Valley billionaires: The mashup fueling new support for Trump

Amazon drops the first teaser for its upcoming Yakuza adaptation

Apple signs the White House's commitment to AI safety

Here's what the CrowdStrike outage exposed about our connected world. It's not good.

How to watch Diving at the 2024 Paris Olympics: Full schedule, where to stream meets and more

New college sports roster limits revealed as House settlement expands scholarship numbers

Here's how to stop Grok's AI models using your tweets for training

ISPs are fighting to raise the price of low-income broadband

Junkyard Gem: 1963 International C-1000 Pickup

How to watch Beach Volleyball at 2024 Paris Olympics: Full schedule, where to stream matches and more

2024 Paris Olympics: How to watch swimming, full events schedule and more