Dall-E 2: What is the AI image generator creating strange artworks out of nothing – and how do you use it?

 (Image by Alan Warburton / © BBC / Better Images of AI / Nature / CC-BY 4.0)

An artificially intelligent algorithm that can create images based on what is described to it has recently risen in popularity.

Dall-E 2 – named after the 2008 Pixar film WALL-E and the surrealist painter Salvador Dalí – was created by OpenAI, the billion-dollar artificial intelligence lab.

The team spent two years developing the technology, which is based on the same ‘neural network’ mathematics as smart assistants.

By gathering data on thousands of photos, the algorithm can ‘learn’ what an object is supposed to look like. Give it millions of images, and Dall-E can start combining those objects into almost any scene it is asked to imagine.

The original Dall-E was launched in January last year, but this new version can edit objects – removing part of an image or replacing it with another element – while accounting for features such as shadows. While the first version of the technology only rendered images in a cartoon-like art style, Dall-E 2 can produce images in a variety of styles, at higher quality and with more complex backgrounds.

How can I try it?

There is currently a waitlist for Dall-E 2 on OpenAI’s website, but smaller tools such as Dall-E Mini are available for general users to play with.

Other companies, such as Google, are developing similar tools – such as Imagen, which it unveiled late last month.

Another popular application is Wombo’s Dream app, which generates pictures of whatever users describe in different art styles, although this does not use the specific Dall-E 2 algorithm.

How does Dall-E work?

When someone types in a description of an image for Dall-E to generate, the system notes a series of key features that might be present. These could include, as Alex Nichol, one of the researchers behind the system, explained to the New York Times last month, the edge of a trumpet or the curve at the top of a teddy bear’s ear.

A second neural network – a diffusion model – then generates the pixels needed to produce an image with those features, and with Dall-E 2 it does so at a higher resolution than we’ve seen before.
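To make that two-stage idea a little more concrete, here is a purely illustrative Python sketch – the function names, numbers, and maths are invented for the example and have nothing to do with OpenAI’s actual code. It imitates the overall shape of the pipeline: a text encoder turns the prompt into features, a “prior” maps those to image features, and a decoder starts from random noise and nudges the pixels towards a picture over many small steps.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode_text(prompt: str, dim: int = 64) -> np.ndarray:
    # Hypothetical stand-in for a text encoder: hash each word into a fixed-length vector.
    vec = np.zeros(dim)
    for word in prompt.lower().split():
        vec[hash(word) % dim] += 1.0
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec

def prior(text_features: np.ndarray) -> np.ndarray:
    # Hypothetical stand-in for the prior network: map text features to "image features".
    mixing = rng.standard_normal((text_features.size, text_features.size))
    return np.tanh(mixing @ text_features)

def decoder(image_features: np.ndarray, size: int = 32, steps: int = 50) -> np.ndarray:
    # Hypothetical stand-in for the diffusion decoder: begin with pure noise and
    # repeatedly nudge the pixels towards a pattern conditioned on the features.
    x = np.linspace(0, np.pi, size)
    target = np.outer(np.sin(x), np.cos(x)) * image_features.mean()
    image = rng.standard_normal((size, size))   # start from random noise
    for _ in range(steps):
        image += 0.1 * (target - image)         # each step removes a little of the noise
    return image

image = decoder(prior(encode_text("a teddy bear playing a trumpet")))
print(image.shape)  # (32, 32)
```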

Does it have any limits?

Building Dall-E 2 was more difficult than building standard language algorithms. Dall-E 2 is built on a computer vision system called CLIP, which is more complex than the word-matching system used by GPT-3 – the AI tool which, in an earlier version, was infamously deemed “too dangerous to release” because of its ability to create text that is seemingly indistinguishable from text written by humans.

Word-matching, however, did not capture the qualities that humans find most important, and it limited how realistic the images could be. While CLIP looks at images and summarises their contents, the tool used here starts from the description and works towards the image, a process that OpenAI research scientist Prafulla Dhariwal told The Verge is like starting with a “bag of dots” and then filling in a pattern with greater and greater detail.
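Dhariwal’s “bag of dots” description can be sketched in the same toy fashion – again, an invented illustration rather than the real model. Here the “image” begins as pure random noise and, over a handful of steps, is blended towards a target pattern while the leftover noise shrinks, so detail fills in progressively.

```python
import numpy as np

rng = np.random.default_rng(42)
size, steps = 64, 8

# An invented "finished" pattern standing in for whatever the model is working towards.
x = np.linspace(0, 4 * np.pi, size)
target = np.sin(x)[:, None] * np.cos(x)[None, :]

image = rng.standard_normal((size, size))   # the initial "bag of dots": pure noise

for step in range(1, steps + 1):
    detail = step / steps                    # later steps commit to finer detail
    image = (1 - detail) * image + detail * target
    image += (1 - detail) * 0.1 * rng.standard_normal((size, size))   # residual noise shrinks
    print(f"step {step}: average distance from target = {np.abs(image - target).mean():.3f}")
```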

There are also some built-in safeguards on the kinds of images that can be generated, all of which are watermarked. The model was trained on data that had “objectionable content” filtered out, and it will not generate recognisable faces based on someone’s name.

People testing Dall-E 2 are also banned from uploading or generating images that are not suitable for general audiences or that could cause harm, including hate symbols, nudity, obscenity, and conspiracy theories.

What are the concerns?

The concerns around Dall-E 2, and its subsequent iterations, are the same as those raised about other technologies such as deepfakes and artificially intelligent voice generation: they could help spread disinformation across the internet.

“You could use it for good things, but certainly you could use it for all sorts of other crazy, worrying applications, and that includes deep fakes,” such as misleading photos and videos, Subbarao Kambhampati, a professor of computer science at Arizona State University, told the New York Times.

OpenAI has published a summary of the technology’s risks and limitations on the GitHub site hosting the Dall-E 2 preview. “Without sufficient guardrails, models like DALL·E 2 could be used to generate a wide range of deceptive and otherwise harmful content, and could affect how people perceive the authenticity of content more generally. DALL·E 2 additionally inherits various biases from its training data, and its outputs sometimes reinforce societal stereotypes”, the developers write.

Similarly, Google has put out a lengthy statement describing the ethical challenges in text-to-image research, referencing the same ethnic biases that artificial intelligence has exhibited – based on racist data sets – in other areas such as police work.

“While a subset of our training data was filtered to remove noise and undesirable content, such as pornographic imagery and toxic language, we also utilized LAION-400M dataset which is known to contain a wide range of inappropriate content including pornographic imagery, racist slurs, and harmful social stereotypes”, Google wrote.

“As such, there is a risk that Imagen has encoded harmful stereotypes and representations, which guides our decision to not release Imagen for public use without further safeguards in place”.