Meta releases its biggest 'open' AI model yet

Kyle Wiggers

Updated July 23, 2024 at 12:07 PM·9 min read

Meta's latest open source AI model is its biggest yet.

Today, Meta said it is releasing Llama 3.1 405B, a model containing 405 billion parameters. Parameters roughly correspond to a model's problem-solving skills, and models with more parameters generally perform better than those with fewer parameters.

At 405 billion parameters, Llama 3.1 405B isn't the absolute largest open source model out there, but it's the biggest in recent years. Trained using 16,000 Nvidia H100 GPUs, it also benefits from newer training and development techniques that Meta claims makes it competitive with leading proprietary models like OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet (with a few caveats).

As with Meta's previous models, Llama 3.1 405B is available to download or use on cloud platforms like AWS, Azure and Google Cloud. It's also being used on WhatsApp and Meta.ai, where it's powering a chatbot experience for U.S.-based users.

New and improved

Like other open and closed source generative AI models, Llama 3.1 405B can perform a range of different tasks, from coding and answering basic math questions to summarizing documents in eight languages (English, German, French, Italian, Portuguese, Hindi, Spanish and Thai). It's text-only, meaning that it can't, for example, answer questions about an image, but most text-based workloads — think analyzing files like PDFs and spreadsheets — are within its purview.

Meta wants to make it known that it is experimenting with multimodality. In a paper published today, researchers at the company write that they're actively developing Llama models that can recognize images and videos, and understand (and generate) speech. Still, these models aren't yet ready for public release.

To train Llama 3.1 405B, Meta used a dataset of 15 trillion tokens dating up to 2024 (tokens are parts of words that models can more easily internalize than whole words, and 15 trillion tokens translates to a mind-boggling 750 billion words). It's not a new training set per se, since Meta used the base set to train earlier Llama models, but the company claims it refined its curation pipelines for data and adopted "more rigorous" quality assurance and data filtering approaches in developing this model.

The company also used synthetic data (data generated by other AI models) to fine-tune Llama 3.1 405B. Most major AI vendors, including OpenAI and Anthropic, are exploring applications of synthetic data to scale up their AI training, but some experts believe that synthetic data should be a last resort due to its potential to exacerbate model bias.

For its part, Meta insists that it "carefully balance[d]" Llama 3.1 405B's training data, but declined to reveal exactly where the data came from (outside of webpages and public web files). Many generative AI vendors see training data as a competitive advantage and so keep it and any information pertaining to it close to the chest. But training data details are also a potential source of IP-related lawsuits, another disincentive for companies to reveal much.

In the aforementioned paper, Meta researchers wrote that compared to earlier Llama models, Llama 3.1 405B was trained on an increased mix of non-English data (to improve its performance on non-English languages), more "mathematical data" and code (to improve the model's mathematical reasoning skills), and recent web data (to bolster its knowledge of current events).

Recent reporting by Reuters revealed that Meta at one point used copyrighted e-books for AI training despite its own lawyers’ warnings. The company controversially trains its AI on Instagram and Facebook posts, photos and captions, and makes it difficult for users to opt out. What's more, Meta, along with OpenAI, is the subject of an ongoing lawsuit brought by authors, including comedian Sarah Silverman, over the companies’ alleged unauthorized use of copyrighted data for model training.

"The training data, in many ways, is sort of like the secret recipe and the sauce that goes into building these models," Ragavan Srinivasan, VP of AI program management at Meta, told TechCrunch in an interview. "And so from our perspective, we've invested a lot in this. And it is going to be one of these things where we will continue to refine it."

Bigger context and tools

Llama 3.1 405B has a larger context window than previous Llama models: 128,000 tokens, or roughly the length of a 50-page book. A model’s context, or context window, refers to the input data (e.g. text) that the model considers before generating output (e.g. additional text).

One of the advantages of models with larger contexts is that they can summarize longer text snippets and files. When powering chatbots, such models are also less likely to forget topics that were recently discussed.

Two other new, smaller models Meta unveiled today, Llama 3.1 8B and Llama 3.1 70B — updated versions of the company's Llama 3 8B and Llama 3 70B models released in April — also have 128,000-token context windows. The previous models' contexts topped out at 8,000 tokens, which makes this upgrade fairly substantial -- assuming the new Llama models can effectively reason across all that context.

All of the Llama 3.1 models can use third-party tools, apps and APIs to complete tasks, like rival models from Anthropic and OpenAI. Out of the box, they're trained to tap Brave Search to answer questions about recent events, the Wolfram Alpha API for math- and science-related queries, and a Python interpreter for validating code. In addition, Meta claims the Llama 3.1 models can use certain tools they haven't seen before — to an extent.

Building an ecosystem

If benchmarks are to be believed (not that benchmarks are the end-all be-all in generative AI), Llama 3.1 405B is a very capable model indeed. That'd be a good thing, considering some of the painfully obvious limitations of previous-generation Llama models.

Llama 3 405B performs on par with OpenAI's GPT-4, and achieves "mixed results" compared to GPT-4o and Claude 3.5 Sonnet, per human evaluators that Meta hired, the paper notes. While Llama 3 405B is better at executing code and generating plots than GPT-4o, its multilingual capabilities are overall weaker, and Llama 3 405B trails Claude 3.5 Sonnet in programming and general reasoning.

And because of its size, it needs beefy hardware to run. Meta recommends at least a server node.

That's perhaps why Meta's pushing its smaller new models, Llama 3.1 8B and Llama 3.1 70B, for general-purpose applications like powering chatbots and generating code. Llama 3.1 405B, the company says, is better reserved for model distillation — the process of transferring knowledge from a large model to a smaller, more efficient model — and generating synthetic data to train (or fine-tune) alternative models.

To encourage the synthetic data use case, Meta said it has updated Llama's license to let developers use outputs from the Llama 3.1 model family to develop third-party AI generative models (whether that's a wise idea is up for debate). Importantly, the license still constrains how developers can deploy Llama models: App developers with more than 700 million monthly users must request a special license from Meta that the company will grant on its discretion.

That change in licensing around outputs, which allays a major criticism of Meta's models within the AI community, is a part of the company's aggressive push for mindshare in generative AI.

Alongside the Llama 3.1 family, Meta is releasing what it's calling a "reference system" and new safety tools — several of these block prompts that might cause Llama models to behave in unpredictable or undesirable ways — to encourage developers to use Llama in more places. The company is also previewing and seeking comment on the Llama Stack, a forthcoming API for tools that can be used to fine-tune Llama models, generate synthetic data with Llama and build "agentic" applications — apps powered by Llama that can take action on a user's behalf.

"[What] We have heard repeatedly from developers is an interest in learning how to actually deploy [Llama models] in production," Srinivasan said. "So we're trying to start giving them a bunch of different tools and options."

Play for market share

In an open letter published this morning, Meta CEO Mark Zuckerberg lays out a vision for the future in which AI tools and models reach the hands of more developers around the world, ensuring people have access to the "benefits and opportunities" of AI.

It's couched very philanthropically, but implicit in the letter is Zuckerberg's desire that these tools and models be of Meta's making.

Meta's racing to catch up to companies like OpenAI and Anthropic, and it is employing a tried-and-true strategy: give tools away for free to foster an ecosystem and then slowly add products and services, some paid, on top. Spending billions of dollars on models that it can then commoditize also has the effect of driving down Meta competitors' prices and spreading the company's version of AI broadly. It also lets the company incorporate improvements from the open source community into its future models.

Llama certainly has developers' attention. Meta claims Llama models have been downloaded over 300 million times, and more than 20,000 Llama-derived models have been created so far.

Make no mistake, Meta's playing for keeps. It is spending millions on lobbying regulators to come around to its preferred flavor of "open" generative AI. None of the Llama 3.1 models solve the intractable problems with today's generative AI tech, like its tendency to make things up and regurgitate problematic training data. But they do advance one of Meta's key goals: becoming synonymous with generative AI.

There are costs to this. In the research paper, the co-authors — echoing Zuckerberg's recent comments — discuss energy-related reliability issues with training Meta's ever-growing generative AI models.

"During training, tens of thousands of GPUs may increase or decrease power consumption at the same time, for example, due to all GPUs waiting for checkpointing or collective communications to finish, or the startup or shutdown of the entire training job," they write. "When this happens, it can result in instant fluctuations of power consumption across the data center on the order of tens of megawatts, stretching the limits of the power grid. This is an ongoing challenge for us as we scale training for future, even larger Llama models."

One hopes that training those larger models won't force more utilities to keep old coal-burning power plants around.

Engadget
Llama 3.1 is Meta's latest salvo in the battle for AI dominance
Meta said that the new model, called Llama 3.1 405B, is the first openly available model that can compete available rivals in general knowledge, math skills and translating across multiple languages.
Yahoo Finance
Meta CEO Zuckerberg calls on industry to adopt open-source AI, debuts high-powered Llama AI model
Meta CEO Mark Zuckerberg is calling on the tech industry to adopt open-source AI technology.
Engadget
Meta AI is now available in Spanish, Portugese, French and more
Meta AI gets new tools and languages in latest update.
Engadget
Meta's AI assistant is coming to Quest headsets in the US and Canada
The experimental version of Meta AI will be available on Quest headsets starting next month.
TechCrunch
TTT models might be the next frontier in generative AI
After years of dominance by the form of AI known as the transformer, the hunt is on for new architectures. Transformers underpin OpenAI’s video-generating model Sora, and they're at the heart of text-generating models like Anthropic’s Claude, Google’s Gemini and GPT-4o. A promising architecture proposed this month is test-time training (TTT), which was developed over the course of a year and a half by researchers at Stanford, UC San Diego, UC Berkeley and Meta.
Engadget
The Morning After: Meta may hold back its next-gen AI models from the EU
The biggest news stories this morning: EA Sports FC 25 brings women’s soccer to the career modes for the first time, Fallout’s Emmy nominations show that successful gaming adaptations are no longer a fluke, Dyson ditches the air purifier in its new headphones.
TechCrunch
Anthropic releases Claude app for Android
Anthropic launched its Claude Android app on Tuesday to bring its AI chatbot to more users. This is Anthropic's latest effort to convince users to ditch ChatGPT by making Claude available in more places. The Claude Android app will work just like the iOS version released in May, including free access to Anthropic's best AI model, Claude 3.5 Sonnet, alongside upgraded plans through Anthropic's Pro and Team subscriptions.
TechCrunch
Meta AI gets new 'Imagine me' selfie feature
Meta AI, Meta's AI-powered assistant across Facebook, Instagram, Messenger and the web, can now speak in more languages and create stylized selfies. The early iterations of Meta AI struggled with facts, numbers and web search, often failing to complete basic tasks like looking up recipes and airfares. Meta claims that the new model is particularly adept at math and coding questions, making it well-suited for help with math homework problems, explaining scientific concepts, code debugging and so on.
Yahoo Sports
Cowboys All-Pro CeeDee Lamb to reportedly hold out of training camp after failing to reach terms on an extension
Lamb will accrue fines until and if the two sides reach a deal that compels him to report to camp.
Yahoo Finance
Meta's reality check: Inside the $45 billion cash burn at Reality Labs
Meta's Reality Labs division has lost nearly $50 billion in the last five years. Yahoo Finance spoke to several former employees about what's happening inside Mark Zuckerberg's virtual world.
Yahoo Sports
12 training camp questions we have at the QB + RB position | Yahoo Fantasy Forecast
Training camps are in full swing this week and Joe Burrow's hair and Patrick Mahomes highlights have already taken over social media. Scott Pianowski joins Matt Harmon to identify the 12 biggest fantasy questions we have at the QB and RB position this summer and help you cut through the noise to know exactly what you should be paying attention to in training camps.
Yahoo News
Officers left posts before gunman opened fire at Trump, Pennsylvania State Police commissioner testifies. Here’s what else we learned from Tuesday’s House hearing.
The House Homeland Security Committee held a hearing Tuesday in an effort to find answers around the events of the July 13 shooting at a campaign rally of former President Trump in Butler, Pa. Here's what we learned.
TechCrunch
Elon Musk sets new date for Tesla robotaxi reveal, calls everything beyond autonomy 'noise'
Elon Musk says he will show off Tesla's purpose-built "robotaxi" prototype during an event October 10, after scrapping a previous plan to reveal it August 8. Musk said Tesla will also show off "a couple of other things," but didn't explain what that meant. The comments, made on Tesla's second-quarter earnings call Tuesday, largely confirms Bloomberg's original report about the delay, including that it was driven by Musk's desire to redesign certain elements of the prototype.
Yahoo Finance
Chipotle expected to post big Q2 earnings as it remains competitive on value
Chipotle is expected to hold onto its momentum in Q2, even as consumers criticized the brand over portions sizes.
TechCrunch
Adobe releases new Firefly AI tools for Illustrator and Photoshop
Adobe released new Firefly tools for Photoshop and Illustrator on Tuesday, offering graphic designers more ways to use the company's in-house AI models. Adobe's new features let creative workers describe what they want with brief prompts and receive AI-generated textures or images that could otherwise take hours to create. As Adobe doubles down on AI, the company walks a fine line with some of its loyal users who feel threatened by it.
Yahoo Life Shopping
These caftans are easy, breezy and great for the summer heat, from just $15
Dreamy '70s elegance is having a moment. Toss on a versatile, drapey dress to soak up the sun — and rake in the compliments.
TechCrunch
Alphabet to invest another $5B into Waymo
Alphabet will spend an additional $5 billion on its self-driving subsidiary, Waymo, over the next few years, according to Ruth Porat, the company's chief financial officer. Porat announced the commitment to a new "multi-year investment" Tuesday during Alphabet's second-quarter earnings call. "This new round of funding, which is consistent with recent annual investment levels, will enable Waymo to continue to build the world's leading autonomous driving technology company," said Porat.
Yahoo Sports
Buccaneers' Randy Gregory does not report for training camp after missing mandatory minicamp
There has been no word from Gregory, his agent or his attorney to explain his continued absence, according to Fox Sports.
Yahoo TV
'Bachelorette' contestants are flaunting their emotional intelligence. Is 'therapy speak' a rose or a thorn?
Contestants on Jenn Tran's "Bachelorette" season are using phrases seemingly pulled from a therapy session to assert their readiness for a relationship with the season's lead.
Autoblog
2025 Mercedes-AMG GT 63 S E Performance First Drive: The GT 63, but more
2025 Mercedes-AMG GT 63 S E Performance is just like the other V8 ones, but with all the numbers turned up.

News

Life

Entertainment

Finance

Sports

New on Yahoo

Meta releases its biggest 'open' AI model yet

New and improved

Bigger context and tools

Building an ecosystem

Play for market share

Recommended Stories

Llama 3.1 is Meta's latest salvo in the battle for AI dominance

Meta CEO Zuckerberg calls on industry to adopt open-source AI, debuts high-powered Llama AI model

Meta AI is now available in Spanish, Portugese, French and more

Meta's AI assistant is coming to Quest headsets in the US and Canada

TTT models might be the next frontier in generative AI

The Morning After: Meta may hold back its next-gen AI models from the EU

Anthropic releases Claude app for Android

Meta AI gets new 'Imagine me' selfie feature

Cowboys All-Pro CeeDee Lamb to reportedly hold out of training camp after failing to reach terms on an extension

Meta's reality check: Inside the $45 billion cash burn at Reality Labs

12 training camp questions we have at the QB + RB position | Yahoo Fantasy Forecast

Officers left posts before gunman opened fire at Trump, Pennsylvania State Police commissioner testifies. Here’s what else we learned from Tuesday’s House hearing.

Elon Musk sets new date for Tesla robotaxi reveal, calls everything beyond autonomy 'noise'

Chipotle expected to post big Q2 earnings as it remains competitive on value

Adobe releases new Firefly AI tools for Illustrator and Photoshop

These caftans are easy, breezy and great for the summer heat, from just $15

Alphabet to invest another $5B into Waymo

Buccaneers' Randy Gregory does not report for training camp after missing mandatory minicamp

'Bachelorette' contestants are flaunting their emotional intelligence. Is 'therapy speak' a rose or a thorn?

2025 Mercedes-AMG GT 63 S E Performance First Drive: The GT 63, but more