Meta releases Llama 4, a new crop of flagship AI models

Importance Score: 72 / 100 🔴

Meta has launched its latest suite of artificial intelligence models, Llama 4, marking a new advancement in its Llama family of large language models. This release, unexpectedly occurring on a Saturday, introduces a collection of AI systems designed for enhanced capabilities.

Introducing the Llama 4 Series

The Llama 4 series comprises four distinct models: Llama 4 Scout, Llama 4 Maverick, and Llama 4 Behemoth. Meta has stated that these models were trained using extensive datasets of unlabeled text, images, and video, equipping them with comprehensive visual understanding and multimodal processing abilities.

Background and Development Context

The rapid progress of open-source models from Chinese AI laboratory DeepSeek, which have demonstrated performance levels comparable to or exceeding Meta’s prior Llama models, is reported to have significantly accelerated Llama 4 development. Meta reportedly initiated intensive efforts to analyze how DeepSeek achieved reduced operational and deployment costs for models such as R1 and V3, indicating a competitive drive to optimize efficiency and performance in their AI offerings.

Availability and Current Applications

Llama 4 Scout and Maverick are currently accessible via Llama.com and through Meta’s partnerships, including the AI developer platform Hugging Face. Behemoth is still undergoing training. Meta has also announced that Meta AI, its AI assistant integrated across platforms like WhatsApp, Messenger, and Instagram, has been upgraded to utilize Llama 4 in 40 countries. However, multimodal functionalities are presently limited to English within the U.S.

Licensing Considerations and EU Restrictions

The licensing terms for Llama 4 may present concerns for some developers. Notably, entities and organizations based or having their primary business location in the European Union are restricted from using or distributing these models. This limitation is likely a consequence of regulatory requirements imposed by the EU’s legislation concerning artificial intelligence and data privacy. Meta has previously expressed criticism of these regulations, deeming them excessively burdensome. Additionally, similar to earlier Llama releases, companies exceeding 700 million monthly active users must request a specific license from Meta, which can be granted or refused at Meta’s discretion.

Meta’s Perspective on Llama 4

“These Llama 4 models represent the dawn of a new phase for the Llama ecosystem,” Meta stated in a published blog post. “This release marks just the beginning for the Llama 4 collection, with further advancements anticipated.”

Mixture of Experts (MoE) Architecture

Meta highlights that Llama 4 is the first model series to employ a Mixture of Experts (MoE) architecture. This design is intended to enhance computational efficiency during both training and query processing. MoE architectures function by dividing complex data processing tasks into smaller subtasks and delegating these to specialized, smaller “expert” models, optimizing resource utilization and potentially improving response times.

Model Specifications: Maverick and Scout

For instance, Maverick incorporates 400 billion total parameters, but only 17 billion active parameters distributed across 128 experts. Parameters are generally understood to reflect a model’s problem-solving capabilities. Scout features 17 billion active parameters, 16 experts, and 109 billion total parameters.

Performance Benchmarks and Comparisons

According to Meta’s internal evaluations, Maverick, positioned as optimal for “general assistant and chat” applications like creative writing, outperforms models such as OpenAI’s GPT-4o and Google’s Gemini 2.0 in specific benchmarks assessing coding, reasoning, multilingual capabilities, long-context handling, and image understanding. However, Maverick does not quite reach the performance levels of more advanced contemporary models including Google’s Gemini 2.5 Pro, Anthropic’s Claude 3.7 Sonnet, and OpenAI’s GPT-4.5.

Scout’s Unique Strengths and Capabilities

Scout’s primary advantages are in areas such as document summarization and complex reasoning across extensive codebases. Notably, it boasts a substantial context window of 10 million tokens. Tokens represent units of raw text—for example, the word “fantastic” is segmented into “fan,” “tas,” and “tic.” In simpler terms, Scout can process images and millions of words, allowing it to effectively manage and analyze extremely lengthy documents, making it well-suited for tasks requiring extensive contextual awareness.

Hardware Requirements

Scout is designed to operate on a single Nvidia H100 GPU, whereas Maverick necessitates an Nvidia H100 DGX system or comparable hardware, according to Meta’s estimations.

Behemoth: The High-Performance Model

Meta’s Behemoth model, not yet released, will demand even more robust hardware infrastructure. The company indicates that Behemoth features 288 billion active parameters, 16 experts, and almost two trillion total parameters. Meta’s internal benchmarking suggests that Behemoth surpasses GPT-4.5, Claude 3.7 Sonnet, and Gemini 2.0 Pro (though not 2.5 Pro) on several assessments evaluating STEM proficiencies, such as mathematical problem-solving.

Reasoning Model Considerations

It is important to note that none of the Llama 4 models are classified as true “reasoning” models akin to OpenAI’s o1 and o3-mini. Reasoning models incorporate fact-checking in their responses and generally provide more reliable answers but typically require more time to generate responses compared to traditional, “non-reasoning” models.

Addressing Contentious Queries and Bias

Meta has stated that all Llama 4 models have been refined to reduce instances of refusing to answer “disputed” questions. According to the company, Llama 4 will now address debated political and social subjects that previous Llama models avoided. Furthermore, Meta asserts that Llama 4 demonstrates “significantly improved fairness” in determining which prompts it will decline to engage with altogether.

“[Y]ou can expect [Llama 4] to deliver useful, accurate responses without prejudice,” stated a Meta spokesperson to TechCrunch. “[W]e are continually working to make Llama more responsive so that it answers a broader range of questions, can address diverse perspectives […] and avoids favoring specific viewpoints over others.”

Political Discourse and AI Chatbot Bias

These adjustments come amidst accusations from some allies of the White House who claim that AI chatbots exhibit excessive political “wokeness.”

Numerous close associates of former President Donald Trump, including Elon Musk and David Sacks, a figure prominent in crypto and AI sectors, have asserted that widely used AI chatbots purportedly censor conservative viewpoints. Sacks has specifically criticized OpenAI’s ChatGPT as being “programmed to be woke” and inaccurate on political topics.

The Technical Challenges of AI Bias

In reality, bias in AI represents a complex technical challenge. Musk’s own AI venture, xAI, has encountered difficulties in creating a chatbot that does not inherently favor certain political stances over others, highlighting the complexities involved in achieving neutrality in AI systems.

Despite these challenges, companies such as OpenAI are actively modifying their AI models to answer a wider spectrum of questions than previously, particularly those concerning sensitive subjects, reflecting an ongoing effort to enhance the responsiveness and inclusivity of AI chatbots.


🕐 Top News in the Last Hour By Importance Score

# Title 📊 i-Score
1 10 Ways to Beat Seasonal Allergies for Better Sleep 🔴 65 / 100
2 Aetherflux raises $50 million for space-based solar power 🔴 65 / 100
3 Thousands join Paris far-right march against Le Pen’s election ban 🔴 65 / 100
4 Archaeologists stunned by discovery of human relics in 150,000-year-old rainforest 🔴 65 / 100
5 Real life ITV Grace location hit by major blow as show makes huge return 🔵 60 / 100
6 How to Maintain Healthy Eyes at Every Stage of Life 🔵 55 / 100
7 A Minecraft Movie storms box office despite lukewarm reviews 🔵 45 / 100
8 Americans hit with 'culture shock' over UK bathrooms as 5 major differences revealed 🔵 40 / 100
9 SNL Pokes Fun at Morgan Wallen's Abrupt Exit to "God's Country" 🔵 35 / 100
10 Helen Flanagan reveals new boyfriend has moved out as she shares candid kids admission 🔵 30 / 100

View More Top News ➡️