Full opportunity report: OpenEuroLLM. The third path. on ThorstenMeyerAI.com — validation score, market gap, and execution plan.
TL;DR
OpenEuroLLM is a €20.6M EU-funded project involving 20 organizations to create open-source multilingual LLMs. Despite progress, compute resource constraints remain a key obstacle. The first models are expected by July 2026.
OpenEuroLLM, a pan-European consortium project funded by €20.6 million from the EU’s Digital Europe Programme, is progressing toward developing multilingual large language models, but faces significant compute resource challenges that could impact its timeline and outcomes.
The project, coordinated by Jan Hajič at Charles University in Prague and co-led by Peter Sarlin of Silo AI in Finland, involves 20 organizations across Europe, including universities, companies, and high-performance computing centers. Its goal is to create open-source multilingual LLMs accessible within the European public space.
As of March 2026, the project has achieved its first-year milestones, but its lead coordinator publicly acknowledged that “significant challenges, especially in securing more compute for creating the final models, still remain,” according to Hajič’s recent progress report. The consortium’s resource constraints mirror those faced by national projects, highlighting the scale and complexity of the enterprise.
Despite the progress, the structural limits of the consortium are becoming visible. Hajič’s statement underscores that even at a pan-European pooled scale, compute remains the bottleneck, casting doubt on whether the project can meet its July 2026 deadline for delivering the first models.
OpenEuroLLM · The Third Path.
EU Sovereign AI · Pan-EU · May 2026
OpenEuroLLM.
The third
path.
€37.4M EU budget, 20 organizations, four major EuroHPC supercomputers, 35 target languages. And the project’s coordinator says: “significant challenges in securing more compute still remain.”
Italy bet national. Portugal bet continuation. The EU bet consortium. OpenEuroLLM — coordinated by Jan Hajič at Charles University Prague, co-led by Peter Sarlin at AMD-owned Silo AI — is what the pan-European pooled-resources answer looks like in operational form. And the project lead is publicly stating that even at pan-European pooled scale, compute is the bottleneck. Each of the three sovereign-LLM answers, examined honestly, surfaces a complication the press coverage downplays.
● HAJIČ MARCH 2026 “SIGNIFICANT CHALLENGES IN SECURING MORE COMPUTE FOR FINAL MODELS STILL REMAIN” · STRUCTURAL FINDING
● COMPUTE 3M GPU HOURS LEONARDO BOOSTER + 1.5M LUMI + STRATEGIC 4 EUROHPC SYSTEMS · $7B EUROHPC CONTEXT
● THREE-WAY MINERVA FROM-SCRATCH · AMÁLIA CONTINUATION · OPENEUROLLM CONSORTIUM · ALL THREE OPERATIONAL SUMMER 2026
● YEAR ONE OUTPUTS MIXTUREVITAE · HPLT 38 REFERENCE MODELS · OPEN-SCI-REF 0.01 · TRAINING DATA CATALOGUE · MULTISYNT
● vs MINERVA ITALY 128 GPUS LEONARDO · €100M+ PNRR · OPENEUROLLM 4.5M GPU HOURS · €37.4M EU BUDGET · ORDER OF MAGNITUDE LARGER POOLED
● JULY 31 2026 FIRST MODELS · INITIAL DATASET · EVALUATION CODE · STRATEGIC MOMENT FOR EU SOVEREIGN-LLM MOVEMENT
Even at pan-European scale, compute is the bottleneck.
From the OpenEuroLLM first-year progress report, March 6, 2026. The single most important sentence in the public documentation of the project. The pan-European consortium answer — explicitly designed as the response to individual national projects’ resource constraints — is itself constrained by the same resource that limits national projects.
First-year progress and next steps · March 6, 2026
12 universities. 6 companies. 3 HPC centers. One conspicuous absence.
The OpenEuroLLM consortium combines academic NLP research, commercial AI capability, and EuroHPC supercomputing infrastructure across multiple European nations. The breadth is the strategic bet. The breadth is also the operational complication.
Eleven deliverables. Two shipped. Nine pending.
From the official deliverables roadmap. As of mid-May 2026, only two of eleven deliverables have shipped — both from July 2025. The July 31, 2026 cluster — first models, initial dataset, evaluation code — is when OpenEuroLLM becomes empirically comparable to Minerva and AMÁLIA.
Three answers. Three structural findings.
The Minerva from-scratch path. The AMÁLIA continuation path. The OpenEuroLLM consortium path. Each project surfaces an empirical complication the press coverage downplays. Each finding is harder than the framing it’s wrapped in.
Three projects. Three findings. Each one harder than the framing it’s wrapped in. Each answer is valid for its specific positioning and resource context. None of the three is “the right answer” in the abstract. The strategic discourse benefits from treating all three as data points in the same empirical experiment.
First models in six weeks. Three scenarios.
The July 31, 2026 first-models deliverable is the strategic moment for OpenEuroLLM specifically and for the European sovereign-LLM movement broadly. Three scenarios are plausible. The structurally honest framing will require acknowledging whatever the empirical results actually show.
OpenEuroLLM is one valid answer to the European sovereign-LLM question. AMÁLIA is another. Minerva is a third. Mistral is potentially a fourth — the commercial-frontier answer this essay track examines next. The strategic discourse benefits from treating all of them as complementary experiments in the same empirical question. More analysis like this is needed. Not less.
Source dossier · the receipts
AMÁLIA · The Three Hard Questions · Standalone Essay 01 · the Portuguese case study (continuation answer)
Minerva · The Opposite Path · Standalone Essay 02 · the Italian case study (from-scratch answer)
OpenEuroLLM official project page · partner list, goals, status
Deliverables roadmap · 11 items through January 2028
First year progress and next steps · March 6, 2026 · the Hajič compute statement
Launch press release · February 3, 2025
Strategic access to EuroHPC resources granted to OpenEuroLLM · December 12, 2025
MultiSynt synthetic training data · May 27, 2025 · 3M GPU hours Leonardo BOOSTER
LUMI compute allocation · May 26, 2025 · 1.5M GPU hours
open-sci-ref 0.01 · August 22, 2025
HPLT v2 · 38 Monolingual 2.15B LLMs · July 17, 2025
TechCrunch · Open source LLMs hit Europe’s digital sovereignty roadmap · Hajič interview, Mistral absence, Stasenko critique
Fortune · OpenEuroLLM · €37.4M vs $100B Stargate first tranche comparison
European Commission · STEP Seal announcement · February 3, 2025
MixtureVitae arXiv 2509.25531 · permissive open web-scale dataset
HPLT/hplt-20-monolingual-reference-models · 38 reference models
MultiSynt/MT-Nemotron-CC · first multilingual synthetic pre-training dataset
OpenEuroLLM training-data-catalogue GitHub
arXiv 2412.02595 · Nemotron-CC · Nvidia methodology underlying MultiSynt
Jan Hajič · Charles University ÚFAL · OpenEuroLLM coordinator · HPLT coordinator since 2022
Peter Sarlin · AMD Silo AI CEO · OpenEuroLLM co-lead · Silo AI $665M AMD acquisition 2024
Anastasia Stasenko · Pleias co-founder · published skepticism about consortium decision-velocity
Grant agreement No 101195233 · EU Digital Europe Programme
STEP (Strategic Technologies for Europe Platform) Seal · first Digital Europe Programme recipient
EuroHPC AI Factory Large Scale call EHPC-AIF-2025LS01-028
Finnish LUMI Extreme Scale Access program
EU AI Act · effective Feb 2, 2025 · one day after OpenEuroLLM start
LAION · Open-sci · OpenML · collaborating open-source/open-science communities
Colophon · Standalone Essay 03
Set in Source Serif 4 (display), EB Garamond (essay body), IBM Plex Sans & IBM Plex Mono. Standalone essay register · not part of the security franchise. Closes the AMÁLIA / Minerva / OpenEuroLLM three-way essay track. Free to embed with attribution.
thorstenmeyerai.com
Standalone essay 03 · European sovereign AI · the OpenEuroLLM case study · May 2026
€37.4M · 20 ORGS · 4.5M GPU HOURS · JUL 2026
Implications of Compute Constraints on European AI Development
The ongoing compute resource limitations threaten to delay or diminish the scope of the first models from OpenEuroLLM, impacting Europe’s strategic position in sovereign AI development. The project exemplifies the broader challenge faced by European initiatives: balancing ambition with practical resource constraints, which may influence future policy and investment decisions in AI infrastructure.
European Sovereign-LLM Strategies and Resource Challenges
Europe’s approach to sovereign AI development has been characterized by three main strategies: Italy’s from-scratch investment in Minerva, Portugal’s continuation pre-training with AMÁLIA, and the EU-wide pooled-resource model exemplified by OpenEuroLLM. Each strategy reflects different levels of investment, architectural commitment, and institutional scope.
Previous essays have highlighted that resource constraints, especially compute, are a common challenge. Minerva’s initial findings and AMÁLIA’s language-specific results underscore the limitations of current models and infrastructure. OpenEuroLLM, as the largest and most collaborative effort, aims to overcome these barriers but now faces the same fundamental issue: inadequate compute capacity could hinder its progress.
“Significant challenges, especially in securing more compute for creating the final models, still remain.”
— Jan Hajič, Charles University
Unresolved Impact of Compute Limitations on Model Delivery
It remains unclear whether the consortium will secure enough compute resources before July 2026 to meet its first-model delivery deadline. The extent to which resource constraints might delay or reduce the scope of the initial models is still uncertain, pending further progress and resource allocation.
Next Milestone: First Model Release and Resource Assessment
The project’s next key milestone is the delivery of the first models by July 31, 2026. The upcoming months will determine whether additional compute resources can be secured and whether the models can meet the project’s ambitious goals. A detailed assessment of resource needs and potential delays is expected in the lead-up to this deadline.
Key Questions
What is OpenEuroLLM?
OpenEuroLLM is a European consortium project funded by the EU to develop open-source multilingual large language models through collaboration among universities, companies, and HPC centers across Europe.
What are the main challenges facing the project?
The primary challenge is securing sufficient compute resources needed to train and finalize the models, which could impact the project’s timeline and model quality.
When will the first models be available?
The first models are expected to be delivered by July 31, 2026, but resource constraints could cause delays.
How does this project compare to national efforts like Minerva or AMÁLIA?
Unlike national projects, OpenEuroLLM aims for a pooled, pan-European approach, but all face similar resource limitations that impact progress and outcomes.
Will the project succeed despite current challenges?
It is uncertain. Success depends on securing additional compute resources and managing technical hurdles before the July 2026 deadline.
Source: ThorstenMeyerAI.com