Home / Company / Scale AI

Scale AI

Last Updated November 24, 2025

Scale AI provides data infrastructure, model evaluation, and defense-grade AI platforms for leading technology companies and government agencies, combining high-quality labeled data, tooling, and security clearances to power large-scale AI systems in both commercial and national-security environments.
Company Overview: Scale AI
Scale AI sits at a structurally important layer of the AI stack: it is the connective tissue between raw data and production-grade AI systems for some of the largest technology and defense organizations in the world. The Meta transaction delivers unprecedented capital and a flagship strategic customer while validating Scale’s role as core AI infrastructure. At the same time, Scale has moved up the value chain into high-leverage products like Scale Evaluation and Donovan, positioning itself to capture a growing share of AI development budgets as enterprises and governments seek to operationalize and govern AI systems in mission-critical settings.
Investment Highlights

Explosive Revenue Scale

Scale generated roughly $870M in revenue in 2024 and is forecasting more than $2B in 2025, reflecting rapid adoption of its platforms across generative AI labs, enterprises, and government programs.

Transformational Meta Partnership

In June 2025, Meta agreed to invest about $14.3B for a 49% non-voting stake at a $29B valuation, bringing founder Alexandr Wang into Meta as Chief AI Officer while he remains on Scale’s board. This is Meta’s second-largest deal and gives Scale a deep, long-term anchor customer for data and evaluation services.

Defense and Government Leadership

Scale has become a key AI contractor for the U.S. government, including a $99M R&D contract with the U.S. Army, a five-year $100M-ceiling DoD platform contract on top-secret networks, and the Defense Llama program built on Meta’s Llama models. Its FedRAMP High and IL-4 authorizations, plus classified network access, create a high barrier to entry.

Central Role in AI Data Infrastructure

The company’s Data Engine and annotation workforce (via subsidiaries such as Remotasks and Outlier) have powered training data for OpenAI, Google, Microsoft, Meta, and leading autonomous-vehicle and geospatial programs. Years of labeled datasets and workflow optimizations underpin a defensible data and process moat.

Move Up the Stack into Evaluation and Decision Platforms

Newer products such as Scale Evaluation and Scale Donovan position the company beyond data labeling into high-value evaluation, governance, and decision-support layers, aligning with emerging enterprise and government needs to measure, test, and operationalize AI systems rather than merely train them.

Capitalized for Long-Term Infrastructure Build-Out

Including traditional VC rounds and the Meta transaction, Scale has raised nearly $16B, providing a multi-year runway to invest in infrastructure, government programs, and platform R&D while competitors often remain constrained by capital and security clearances.

Product & Technology Leadership

Scale Data Engine

End-to-end data platform that ingests raw images, video, text, and sensor data, then applies machine-assisted workflows and human annotators to produce high-quality training datasets for computer vision, LLMs, robotics, and geospatial intelligence.

Scale GenAI Platform

Infrastructure and tooling for building, fine-tuning, and deploying generative AI models, including custom LLMs for defense, autonomous systems, and enterprise use cases. Supports integration with commercial and open-source models.

Scale Donovan

Decision-support platform that applies generative AI to unstructured data and geospatial feeds to surface recommendations at “mission speed”. Deployed with agencies such as the U.S. Defense Logistics Agency to help manage large, complex operations.

Scale Evaluation

Evaluation suite launched in 2025 that benchmarks LLMs against standardized and custom tests, highlighting failure modes, safety issues, and areas requiring additional training data. Used by enterprises and government organizations seeking reliable AI governance.

Defense Llama

A large language model built on Meta’s Llama family and fine-tuned for defense and intelligence workflows, trained on military doctrine and policy guidelines and deployed only in controlled, secure government environments.

Secure Government Infrastructure

Scale operates environments certified at FedRAMP High and DoD IL-4 levels, with access to classified networks such as SIPR and JWICS, enabling it to host sensitive training data and models for U.S. defense and intelligence customers.

Annotation Workforce & Tooling

Subsidiaries and partners provide a distributed global workforce for high-throughput annotation, augmented by internal QA tools and model-in-the-loop pipelines that drive both scale and quality across computer vision and LLM tasks.

 Market Position & Strategic Advantage

Position in the AI Stack

Scale sits between raw data sources and production AI applications, providing the labeled data, evaluation, and decision platforms that frontier labs, enterprises, and defense organizations rely on to build, test, and field AI systems.

Defense and National Security Foothold

With multi-year contracts from the U.S. Army and DoD, plus specialized programs like Thunderforge and Defense Llama, Scale has secured a durable role in U.S. defense AI modernization efforts and is expanding into international government partnerships such as a five-year agreement with Qatar.

Impact of the Meta Transaction

The $14.3B Meta investment both validates Scale’s strategic importance and reshapes its customer landscape. Meta becomes a dominant strategic partner and user of Scale’s platforms, while some frontier labs such as Google have begun shifting workloads to alternative vendors following the deal, prompting internal restructuring of Scale’s legacy labeling units.

Competitive Landscape

Scale competes with data-labeling platforms (Labelbox, Appen), specialized providers (Surge AI), cloud-native tools (AWS SageMaker Ground Truth), and in-house teams at major AI labs. Its differentiators include deep government clearances, breadth of data and evaluation offerings, and significant capital backing.

Brand and Ecosystem

The company has become synonymous with high-end AI training data and evaluation, is featured on the 2025 CNBC Disruptor 50 list, and participates in high-profile AI safety and policy initiatives, including work with the U.S. AI Safety Institute.

Financial Opportunity

High-Growth Revenue Profile

Scale’s reported $870M in 2024 revenue and projected $2B+ in 2025 place it among the fastest-growing private AI infrastructure businesses, with revenue roughly doubling year-over-year as AI spending accelerates.

Leverage to AI and Defense Budgets

The business is directly tied to two expanding spend categories: AI development by large technology companies and AI modernization within defense and intelligence agencies. Multi-year contracts and long-term partnerships provide revenue visibility.

Upside from Platform Expansion

As customers move beyond raw data labeling toward evaluation, governance, and operational decision support, Scale’s newer platforms (Evaluation, Donovan, GenAI) have potential to increase average contract values and deepen account penetration.

Capital Structure and Strategic Optionality

The Meta stake delivers substantial capital and strategic alignment while Scale remains an independent company with its own board and non-voting outside shareholder structure. This creates flexibility for future financing, partnerships, or public-market entry once the business matures.

Long-Term Exposure to AI Infrastructure TAM

Given rising AI model complexity and the growing emphasis on evaluation and safety, Scale offers ongoing exposure to the expanding AI infrastructure market where data quality and governance are increasingly central to deployment decisions.

Company Snapshot
Founded
2016
Founders
Alexandr Wang, Lucy Guo
Headquarters
San Francisco, California, USA
Sector / Primary Focus
AI data infrastructure, model evaluation, and defense AI platforms
Latest Reported Valuation
$29.0B (June 2025)
Latest Round / Structure
$14.3B strategic investment by Meta for 49% non-voting stake
Total Capital Raised
~$1.6B venture funding pre-Meta; $15.9B including Meta investment
2024 Revenue
≈$870M
2025 Revenue Outlook
>$2B projected
Employees
≈1,400 (post-mid-2025 restructuring, est.)
Key Customers
Meta, Microsoft, Toyota, General Motors, SAP, U.S. DoD (Army, Joint AI Center), Qatar
Government Certifications
FedRAMP High, DoD IL-4, Top Secret network access (SIPR/JWICS)
Notable Recognition
CNBC Disruptor 50 #28 (2025)
How Summit Ventures Works

Summit Ventures offers accredited investors exclusive access to shares in the secondary market, providing:

  • Network Access: Exposure to opportunities sourced through Summit’s
    relationships.
  • Market-Based Pricing: Valuations informed by current private-market activity.
  • Simplified Process: Streamlined subscription and administrative support.
  • Portfolio Exposure: Participation in select private technology companies.
Risk Disclaimer

Investment in private companies involves substantial risk and is suitable only for sophisticated investors who can bear the loss of their entire investment. Past performance is not indicative of future results.

About Scale AI

Scale AI is a leading provider of AI data infrastructure, model evaluation, and defense-focused AI platforms. Founded in 2016 by Alexandr Wang and Lucy Guo, the company supplies high-quality labeled data, tooling, and evaluation services used by OpenAI, Google, Microsoft, Meta, automotive OEMs, and the U.S. Department of Defense. Its platform spans data curation, annotation, model testing, and decision-support systems such as Scale Donovan and Defense Llama, making Scale a core enabler of modern AI development for both commercial enterprises and national security customers.

In June 2025, Meta agreed to invest about $14.3 billion for a 49% non-voting stake in Scale AI, valuing the company at $29 billion and bringing founder Alexandr Wang into Meta as Chief AI Officer while he remains on Scale’s board. This transaction more than doubled Scale’s prior $13.8 billion valuation from its May 2024 Series F round and marked Meta’s second-largest deal after WhatsApp.

Scale generated roughly $870 million in revenue in 2024 and is projecting more than $2 billion in 2025, supported by strong demand from frontier AI labs and large government programs in defense, logistics, and geospatial intelligence. The company holds FedRAMP High authorization and supports IL-4 and classified environments, giving it a privileged position in sensitive U.S. government AI programs. In 2025 it was awarded a $99 million U.S. Army R&D contract running through 2030 and a separate five-year Department of Defense contract with a $100 million ceiling to provide AI platforms on top-secret networks, as well as becoming a third-party evaluator for the U.S. AI Safety Institute.

Scale continues to expand beyond its origins in data labeling into full-stack AI infrastructure: Scale Data Engine for training data creation, Scale GenAI Platform for custom model development, Scale Evaluation for systematic testing of LLMs, and Scale Donovan for operational decision support in defense and logistics. At the same time, the Meta partnership has reshaped the business landscape: Google, formerly one of Scale’s largest customers, has begun shifting work to other providers following the Meta deal, and Scale announced layoffs in mid-2025 in parts of its data-labeling arm as the market evolves.

Overall, Scale AI occupies a central role in the AI ecosystem as a data and evaluation backbone for both commercial and government AI programs, now operating with deep capitalization, a major strategic partner in Meta, and a growing portfolio of higher-value platform products.