Scale AI

Supplies high-quality training data and AI evaluation services across the stack—from model fine-tuning to enterprise deployment.

1. Core Product / Service

Scale AI operates as a comprehensive AI infrastructure platform with three primary product lines: Scale Data Engine (training data generation and curation for frontier models), Scale GenAI Platform (enterprise-grade generative AI deployment), and Scale Donovan (agentic AI systems for business operations). The company also runs Scale Labs, a research division launched in March 2026 focused on post-training evaluation methods, enterprise deployment strategies, and safety oversight.

At its core, Scale provides data annotation, reinforcement learning from human feedback (RLHF), model evaluation, and red-teaming services. The company maintains subsidiary platforms: Remotasks for crowdsourced computer vision and autonomous vehicle data, and Outlier for generative AI contributor work. Scale claims 90% of the world's leading generative AI model builders use its data services, positioning itself as foundational infrastructure for frontier model development.

The technical approach emphasizes data quality and governance across the entire AI stack. Scale operates as an LLM red team for adversarial testing, helping identify vulnerabilities and biases. For enterprise customers, the platform addresses a critical problem: Scale states that "most AI deployments in enterprise and government fail," with its solutions keeping "humans in the loop" rather than enabling fully autonomous systems.

2. Target Users & Pain Points

Scale serves three distinct customer segments: (1) frontier AI labs and tech companies (Meta, OpenAI, Google, Microsoft, physical intelligence startups) requiring massive volumes of high-quality training data and RLHF for model development; (2) enterprise organizations (Fortune 500 companies, government agencies like the Chief Data and AI Officer's office) seeking reliable AI deployment with human oversight and compliance; (3) specialized sectors (healthcare systems like Mayo Clinic, automotive/robotics, media/publishing) needing domain-specific model evaluation and safety testing.

Pain points include: scarcity of high-quality annotated data for training frontier models, difficulty scaling RLHF workflows across large distributed teams, model safety and bias assessment before production, enterprise AI failure rates due to data quality or responsible deployment gaps, and regulatory/compliance requirements for critical AI systems.

3. Competitive Landscape

Company	Focus	Positioning	Key Differentiator
Surge AI	RLHF & data annotation	Bootstrapped, premium process-driven	Higher-quality workflows; $1.2B revenue (2024)
Mercor	Human expert evaluation & hiring	Talent-centric AI evals	300k+ expert network; recruiting revenue stream
Scale AI	Full-stack data + deployment	Breadth; 90% of frontier labs	Diversity of product lines; enterprise + frontier focus

Scale's main competitive pressure comes from surge-ai, which bootstrapped to $1.2B in 2024 revenue while maintaining higher quality standards, and mercor, which differentiates by monetizing its network of 300k+ vetted experts. However, Scale's breadth—spanning data generation, RLHF, enterprise deployment, and research—and its dominance with frontier AI labs gives it the largest addressable market. The Meta $14.3B investment created customer risk: Google and OpenAI announced shifts away from Scale post-June 2025, partly due to Meta's 49% ownership creating competitive concerns.

4. Unique Observations

The "acquihire" deal structure: Meta's $14.3B investment (49% non-voting stake) is functionally an acquihire of Alexandr Wang—the founder departed to lead Meta's AI efforts while Scale remained independent. This is unusual for a $29B valuation company and signals Meta's prioritization of Wang's talent and vision over Scale's equity upside. Wang retains a board seat but operational control passed to Jason Droege (ex-Uber). This may explain partial customer churn: Google and OpenAI shifted data labeling work post-announcement, suggesting concern about Meta access to competitive training data streams.

Customer concentration risk post-Meta: The 2025 deal fundamentally altered customer trust. With Meta owning nearly half of Scale and simultaneously being a direct frontier model competitor to Google/OpenAI, the three largest AI labs can no longer equally access Scale's services. This created a de facto forced choice for non-Meta labs, benefiting surge-ai and mercor.

Scale Labs as research defensibility: The March 2026 launch of Scale Labs (post-Wang departure) suggests Scale is pivoting from pure execution services toward research and evaluation capabilities. This mimics OpenAI's research divisions—positioning data as a form of intellectual property and moat rather than a commodity service.

5. Financials / Funding

Total raised (primary equity): $1.60B
Latest valuation: $29.0B

Date	Round	Amount	Post-money	Lead investor(s)
2016-08	Seed	$0.00B	—	Y Combinator
2017-05	Series A	$0.00B	—	Accel
2018-08	Series B	$0.02B	—	Index Ventures
2019-08	Series C	$0.10B	$1.0B	Founders Fund
2020-12	Series D	$0.15B	$3.5B	Tiger Global Management
2021-04	Series E	$0.33B	$7.3B	Dragoneer Investment Group; Greenoaks Capital; Tiger Global Management
2024-05	Series F	$1.00B	$13.8B	Accel
2025-06	Strategic	$14.30B	$29.0B	Meta Platforms

Note: 2024 revenue reported at $870M; 2026 projected at $2B (130% growth rate), one of the fastest in AI infrastructure.

6. People & Relationships

Founders:

Alexandr Wang (Founder, CEO until June 2025; joined Meta as part of strategic investment; retains Scale board seat)
Lucy Guo (Co-founder; ex-Quora alongside Wang)

Current Leadership:

Jason Droege (CEO from June 2025 onwards; former Chief Strategy Officer, ex-Uber executive)

Key Investors:

Meta Platforms ($14.3B investment, June 2025; 49% non-voting stake; now major shareholder)
Accel (Series A 2017, Series F lead 2024)
Tiger Global (Series D 2020; $150M)
Founders Fund (Series C 2019; $100M)
Dragoneer Investment Group (Series E 2021)

Notable Customers & Partnerships:

Frontier labs: Meta, OpenAI, Google, Microsoft
Government: Chief Data and AI Officer (CDAO), Department of Energy (Genesis Mission participant)
Enterprise: British Petroleum, Mayo Clinic, Cengage, Shore Capital, Howard Hughes
Safety: Center for AI Safety

Competitive & Peer Relationships:

Direct competition with surge-ai (RLHF quality) and mercor (talent-driven evals)
Post-2025 customer churn driven by Google and OpenAI concerns over Meta ownership