Scale AI
Supplies high-quality training data and AI evaluation services across the stack—from model fine-tuning to enterprise deployment.
1. Core Product / Service
Scale AI operates as a comprehensive AI infrastructure platform with three primary product lines: Scale Data Engine (training data generation and curation for frontier models), Scale GenAI Platform (enterprise-grade generative AI deployment), and Scale Donovan (agentic AI systems for business operations). The company also runs Scale Labs, a research division launched in March 2026 focused on post-training evaluation methods, enterprise deployment strategies, and safety oversight.
At its core, Scale provides data annotation, reinforcement learning from human feedback (RLHF), model evaluation, and red-teaming services. The company maintains subsidiary platforms: Remotasks for crowdsourced computer vision and autonomous vehicle data, and Outlier for generative AI contributor work. Scale claims 90% of the world's leading generative AI model builders use its data services, positioning itself as foundational infrastructure for frontier model development.
The technical approach emphasizes data quality and governance across the entire AI stack. Scale operates as an LLM red team for adversarial testing, helping identify vulnerabilities and biases. For enterprise customers, the platform addresses a critical problem: Scale states that "most AI deployments in enterprise and government fail," with its solutions keeping "humans in the loop" rather than enabling fully autonomous systems.
2. Target Users & Pain Points
Scale serves three distinct customer segments: (1) frontier AI labs and tech companies (Meta, OpenAI, Google, Microsoft, physical intelligence startups) requiring massive volumes of high-quality training data and RLHF for model development; (2) enterprise organizations (Fortune 500 companies, government agencies like the Chief Data and AI Officer's office) seeking reliable AI deployment with human oversight and compliance; (3) specialized sectors (healthcare systems like Mayo Clinic, automotive/robotics, media/publishing) needing domain-specific model evaluation and safety testing.
Pain points include: scarcity of high-quality annotated data for training frontier models, difficulty scaling RLHF workflows across large distributed teams, model safety and bias assessment before production, enterprise AI failure rates due to data quality or responsible deployment gaps, and regulatory/compliance requirements for critical AI systems.
3. Competitive Landscape
| Company | Focus | Positioning | Key Differentiator |
|---|---|---|---|
| Surge AI | RLHF & data annotation | Bootstrapped, premium process-driven | Higher-quality workflows; $1.2B revenue (2024) |
| Mercor | Human expert evaluation & hiring | Talent-centric AI evals | 300k+ expert network; recruiting revenue stream |
| Scale AI | Full-stack data + deployment | Breadth; 90% of frontier labs | Diversity of product lines; enterprise + frontier focus |
Scale's main competitive pressure comes from surge-ai, which bootstrapped to $1.2B in 2024 revenue while maintaining higher quality standards, and mercor, which differentiates by monetizing its network of 300k+ vetted experts. However, Scale's breadth—spanning data generation, RLHF, enterprise deployment, and research—and its dominance with frontier AI labs gives it the largest addressable market. The Meta $14.3B investment created customer risk: Google and OpenAI announced shifts away from Scale post-June 2025, partly due to Meta's 49% ownership creating competitive concerns.
4. Unique Observations
The "acquihire" deal structure: Meta's $14.3B investment (49% non-voting stake) is functionally an acquihire of Alexandr Wang—the founder departed to lead Meta's AI efforts while Scale remained independent. This is unusual for a $29B valuation company and signals Meta's prioritization of Wang's talent and vision over Scale's equity upside. Wang retains a board seat but operational control passed to Jason Droege (ex-Uber). This may explain partial customer churn: Google and OpenAI shifted data labeling work post-announcement, suggesting concern about Meta access to competitive training data streams.
Customer concentration risk post-Meta: The 2025 deal fundamentally altered customer trust. With Meta owning nearly half of Scale and simultaneously being a direct frontier model competitor to Google/OpenAI, the three largest AI labs can no longer equally access Scale's services. This created a de facto forced choice for non-Meta labs, benefiting surge-ai and mercor.
Scale Labs as research defensibility: The March 2026 launch of Scale Labs (post-Wang departure) suggests Scale is pivoting from pure execution services toward research and evaluation capabilities. This mimics OpenAI's research divisions—positioning data as a form of intellectual property and moat rather than a commodity service.
5. Financials / Funding
- Total raised (primary equity): $1.60B
- Latest valuation: $29.0B
| Date | Round | Amount | Post-money | Lead investor(s) |
|---|---|---|---|---|
| 2016-08 | Seed | $0.00B | — | Y Combinator |
| 2017-05 | Series A | $0.00B | — | Accel |
| 2018-08 | Series B | $0.02B | — | Index Ventures |
| 2019-08 | Series C | $0.10B | $1.0B | Founders Fund |
| 2020-12 | Series D | $0.15B | $3.5B | Tiger Global Management |
| 2021-04 | Series E | $0.33B | $7.3B | Dragoneer Investment Group; Greenoaks Capital; Tiger Global Management |
| 2024-05 | Series F | $1.00B | $13.8B | Accel |
| 2025-06 | Strategic | $14.30B | $29.0B | Meta Platforms |
Note: 2024 revenue reported at $870M; 2026 projected at $2B (130% growth rate), one of the fastest in AI infrastructure.
6. People & Relationships
Founders:
- Alexandr Wang (Founder, CEO until June 2025; joined Meta as part of strategic investment; retains Scale board seat)
- Lucy Guo (Co-founder; ex-Quora alongside Wang)
Current Leadership:
- Jason Droege (CEO from June 2025 onwards; former Chief Strategy Officer, ex-Uber executive)
Key Investors:
- Meta Platforms ($14.3B investment, June 2025; 49% non-voting stake; now major shareholder)
- Accel (Series A 2017, Series F lead 2024)
- Tiger Global (Series D 2020; $150M)
- Founders Fund (Series C 2019; $100M)
- Dragoneer Investment Group (Series E 2021)
Notable Customers & Partnerships:
- Frontier labs: Meta, OpenAI, Google, Microsoft
- Government: Chief Data and AI Officer (CDAO), Department of Energy (Genesis Mission participant)
- Enterprise: British Petroleum, Mayo Clinic, Cengage, Shore Capital, Howard Hughes
- Safety: Center for AI Safety
Competitive & Peer Relationships: