Phony Cloud Platform
"Realistic data. Zero risk."
Phony is the next-generation synthetic data platform combining open-source developer tools with enterprise-grade cloud services.
The Vision
Phony is the next-generation synthetic data platform that combines open-source developer tools with enterprise-grade cloud services. Unlike competitors that rely on expensive LLMs or static word lists, Phony uses statistical learning to generate data that's indistinguishable from real data—fast, cheap, and private.
┌─────────────────────────────────────────────────────────────────────────┐
│ PHONY ECOSYSTEM │
│ │
│ ┌─────────────┐ │
│ │ PHONY │ │
│ │ CORE │ │
│ │ │ │
│ │ N-gram │ │
│ │ Statistical│ │
│ │ Engine │ │
│ └──────┬──────┘ │
│ │ │
│ ┌───────────┴───────────┐ │
│ ▼ ▼ │
│ ┌───────────────────┐ ┌───────────────────────┐ │
│ │ PHONY │ │ PHONY CLOUD │ │
│ │ Open Source │ │ phony.cloud │ │
│ │ │ │ │ │
│ │ • CLI & Library │ │ • Web Dashboard │ │
│ │ • Local Training│ │ • DB Column Training│ │
│ │ • Any Language │ │ • Database Sync │ │
│ │ • Any Framework │ │ • Mock API Hosting │ │
│ │ • Free Forever │ │ • Team Features │ │
│ └───────────────────┘ └───────────────────────┘ │
│ │
│ For: Individual devs For: Teams & Companies │
│ Cost: Free Cost: $29-199/mo │
└─────────────────────────────────────────────────────────────────────────┘The Problem
Production Data in Non-Production Environments
Companies copy production databases to staging, development, and QA environments—exposing real customer data to breaches and compliance violations.
Unrealistic Test Data
Traditional fake data tools generate generic, predictable output that doesn't match real data distributions.
Faker output: Your real data:
───────────── ──────────────
"John Smith" "Mehmetcan Yılmaz"
"123 Main St" "Atatürk Mah. 15/3"
"Lorem ipsum..." Domain-specific textTests pass with fake data but fail with real data.
Existing Solutions Fall Short
| Solution | Limitation |
|---|---|
| Faker | Static lists, no learning, generic |
| Tonic Structural | Expensive ($199+/mo), enterprise-only |
| Tonic Fabricate | LLM-based = slow, costly, non-deterministic |
| Greenmask | PostgreSQL only, no mock API |
The Opportunity
| Market | Current (2026) | Projected (2031) | CAGR |
|---|---|---|---|
| Synthetic Data | ~$700M | $8-10B | 35%+ |
| Test Automation | $30B | $100B+ | 17% |
Gap in market: No solution combines statistical learning + mock API + Laravel-first design.
Key Numbers
| Metric | Target |
|---|---|
| Exit ARR | $600K-1M |
| Exit Multiple | 5-8x ARR |
| Exit Value | $3-8M (realistic) |
| Customers at Exit | 300-500 |
| Timeline | 4-5 years |
Pricing Overview
★ ALL TIERS: UNLIMITED GENERATION + UNLIMITED USERS ★
| Tier | Price | Sync Jobs | Endpoints | Snapshots |
|---|---|---|---|---|
| FREE | $0 | 1 | 5 | 3 |
| STARTER | $29/mo | 5 | 30 | 10 |
| TEAM | $79/mo | 15 | 150 | 50 |
| BUSINESS | $199/mo | Unlimited | Unlimited | Unlimited |
Why unlimited generation? Our N-gram engine costs ~$0.0000001/record vs LLM's $0.001-$0.10/record.
OSS vs Cloud
| Feature | OSS (Free) | Cloud (Paid) |
|---|---|---|
| Pre-trained models | ✓ | ✓ |
| Local model training | ✓ | ✓ |
| Model portability (.phony) | ✓ | ✓ |
| DB column training | ✗ | ✓ |
| Database sync | ✗ | ✓ |
| Data snapshots | ✗ | ✓ |
| Mock API hosting | ✗ | ✓ |
| Team collaboration | ✗ | ✓ |
Tech Stack
┌────────────────┐ ┌────────────────┐ ┌────────────────┐
│ NUXT │ │ GO │ │ RUST │
│ Dashboard │ → │ Engine │ → │ Core │
├────────────────┤ ├────────────────┤ ├────────────────┤
│ Vue 3 + TS │ │ pgx (Postgres) │ │ N-gram (5M/s) │
│ Auth.js │ │ go-mysql │ │ MessagePack │
│ Stripe SDK │ │ asynq (queue) │ │ FFI to Go │
└────────────────┘ └────────────────┘ └────────────────┘