Phony Cloud Platform - Appendix
Glossary
| Term | Definition |
|---|---|
| N-gram | Contiguous sequence of n items (characters or words) used for statistical pattern learning |
| PII | Personally Identifiable Information - data that can identify an individual |
| Synthetic Data | Artificially generated data that mimics real data patterns |
| Anonymization | Process of removing or replacing identifying information |
| Masking | Partially hiding data while preserving format (e.g., j***@e***.com) |
| Mock API | Simulated API for development/testing that mimics real backend behavior |
| Deterministic | Same input always produces same output (reproducible) |
| Seed | Initial value used to start a pseudorandom number generator for reproducibility |
| FK (Foreign Key) | Database constraint linking records between tables |
| Referential Integrity | Ensuring FK relationships remain valid after data transformation |
| Subset | A smaller representative portion of data maintaining relationships |
Business Terms
| Term | Definition |
|---|---|
| ARR | Annual Recurring Revenue - yearly subscription revenue |
| MRR | Monthly Recurring Revenue - monthly subscription revenue |
| LTV | Lifetime Value - total revenue expected from a customer |
| CAC | Customer Acquisition Cost - cost to acquire one customer |
| ARPU | Average Revenue Per User - average monthly revenue per customer |
| Churn | Rate at which customers cancel subscriptions |
| NRR | Net Revenue Retention - revenue retained including expansion/contraction |
| PLG | Product-Led Growth - growth driven by product usage vs sales |
| PMF | Product-Market Fit - degree to which product satisfies market demand |
Technical Terms
| Term | Definition |
|---|---|
| MCP | Model Context Protocol - standard for AI tool integration |
| SSO | Single Sign-On - unified authentication across services |
| SAML | Security Assertion Markup Language - SSO protocol |
| OIDC | OpenID Connect - authentication layer on OAuth 2.0 |
| SCIM | System for Cross-domain Identity Management - user provisioning |
| RBAC | Role-Based Access Control - permissions based on roles |
| SOC2 | Service Organization Control 2 - security compliance framework |
| HIPAA | Health Insurance Portability and Accountability Act - US healthcare privacy |
| GDPR | General Data Protection Regulation - EU privacy law |
| KVKK | Kişisel Verilerin Korunması Kanunu - Turkish data protection law |
References
Market Research
- Mordor Intelligence: Synthetic Data Market (2025-2030, 39.4% CAGR)
- Grand View Research: Synthetic Data Generation (35.3% CAGR)
- MarketsandMarkets: Synthetic Data Generation (45.7% CAGR)
Competitors
| Company | URL | Notes |
|---|---|---|
| Tonic.ai | tonic.ai | Primary competitor, enterprise-focused, $199+/mo |
| Neosync | github.com/nucleuscloud/neosync | Discontinued (acquired by Grow Therapy, Jan 2025) |
| Greenmask | greenmask.io | Open source, PostgreSQL-only |
| FakerPHP | fakerphp.github.io | PHP fake data library |
| Faker.js | fakerjs.dev | JavaScript fake data library |
Mock API Tools
| Tool | URL | Notes |
|---|---|---|
| Mockoon | mockoon.com | Open source, local-first |
| Postman | postman.com | Mock as part of platform |
| Apidog | apidog.com | All-in-one API platform |
Database Anonymization Tools
| Tool | URL | Notes |
|---|---|---|
| Greenmask | greenmask.io | Go, PostgreSQL/S3 |
| Anonimatron | realrolfje.github.io/anonimatron | Java, multi-DB |
| Divante Anonymizer | github.com/DivanteLtd/anonymizer | PHP, Magento-focused |
Technical References
- N-gram Language Models: Jurafsky & Martin, "Speech and Language Processing"
- Laravel Documentation: laravel.com/docs
- Model Context Protocol: modelcontextprotocol.io
Document Information
| Field | Value |
|---|---|
| Version | 2.3 |
| Updated | January 2026 |
| Author | Phonyland Team |
| Status | Draft |
| Domain | phony.cloud |
| GitHub | github.com/phonyland |
Version History
| Version | Date | Changes |
|---|---|---|
| 1.0 | December 2025 | Initial specification |
| 1.1 | January 2026 | Added implementation details |
| 2.0 | January 2026 | Restructured into modular documents, added security model, consolidated duplications |
| 2.1 | January 2026 | Local training decision: OSS includes full local training (files), Cloud differentiates with DB column training, team features, and infrastructure |
| 2.2 | January 2026 | New features: Data Snapshots & Rollback, CI/CD Integration, Competitors to Track, Future Capabilities (GraphQL, PII Detection, Data Quality), Model Portability emphasis, Snapshot limits in pricing, Strategic Business Plan (/business/strategic-plan) |
| 2.3 | January 2026 | Infrastructure-based pricing: Unlimited generation + UNLIMITED USERS (Starter+), no per-seat pricing, Sync Job model (1 source→1 target per job), endpoint-based Mock API limits, Complete Pricing Matrix (/product/pricing) with ALL features, Revenue-optimized language strategy (PHP→Python→TypeScript based on WTP analysis) |
Document Structure
This specification is organized into modular documents for easier navigation:
PHONY_CLOUD_SPEC/
├── README.md # This index
├── 00_OVERVIEW.md # Executive Summary, Problem Statement
├── /product/overview # Solution Ecosystem, Core Engine
├── /product/database-sync # Platform Capabilities
├── /business/market # Target Users, Competitive Analysis
├── /business/model # Pricing, Unit Economics
├── /business/go-to-market # Go-to-Market, Marketing
├── /technical/architecture # Architecture, Security, API
├── /reference/implementation # Phase 1, 2, 3 Plans
├── /reference/operations # Risks, Milestones, Exit
├── /reference/appendix # Glossary, References (this file)
├── /business/strategic-plan # Business Plan, Competitive Strategy, DX
├── 11_DEEP_STRATEGY_ANALYSIS.md # Deep Pricing & Competitive Analysis
└── /product/pricing # Complete Feature Matrix (ALL features)