Framework Definition

Synthetic COGS

Coined by Richard Ewing, Product Economist

Definition

Synthetic COGS (Cost of Goods Sold) is a financial framework that reclassifies computational intelligence as a direct, variable cost tied to every individual software transaction. It marks the definitive end of the "Zero Marginal Cost" era that defined SaaS economics for two decades. In traditional cloud software, COGS primarily consisted of predictable hosting, storage, and bandwidth fees. The cost to serve user #1 was essentially the same as the cost to serve user #10,000. Under this model, companies were incentivized to drive infinite engagement because the marginal cost of a new user query was near zero. In the AI-native era, this paradigm is inverted. Intelligence is not a fixed asset; it is a consumed resource. Every interaction with an LLM triggers a complex chain of computational expenses: embedding the user query, conducting a semantic similarity search against a vector database, packaging the context window, and running the generative inference step. Each of these components carries a specific, variable price tag. Synthetic COGS forces product managers to recognize that they are essentially buying raw compute and reselling it to users as intelligence. If a user asks a complex question that requires scanning a massive RAG (Retrieval-Augmented Generation) corpus and generating a 2,000-token response, that single query has a discrete cost that must be subtracted from the user's lifetime value. Failing to model Synthetic COGS accurately leads to the "AI Scaling Paradox": the more successful and engaging your product becomes, the faster you burn cash, because user engagement scales your inference costs directly against your flat subscription revenue.

Why It Matters

Synthetic COGS is the most critical metric for determining the viability of an AI business model. It forces engineering and product teams to stop treating inference costs as a general cloud expense and start attributing them directly to feature-level P&Ls. For investors, auditing Synthetic COGS reveals whether an AI startup is actually building a sustainable software business or merely acting as an unprofitable, subsidized wrapper for OpenAI or Anthropic. For engineering leadership, understanding Synthetic COGS is required to prioritize architectural decisions like caching layers, model distillation, and deterministic routing. If you don't know the unit cost of intelligence, you cannot optimize it.

How to Calculate

  1. 1Map the full infrastructure footprint of a single AI transaction (Embedding + Vector Search + Prompt Tokens + Completion Tokens).
  2. 2Assign the exact API or compute cost to each step in the transaction chain.
  3. 3Multiply the total unit cost by the expected daily query volume per user.
  4. 4Compare the monthly Synthetic COGS per user against their monthly subscription fee to calculate gross margin.
  5. 5Track the change in Synthetic COGS as model accuracy requirements increase (The Cost of Predictivity).

Related Articles

Citation

To cite this definition:

Ewing, R. (2026). "Synthetic COGS." richardewing.io.
https://www.richardewing.io/articles/frameworks/synthetic-cogs