Skip to main content

Why We Founded Commerce Schema

Problem Statement

Working with eCommerce data to build insights, make decisions, and power advertising campaigns is harder than it should be. As an industry, eCommerce lags behind when it comes to data engineering and data science best practices. As an industry, we tend to rely on siloed “Marketplace Apps” that compete by offering proprietary insights and secret sauce.

The ability to customize these tools to meet a business’ specific requirements is low. Interoperability between these tools is rudimentary.

There are large, sophisticated tools that can be leveraged for eCommerce — such as enterprise CDP products, enterprise-grade data warehousing solutions, and so forth. But these platforms lack a point of view, and so leveraging them efficiently for eCommerce use cases is difficult.

The “Time to Value” of implementing enterprise tools from scratch is low. Total cost of ownership of these tools is often too high.

TLDR 🤔 ; eCommerce data strategies are nascent, because we aren’t working together as a community.


Data warehousing and cloud computer have almost become a commodity. With AI, building machine learning models is significantly easier now than it was just 2 years ago. All the component pieces are in place for eCommerce to catch up with other industries in terms of The Modern Data Stack.

Commerce Schema is an attempt to solve this problem through collaboration and open source.

Commerce Schema is a community and collection of shared specifications for working with eCommerce data - be that 1st-party eCommerce “events” (like add_to_cart or purchase events) or data warehouse entities like orders or products, or customers.

With these shared specifications (really, a shared point of view), we can collectively start building composable analytics solutions for eCommerce, that might look like the following.

Figma Board - The Modern Data-Stack for eCommerce

(Apologies for the bias towards Fueled, but as an example, it hopefully stands.)

Next Steps

To get started, we need to:

  1. Agree to a handful of high-level specifications for working with eCommerce data.
  2. Once we have those specifications, we can begin to leverage tools like dbt, Google Dataflow, and semantic layers (like to transform, blend, and work with various sources of 1st-party and eCommerce data.