Service

Data Products & Starter Kits

Get value from your data sooner with pre-built models, reference datasets, and LLM-extracted data.

Semantic Starter Kits

Most companies need roughly the same first year of data projects: product revenue mix, churn prediction, cohort analysis, customer segmentation. We package these up for specific industries and business functions into a Semantic Starter Kit that includes:

  • A "silver" and "gold" layer data model
  • Reference visualizations and dashboard templates
  • Python scripts for transformation and analysis
  • Data catalog metadata with full lineage
  • LLM-optimized model for natural language queries

Three ways to acquire a kit

Purchase

Buy the kit outright and customize it with your own team.

Subscribe

Receive ongoing updates as the model evolves and new features ship.

Managed

We deploy, customize, and maintain the kit for you.

Published Datasets

Over a career, you end up rebuilding the same missing datasets again and again: a complete calendar dimension, sets of intervals for grouping timestamps, a mapping from postal codes to latitude/longitude. We publish and license these foundational datasets so you don't have to build them from scratch.
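
To make "calendar dimension" concrete, here is a minimal Python sketch of one: a table with one row per day and the attributes you usually end up joining on. The function name and column names are hypothetical, chosen for illustration only; they are not the schema of our published dataset, which may cover more attributes.

```python
from datetime import date, timedelta


def build_calendar(start: date, end: date) -> list[dict]:
    """Return one row per day from start to end, inclusive.

    Column names are illustrative assumptions, not a published schema.
    """
    rows = []
    day = start
    while day <= end:
        iso_year, iso_week, iso_weekday = day.isocalendar()
        rows.append({
            # Integer key in YYYYMMDD form, a common convention for date dimensions
            "date_key": day.year * 10000 + day.month * 100 + day.day,
            "date": day.isoformat(),
            "year": day.year,
            "quarter": (day.month - 1) // 3 + 1,
            "month": day.month,
            "day_of_month": day.day,
            # ISO year and week can differ from the calendar year near January 1
            "iso_year": iso_year,
            "iso_week": iso_week,
            "iso_weekday": iso_weekday,  # 1 = Monday ... 7 = Sunday
            "is_weekend": iso_weekday >= 6,
        })
        day += timedelta(days=1)
    return rows


calendar_2024 = build_calendar(date(2024, 1, 1), date(2024, 12, 31))
```

A real calendar dimension typically adds fiscal periods and holidays, which depend on your company and region and are part of why building one properly takes longer than it looks.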

LLM Data Extraction

We use LLMs to create datasets by drawing on the knowledge in their training data. Extracting data from LLMs at scale is difficult: compute costs are high, total cost is hard to estimate up front, and the results raise legal questions, need validation, and change over time as models change. We deliver the datasets you need within your token budget.
