Service
Data Products & Starter Kits
Accelerate your data time-to-value with pre-built models, reference datasets, and LLM-extracted data.
Semantic Starter Kits
Most companies need roughly the same first year of data projects: product revenue mix, churn prediction, cohort analysis, customer segmentation. We package these up for specific industries and business functions into a Semantic Starter Kit that includes:
- A "silver" and "gold" layer data model
- Reference visualizations and dashboard templates
- Python scripts for transformation and analysis
- Data catalog metadata with full lineage
- LLM-optimized model for natural language queries
Three ways to acquire
Published Datasets
Over a career you repeatedly need the same missing datasets. A complete calendar dimension. Sets of intervals for grouping timestamps. Mapping postal codes to latitude/longitude. We publish and license these foundational datasets so you don't have to build them from scratch.
LLM Data Extraction
We use LLMs to create datasets by drawing on training data. Extracting data from LLMs at scale is difficult because of high compute cost, estimation challenges, legal considerations, validation, and tracking changes over time. We deliver the datasets you need within your token budget.