
SCRIBBLE
ENRICH
Feature Engineering, Data Pipelines
& Nothing More.

What is Scribble Enrich?
Scribble Enrich is a feature engineering platform. It sits behind our customers’ firewalls, takes data from a lake or other store, and turns it into features. It is built for scalability, with numerous guardrails to help data science teams accelerate their productivity, whether it is in ML Model deployments, or model training.
Enrich streamlines the most laborious parts of ML model training and productionization, and does so with high auditability, reproducibility, and the highest per-core compute efficiency.
Design Principles Behind the Enrich Platform
Robustness

Audit framework to increase trust
Scale & Performance

Prepare to scale models with new usecases, and new data
Quick Time-to-Market

Application framework to cut down development time for each model
Manage Evolution

Rapid iteration on the features and model development
SCRIBBLE ENRICH
Components & Architecture
Catalog

A lightweight data catalog to continuously document what is in the data store
Labelling

Generate labeled datasets or extend master for richer features
Audit

Search interface to understand lineage of every dataset
Health

A programmable health check monitor of data flowing into the data store
Core

Versioned auditable feature computation pipelines
Marketplace

Discover features being computed by the system (for status & reuse)
Augment

Extend data by linking with thirdparty datasets
Monitor

Monitor model performance
Search

Filter and export datasets

