SCRIBBLE

ENRICH

Feature Engineering, Data Pipelines
& Nothing More.
scribble assembly line_assembly line ill
What is Scribble Enrich?

Scribble Enrich is a feature engineering platform. It sits behind our customers’ firewalls, takes data from a lake or other store, and turns it into features. It is built for scalability, with numerous guardrails to help data science teams accelerate their productivity, whether it is in ML Model deployments, or model training.

Enrich streamlines the most laborious parts of ML model training and productionization, and does so with high auditability, reproducibility, and the highest per-core compute efficiency.

Design Principles Behind the Enrich Platform
Robustness
catalog_1x.png

Audit framework to increase trust 
 

Scale & Performance
labeling_1x.png

Prepare to scale models with new usecases, and new data

Quick Time-to-Market
audit_1x.png

Application framework to cut down development time for each model

Manage Evolution
audit_1x.png

Rapid iteration on the features and model development

SCRIBBLE ENRICH

Components & Architecture

Catalog
catalog_1x.png

A lightweight data catalog to continuously document what is in the data store

 

Labelling
labeling_1x.png

Generate labeled datasets or extend master for richer features

Audit
audit_1x.png

Search interface to understand lineage of every dataset
 

Health
health_1x.png

A programmable health check monitor of data flowing into the data store
 

Core
core_1x.png

Versioned auditable feature computation pipelines
 

Marketplace
marketplace_1x.png

Discover features being computed by the system (for status & reuse)
 

Augment
augment_1x.png

Extend data by linking with thirdparty datasets

 

Monitor
monitor_1x.png

Monitor model performance
 

Search
search_1x.png

Filter and export datasets
 

arch_scribble-01.png
arch_scribble_Artboard 2.png
 
Scribblescribble1