SCRIBBLE

ENRICH

Feature Engineering, Data Pipelines
& Nothing More.
What is Scribble Enrich?

Scribble Enrich is a feature engineering platform. It sits behind our customers’ firewalls, takes data from a lake or other store, and turns it into features. It is built for scalability, with numerous guardrails to help data science teams accelerate their productivity, whether it is in ML Model deployments, or model training.

Enrich streamlines the most laborious parts of ML model training and productionization, and does so with high auditability, reproducibility, and the highest per-core compute efficiency.

Design Principles Behind the Enrich Platform
Robustness

Audit framework to increase trust 
 

Scale & Performance

Prepare to scale models with new usecases, and new data

Quick Time-to-Market

Application framework to cut down development time for each model

Manage Evolution

Rapid iteration on the features and model development

SCRIBBLE ENRICH

Components & Architecture

Catalog

A lightweight data catalog to continuously document what is in the data store

 

Labelling

Generate labeled datasets or extend master for richer features

Audit

Search interface to understand lineage of every dataset
 

Health
health_1x.png

A programmable health check monitor of data flowing into the data store
 

Core

Versioned auditable feature computation pipelines
 

Marketplace
marketplace_1x.png

Discover features being computed by the system (for status & reuse)
 

Augment

Extend data by linking with thirdparty datasets

 

Monitor
monitor_1x.png

Monitor model performance
 

Search
search_1x.png

Filter and export datasets
 

arch_scribble_Artboard 2.png
 
Scribblescribble1