Feature Engineering, Data Pipelines
& Nothing More.
What is Scribble Enrich?
Scribble Enrich is a feature engineering platform. It sits behind our customers’ firewalls, takes data from a lake or other store, and turns it into features. It is built for scalability, with numerous guardrails to help data science teams accelerate their productivity, whether it is in ML Model deployments, or model training.
Enrich streamlines the most laborious parts of ML model training and productionization, and does so with high auditability, reproducibility, and the highest per-core compute efficiency.
Design Principles Behind the Enrich Platform
Components & Architecture
Houses a data catalog, a labelling function, and the SDK to the platform.
Allows for quick debugging, and comparing runs of pipelines.
Houses the core — versioned auditable pipelines, each generating multiple data sets.
Surfaces important or commonly used data sets for reuse.
Provides a notebook service to run tests on data sets.