ETL pipeline: prepare -> validate -> transform -> route.
Demonstrates a full data ETL flow with hot-reloadable schemas and per-worker audit logs, showing how process isolation avoids the shared-state hazards of free-threaded Python:
- prepare (beam) -- reads the current validation schema from ETS and packages each row for Python validation, including the schema version
- validate (python) -- validates rows against the schema using csv_transformer.validate_batch; records each decision in a per-worker audit log
- transform (beam) -- applies rename, cast, and default rules to valid rows; passes invalid rows through unchanged
- route (router) -- sends invalid rows to :invalid and all remaining valid rows to :default
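The validate stage above can be sketched as follows. This is an illustrative outline only, not the actual csv_transformer.validate_batch API: the per-field rule format and the plain-list audit log are assumptions for the example.

```python
# Sketch of a schema-driven batch validator; the real csv_transformer
# may use a different rule format and audit-log structure.

def validate_row(row, schema):
    """Return (is_valid, errors) for one row against a schema of field rules."""
    errors = []
    for field, rules in schema["fields"].items():
        value = row.get(field)
        if value is None or value == "":
            if rules.get("required"):
                errors.append(f"{field}: missing required field")
            continue
        if rules.get("type") == "int":
            try:
                value = int(value)
            except ValueError:
                errors.append(f"{field}: not an integer")
                continue
            if "min" in rules and value < rules["min"]:
                errors.append(f"{field}: below minimum {rules['min']}")
    return (not errors, errors)

def validate_batch(rows, schema, audit_log):
    """Validate each row, appending one audit entry per decision."""
    results = []
    for row in rows:
        ok, errors = validate_row(row, schema)
        audit_log.append({"schema_version": schema["version"],
                          "valid": ok, "errors": errors})
        results.append({"row": row, "valid": ok, "errors": errors})
    return results
```

Each audit entry carries the schema version from the prepared payload, which is what lets the demo attribute a decision to v1 or v2 after the hot-reload.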
After both batches, the demo queries per-worker audit stats from Python to prove that each worker maintained a consistent, uncorrupted audit log in its own process -- something free-threaded Python cannot guarantee when multiple threads share the same list and counter.
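The consistency being checked can be sketched as an invariant on a per-worker log. The AuditLog class below is hypothetical (the demo's internal structure may differ); the point is that a log owned by exactly one process never needs locks, whereas free-threaded Python threads sharing one list and counter can interleave the append and the increment and break the invariant.

```python
class AuditLog:
    """Per-worker audit log: owned by exactly one process, so no locks needed."""

    def __init__(self, worker_id):
        self.worker_id = worker_id
        self.entries = []
        self.valid_count = 0
        self.invalid_count = 0

    def record(self, schema_version, valid):
        # In a single-owner process these two mutations can never interleave
        # with another writer; under shared-memory threads they could.
        self.entries.append({"schema_version": schema_version, "valid": valid})
        if valid:
            self.valid_count += 1
        else:
            self.invalid_count += 1

    def stats(self):
        # Consistency invariant: counters always match the entry list.
        assert self.valid_count + self.invalid_count == len(self.entries)
        return {"worker": self.worker_id,
                "total": len(self.entries),
                "valid": self.valid_count,
                "invalid": self.invalid_count}
```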
The demo shows schema hot-reload: batch 1 runs against v1 (lenient), then the schema is swapped to v2 (stricter) via a serialized store write, and batch 2 demonstrates that previously-valid rows now fail.
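A minimal sketch of the hot-reload effect, with made-up v1/v2 schemas (the demo's actual schemas and field names are not shown here): the same row passes the lenient v1 rules and fails v2's stricter constraint.

```python
# Hypothetical schemas: v1 treats age as optional, v2 requires age >= 18.
SCHEMA_V1 = {"version": 1, "fields": {"age": {"type": "int"}}}
SCHEMA_V2 = {"version": 2, "fields": {"age": {"type": "int",
                                              "required": True, "min": 18}}}

def check(row, schema):
    """Validate just the 'age' field of a row against one schema."""
    rules = schema["fields"]["age"]
    value = row.get("age")
    if value in (None, ""):
        return not rules.get("required")
    try:
        value = int(value)
    except ValueError:
        return False
    return value >= rules.get("min", 0)

row = {"name": "kim", "age": "16"}
assert check(row, SCHEMA_V1)        # valid under lenient v1
assert not check(row, SCHEMA_V2)    # fails v2's min-age constraint
```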
Summary
Functions
Apply transformation rules to valid rows; pass invalid rows through.
Package a raw row with the current schema for Python validation.
Run the data ETL demo, printing formatted results to stdout.
Functions
Apply transformation rules to valid rows; pass invalid rows through.
Valid rows get rename, cast, and default rules applied in Elixir.
Invalid rows retain their validation result payload unchanged so
the router can send them to the :invalid output.
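The rename/cast/default rules can be illustrated with the sketch below. The demo applies these rules in Elixir; this Python version, with an invented rules format, just shows the intended semantics.

```python
def transform(row, rules):
    """Apply rename, then cast, then default rules to one row (illustrative)."""
    out = dict(row)
    # rename: move values to their new field names
    for old, new in rules.get("rename", {}).items():
        if old in out:
            out[new] = out.pop(old)
    # cast: coerce field values with the given callables
    for field, caster in rules.get("cast", {}).items():
        if field in out:
            out[field] = caster(out[field])
    # default: fill in missing fields only
    for field, default in rules.get("default", {}).items():
        out.setdefault(field, default)
    return out

rules = {"rename": {"fullname": "name"},
         "cast": {"age": int},
         "default": {"country": "unknown"}}
transform({"fullname": "Ada", "age": "36"}, rules)
# equals {"name": "Ada", "age": 36, "country": "unknown"}
```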
Package a raw row with the current schema for Python validation.
Reads the active schema from the ETS-backed store (lock-free) and
returns the map expected by csv_transformer.validate_batch.
Includes the schema version so the Python audit log can track which
version each row was validated against.
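The shape of the prepared payload might look like the sketch below; the field names are illustrative, not the demo's actual wire format.

```python
def prepare(row, schema):
    """Package one raw row with the active schema for Python validation (sketch)."""
    return {
        "schema_version": schema["version"],  # lets the audit log track versions
        "schema": schema,                     # active schema as read from ETS
        "row": row,
    }
```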
Run the data ETL demo, printing formatted results to stdout.
Demonstrates:
- Batch validation against v1 schema (lenient) with 16 rows
- Hot-reload of the schema to v2 (stricter constraints)
- Re-validation of the same 16 rows, showing rows that now fail under v2
- Per-worker audit stats proving process isolation keeps logs consistent
- Why this matters vs. free-threaded Python