A self-hosted Claude assistant on a Raspberry Pi, with compounding memory via a knowledge graph and synthesised wiki pages.
Notes
Writing on data engineering, AI in production, and things I've learned the hard way.
7 posts
Three problems I keep seeing when teams migrate from Databricks to dbt (SQL syntax, schema references, and state management), and how to fix each one.
How I migrated 250 BI users from SQL Server 2012 to Snowflake, cutting reporting latency from daily to hourly and bringing the platform into GDPR compliance.
How I migrated 20M+ records from Databricks to Snowflake, cutting monthly infrastructure costs by 40% and fixing years of accumulated governance debt.
Three patterns that derail lakehouses in production: the swamp lakehouse, the performance mirage, and the metadata ghost town.
GDPR checklist for RAG systems: data inventory, vector storage, deletion pipelines, and audit trails, tested across three DACH audits.
RAG returns the closest document. ECL returns the reasoning your experts use, cited and conflict-aware. Open source, built on Andy Chen's pattern.