Work

Projects

Production-grade reference architectures and open-source tools for the modern data & AI stack. 9 projects.

Production

data engineering

SQL-First Real-Time Fraud Detection

Production-oriented reference architecture for real-time fraud detection using Redpanda, dbt, RisingWave, and Grafana

RedpandaRisingWavedbtGrafana+1

View details

Production

data engineering

dbt CI/CD Pipeline with Slim CI

A production-ready CI/CD pipeline for dbt projects using GitHub Actions and Slim CI — running only modified models and their downstream dependencies to cut build times and warehouse costs.

dbtGitHub ActionsDatabricksPython+4

View details

Production

dbt Conversation AI Local

A locally-running AI agent powered by Ollama and the dbt MCP server — answering questions about your semantic layer without sending data to the cloud.

dbt Data Contracts

A framework for defining, enforcing, and monitoring data contracts across a dbt project — guaranteeing that data producers keep their promises to consumers at build time.

dbtPythonDatabricksYAML

View details

Development

data engineering

dbt Docker + Airflow

A containerised dbt + Airflow setup using Docker Compose — orchestrating dbt runs as Airflow DAGs with isolated environments, centralised logging, and easy local development.

dbtDockerAirflowPython

View details

Production

data engineering

Modern Data Stack: DuckDB Semantic Layer

A local-first Modern Data Stack reference using dlt for ingestion, dbt for transformation, and DuckDB as the engine — with a MetricFlow semantic layer and Rill dashboards.

dbtDuckDBdltMetricFlow+2

View details

Production

data engineering

Modern Data Stack: Databricks Semantic Layer

Production-grade Modern Data Stack reference on Databricks Unity Catalog — Medallion architecture with dlt ingestion, dbt transformation, MetricFlow metrics, and GitHub Actions CI/CD.

dbtDatabricksdltMetricFlow+3

View details

Production

data engineering

dbt Synthetic Data Generator

An automated star-schema data generator for dbt and DuckDB — producing realistic, referentially-intact dimension and fact tables for testing and development.

PythondbtDuckDBParquet

View details

Production

data engineering

ASX Stock Analysis & ML Pipeline

An automated pipeline that ingests ASX mining stock data via Yahoo Finance, transforms it with dbt, generates 45+ ML features, and surfaces trading signals through Streamlit dashboards.

PythondbtDuckDBdlt+2

View details