How do you get from a $5 computer to a working language model?
We strip away every layer of abstraction and build up from scratch, running and training models on a cluster of Pi Zeros. No PyTorch, no OS, not even a standard library.
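To give a flavor of what "from scratch" means, here is a toy next-token step written against nothing but Python builtins and `math`: a hand-rolled matrix-vector product and softmax standing in for the real bare-metal code. All weights below are made up for illustration; this is a sketch of the idea, not the actual implementation.

```python
import math

def matvec(W, x):
    """Plain matrix-vector product: one layer's worth of work, no libraries."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    s = sum(exps)
    return [e / s for e in exps]

# Toy "model": a 3-token vocabulary, 2-d embeddings, one output layer.
embed = {0: [1.0, 0.0], 1: [0.0, 1.0], 2: [0.5, 0.5]}   # hypothetical weights
W_out = [[0.2, -0.1], [0.4, 0.3], [-0.3, 0.5]]           # hidden -> vocab logits

x = embed[1]                                  # embed the current token
probs = softmax(matvec(W_out, x))             # distribution over next tokens
next_token = max(range(len(probs)), key=probs.__getitem__)
print(next_token)
```

Everything a transformer does reduces to compositions of operations like these; the hard part on a Pi Zero is doing them fast without an OS or allocator underneath.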
A new wave of sparse attention methods promises faster, more expressive transformers. But why does sparsity help, and can we use that understanding to make it work even better?
We investigate the mechanisms behind sparse attention and propose improvements based on what we find.
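One common flavor of sparse attention is easy to state: each query keeps only its top-k highest-scoring keys and masks the rest before the softmax. A minimal numpy sketch of that mechanism (shapes and the `top_k` parameter are illustrative, not taken from the post):

```python
import numpy as np

def sparse_attention(Q, K, V, top_k):
    """Top-k sparse attention: each query attends only to its top_k
    highest-scoring keys; all other scores are masked to -inf."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                  # (n_q, n_k) raw scores
    # Threshold each row at its top_k-th largest score.
    kth = np.sort(scores, axis=-1)[:, -top_k][:, None]
    masked = np.where(scores >= kth, scores, -np.inf)
    # Softmax over the surviving scores only.
    weights = np.exp(masked - masked.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(4, 8)), rng.normal(size=(6, 8)), rng.normal(size=(6, 8))
out = sparse_attention(Q, K, V, top_k=2)
print(out.shape)  # (4, 8): each output row mixes at most 2 value vectors
```

The question the post digs into is why restricting attention this way can help rather than hurt.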
Training interpreter models on frontier LLMs requires collecting billions of activations (the model's "mental state" at each step). At scale, storing these becomes prohibitively expensive.
I built and open-sourced Activault to dramatically reduce these costs.
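The scale of the storage problem is easy to see, and one standard mitigation (shown here as a generic sketch, not necessarily how Activault itself works) is to downcast activations and serialize them in compressed shards:

```python
import io
import numpy as np

def pack_shard(activations: np.ndarray) -> bytes:
    """Downcast fp32 activations to fp16 and serialize as a compressed
    .npz blob: halves storage outright, and compression helps further
    on real (non-random) activations."""
    buf = io.BytesIO()
    np.savez_compressed(buf, acts=activations.astype(np.float16))
    return buf.getvalue()

# Fake "activations": 4096-d vectors for 256 tokens of one layer.
acts = np.random.default_rng(0).normal(size=(256, 4096)).astype(np.float32)
shard = pack_shard(acts)
print(f"raw fp32: {acts.nbytes} bytes, packed: {len(shard)} bytes")
```

At billions of activations, even a 2x reduction per shard is the difference between a feasible and an infeasible storage bill.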
Sparse autoencoders can decode what a language model is "thinking," but can that understanding actually be used to improve model behavior on real tasks?
We show SAE-based steering outperforms classical baselines on code generation, and release Sieve, a pipeline for applying SAEs for fine-grained control.
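Mechanically, SAE-based steering amounts to: encode an activation into sparse features, pick the feature you care about, and add its decoder direction back into the residual stream. A toy numpy sketch with made-up weights; the real Sieve pipeline is more involved, and all names here are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_sae = 16, 64

# Hypothetical SAE weights: encoder, decoder, encoder bias.
W_enc = rng.normal(0, 0.1, size=(d_model, d_sae))
W_dec = rng.normal(0, 0.1, size=(d_sae, d_model))
b_enc = np.zeros(d_sae)

def steer(activation, feature_idx, strength):
    """Encode to sparse features with a ReLU, then steer by adding the
    chosen feature's decoder direction, scaled, to the activation."""
    feats = np.maximum(activation @ W_enc + b_enc, 0.0)  # sparse feature vector
    delta = strength * W_dec[feature_idx]                # feature's write direction
    return activation + delta, feats

act = rng.normal(size=d_model)
steered, feats = steer(act, feature_idx=3, strength=5.0)
print(np.linalg.norm(steered - act))  # nonzero: moved along feature 3's direction
```

Because each decoder row is (ideally) a single interpretable direction, this gives much finer-grained control than coarse baselines like prompt edits or whole-layer activation additions.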
As part of our launch of Tilde Research, Tina and I built Stargazer, where you can explore the internals of a Llama model, powered by one of our interpreter models.
When you submit a prompt, each word the model outputs unfolds into a night sky of constellations: every star is a feature the model activated while generating that word, exposing the concepts it drew on before committing to the token.