Recent Posts

GRPO Fine Tuning on Llama 3.1 8B

9 minute read

This is the second iteration of this blog post. I wrote the entire thing up, and didnt actually save the bed i had put it into. It must have gotten closed of...

Agents and Workflows

4 minute read

A few months ago I met a friend for drinks at the pub. While waiting, I dove into this Anthropic article on agents. I took considerable notes because a, I fo...

Agency & Gumption

6 minute read

Everyone on the internet is a fucking retard except me. And it has bothered me so much the last few months as I try and fit my mental models to the incorrect...