Thoughts on software engineering, AI, side projects, and the things I learn along the way.
Three weeks running Claude 24/7 taught me how to make coding agents actually work: verification loops, team standards, and the right tooling setup.
Scattered thoughts on the nature of trust in LLMs, how context gives meaning, and language as a facilitator of senses.
Striving for the best in the age of generative AI, and preparing for an unknown future.
Notes on the DeepSeek-R1 paper: how pure reinforcement learning with GRPO enables emergent reasoning in LLMs without supervised fine-tuning data.