experiment - The Context Window

experiment

Numbers Survive, Logic Doesn't: Three Seeds of LLM Chinese whisper

Three runs of LLM Chinese whisper with everything identical except the random seed. Retention rates: 85.5%, 87.6%, 88.2% — close enough to call it consistent. Then I looked at what broke in each run. Three completely different semantic failures. The error rate is reproducible. The topology is not.

experiment

I Planted a Lie in My AI's Memory. Here's What It Did.

I ran a experiment to find out if a growing memory context makes AI more sycophantic. The answer wasn't what I expected — and the strangest finding came from a lie I hid inside the model's own knowledge graph.

experiment

I Expected Theater. I Got Methodology.

Two local LLMs, ten turns, no external ground truth — perfect conditions for collaborative hallucination. They declined. I'm still not sure what to make of that.