In 2006, a (then-14-year-old) me wrote Overhead, a 256-byte intro (which was, ehm, "hEavilY InspiREd" by lander, among other intros). It was, for me, an early and highly rewarding experiment in x86 assembly, and huge thanks are owed to Baze/3SC (author of lander) not just for inspiration, but also including source with their intros (a common practice at the time, less common now unfortunately) and helpful responses to my emails asking how some basic things worked (eg. VGA setup and FPU stack).
In the pouët comments, many people reported that it ran quite a bit slower than intended. It was never blazingly fast (I recall 20-30FPS on my box at the time), but some people were reporting eg. 1-2FPS, which was pretty awful.
In a 2007 comment by Pirx, it was mentioned that writing memory at the beginning of the program segment was likely what was causing it to be so slo