- simplify cycle counting so it's easier to understand
- make the time it takes to emulate 100kcycles more regular
Memory access has inconsistent costs currently. When decoding instructions it costs cycles, but things like [A] don't. For both a "real" DCPU and the emulator (assuming typical cache behavior), memory accesses are more expensive than register accesses.