Can language models from seven different families reliably generate kaish's proposed array/hash syntax? We tested. This is the scorecard, the journey behind it, and how to reproduce it.
Companion to designing-syntax-with-llms.md (the
methodology) and arrays-and-hashes.md (the design the
evals shaped).