This is ridiculously difficult. This note is an attempt to record my findings.
There is no perfect solution (that I'm aware of), so the following demonstrates different trade-offs. I am writing this on MacOS, which I note as font behaviour can be OS-specific in subtle ways.
In this example I'm using the Twitter Color Emoji font as our external font.