Do all these problems without the aid of a computer, except where required to look up character names, character code points, and emoji data. The purpose of these exercises is for you to develop skills. If you spend the time to practice with pencil and paper (or a whiteboard) you will learn the material much better.
To submit your answers, copy the text below into a secret gist on GitHub and fill in the answers at the end of the same line. Email or DM me the URL of the secret gist.
Make sure your Gist filename is CMSI284S2018HW2.md.
Your submission will be autograded, so it is imperative that your answers are formatted as expected. All encoded values should be in hex, not binary. Always use the proper number of bits; for example, UTF-32 answers must always show 8 hex digits, and characters should use the U+ prefix. MAKE SURE YOUR GIST HAS EXACTLY 50 LINES, NO MORE, NO LESS. Answers that are "correct" but not in the proper format will receive zero points, since answers that follow instructions are better than those that do not. Again, make sure your gist has exactly 50 lines, or the autograder may get confused and you very well may end up with zero points. Don't risk it.
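If you want to sanity-check the line count before submitting, something like the following sketch may help. The function names are made up for illustration, and it assumes that a trailing newline does not count as an extra line; how the autograder itself counts lines is not stated here.

```python
def gist_line_count(text: str) -> int:
    """Count lines in the gist body; a final trailing newline adds no line."""
    return len(text.splitlines())


def has_expected_length(text: str, expected: int = 50) -> bool:
    """Return True when the gist body has exactly `expected` lines."""
    return gist_line_count(text) == expected


# Hypothetical stand-in for a pasted gist: 50 answer lines plus a final newline.
sample = "\n".join(f"- Question {i} is: N/A" for i in range(1, 51)) + "\n"
print(gist_line_count(sample), has_expected_length(sample))
```

Paste your own gist text in place of `sample` to check it.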
If you see a question that asks for an encoding of a character that does not exist within the character set in question, answer with the three symbols: N/A
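As a rough illustration of the N/A rule (for checking your hand work afterwards, not replacing it), here is a minimal sketch. The helper name `encode_or_na` is hypothetical, and it assumes Python's `cp1252` codec is an acceptable stand-in for Windows-1252.

```python
def encode_or_na(ch: str, codec: str) -> str:
    """Return the uppercase hex encoding of ch, or N/A if the codec lacks it."""
    try:
        return ch.encode(codec).hex().upper()
    except UnicodeEncodeError:
        # The character does not exist in this character set.
        return "N/A"


fire_engine = "\U0001F692"  # FIRE ENGINE, the character used in the examples
print(encode_or_na(fire_engine, "utf-8"))
print(encode_or_na(fire_engine, "cp1252"))
```

FIRE ENGINE has a UTF-8 encoding but no Windows-1252 encoding, so the second call falls through to the N/A branch.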
Here are some example answers that illustrate the expected formats:
- The characters that make up a woman judge emoji are: U+1F469 U+200D U+2696 U+FE0F
- The characters that make up a medium-skin-tone woman mechanic emoji are: U+1F469 U+1F3FD U+200D U+1F527
- The encoding of character FIRE ENGINE in UTF-32BE is: 0001F692
- The encoding of character FIRE ENGINE in UTF-16LE is: 3DD892DE
- The encoding of character FIRE ENGINE in UTF-8 is: F09F9A92
- The NAME of the character whose UTF-8 encoding is E38084 is: JAPANESE INDUSTRIAL STANDARD SYMBOL
Note: It is perfectly okay to use Emojipedia. You probably should, to make sure you get the variation selectors right.
Another note: Pay attention to BE and LE!
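The example formats above can be reproduced with a short sketch like the one below, which may be handy for checking your pencil-and-paper work once you are done. The helper names `u_plus` and `hex_encoding` are invented for illustration; the codec names are Python's standard ones, and the explicit BE and LE variants do not emit a byte order mark.

```python
import unicodedata


def u_plus(text: str) -> str:
    """Render each code point in U+ notation, separated by single spaces."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


def hex_encoding(ch: str, codec: str) -> str:
    """Return the encoded bytes as uppercase hex with no separators."""
    return ch.encode(codec).hex().upper()


# Encodings of the example character in the three example formats.
fire_engine = unicodedata.lookup("FIRE ENGINE")
for codec in ("utf-32-be", "utf-16-le", "utf-8"):
    print(codec, hex_encoding(fire_engine, codec))

# Going the other way: from UTF-8 bytes back to a character name.
print(unicodedata.name(bytes.fromhex("E38084").decode("utf-8")))

# An emoji sequence written out as its individual code points.
woman_judge = "\U0001F469\u200D\u2696\uFE0F"
print(u_plus(woman_judge))
```

Note that in UTF-16LE the bytes are swapped within each 16-bit code unit, but the two surrogates stay in the same order.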
- The characters that make up a medium-dark skin tone man vampire are:
- The characters that make up a dark skin tone woman playing handball are:
- The characters that make up the flag of Morocco are:
- The characters that make up the flag of El Salvador are:
- The characters that make up the rainbow flag emoji are:
- The characters that make up a medium-light skin tone oncoming fist are:
- The characters that make up a Latin small letter n with a caron are:
- The characters that make up a Cyrillic small letter Э with two dots, placed vertically above, are:
- The encoding of BACKSPACE, in UTF-32BE, is:
- The encoding of BACKSPACE, in UTF-16BE, is:
- The encoding of BACKSPACE, in UTF-8, is:
- The encoding of ARABIC PERCENT SIGN, in UTF-32BE, is:
- The encoding of ARABIC PERCENT SIGN, in UTF-16LE, is:
- The encoding of ARABIC PERCENT SIGN, in UTF-8, is:
- The encoding of DEGREE SIGN, in UTF-32LE, is:
- The encoding of DEGREE SIGN, in UTF-16BE, is:
- The encoding of DEGREE SIGN, in UTF-8, is:
- The encoding of COFFIN, in UTF-32BE, is:
- The encoding of COFFIN, in UTF-16BE, is:
- The encoding of COFFIN, in UTF-8, is:
- The encoding of CJK UNIFIED IDEOGRAPH-583A, in UTF-32LE, is:
- The encoding of CJK UNIFIED IDEOGRAPH-583A, in UTF-16BE, is:
- The encoding of CJK UNIFIED IDEOGRAPH-583A, in UTF-8, is:
- The encoding of BYZANTINE MUSICAL SYMBOL KRATIMATA, in UTF-32BE, is:
- The encoding of BYZANTINE MUSICAL SYMBOL KRATIMATA, in UTF-16BE, is:
- The encoding of BYZANTINE MUSICAL SYMBOL KRATIMATA, in UTF-8, is:
- The encoding of CHEROKEE LETTER HA, in UTF-32LE, is:
- The encoding of CHEROKEE LETTER HA, in UTF-16LE, is:
- The encoding of CHEROKEE LETTER HA, in UTF-8, is:
- The encoding of TAG ASTERISK, in UTF-32BE, is:
- The encoding of TAG ASTERISK, in UTF-16BE, is:
- The encoding of TAG ASTERISK, in UTF-8, is:
- The encoding of LATIN SMALL LIGATURE OE, in UTF-16BE, is:
- The encoding of LATIN SMALL LIGATURE OE, in UTF-8, is:
- The encoding of LATIN SMALL LIGATURE OE, in Windows-1252, is:
- The encoding of STRING TERMINATOR, in UTF-16BE, is:
- The encoding of STRING TERMINATOR, in UTF-8, is:
- The encoding of STRING TERMINATOR, in Windows-1252, is:
- The NAME of the character whose UTF-16LE encoding is F726 is:
- The NAME of the character whose UTF-32BE encoding is 0000002B is:
- The NAME of the character whose UTF-8 encoding is E38083 is:
- The NAME of the character whose UTF-8 encoding is F09F90BD is:
- The character sequence (in U+ notation) whose UTF-8 encoding is 4BCC9E is:
- The character sequence (in U+ notation) whose UTF-8 encoding is 486920F09F9095 is:
- The character sequence (in U+ notation) whose UTF-8 encoding is E2889EE289A531 is:
- The character sequence with graphemes É=mc² in NFC is:
- The character sequence with graphemes É=mc² in NFD is:
- The character sequence with graphemes É=mc² in NFKD is:
- The character whose grapheme is 腂 when encoded in UTF-32LE and interpreted as an IEEE-754 float, is:
- The character BACKSPACE encoded in UTF-32BE and interpreted as an IEEE-754 float, is 2 to which power: