@dangra
Created January 25, 2014 02:18
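import encodings.aliases
import lxml.etree

# Try every codec name Python knows; print the ones lxml refuses.
for enc in set(encodings.aliases.aliases.values()):
    try:
        parser = lxml.etree.HTMLParser(recover=True, encoding=enc)
    except LookupError as exc:
        # lxml raises LookupError("unknown encoding: '<name>'")
        print(str(exc))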
unknown encoding: 'bz2_codec'
unknown encoding: 'rot_13'
unknown encoding: 'euc_jisx0213'
unknown encoding: 'hex_codec'
unknown encoding: 'uu_codec'
unknown encoding: 'mbcs'
unknown encoding: 'euc_jis_2004'
unknown encoding: 'iso2022_jp_3'
unknown encoding: 'iso2022_jp_2'
unknown encoding: 'iso2022_jp_1'
unknown encoding: 'iso2022_jp_2004'
unknown encoding: 'quopri_codec'
unknown encoding: 'iso2022_jp'
unknown encoding: 'mac_iceland'
unknown encoding: 'hp_roman8'
unknown encoding: 'iso2022_kr'
unknown encoding: 'euc_kr'
unknown encoding: 'utf_32_be'
unknown encoding: 'utf_16_le'
unknown encoding: 'mac_latin2'
unknown encoding: 'utf_32'
unknown encoding: 'mac_cyrillic'
unknown encoding: 'base64_codec'
unknown encoding: 'ptcp154'
unknown encoding: 'euc_jp'
unknown encoding: 'hz'
unknown encoding: 'utf_8'
unknown encoding: 'utf_32_le'
unknown encoding: 'mac_greek'
unknown encoding: 'utf_7'
unknown encoding: 'mac_turkish'
unknown encoding: 'zlib_codec'
unknown encoding: 'iso8859_9'
unknown encoding: 'iso8859_8'
unknown encoding: 'iso8859_5'
unknown encoding: 'iso8859_4'
unknown encoding: 'iso8859_7'
unknown encoding: 'iso8859_6'
unknown encoding: 'iso8859_3'
unknown encoding: 'iso8859_2'
unknown encoding: 'shift_jis_2004'
unknown encoding: 'mac_roman'
unknown encoding: 'utf_16'
unknown encoding: 'iso8859_15'
unknown encoding: 'iso8859_14'
unknown encoding: 'tis_620'
unknown encoding: 'iso8859_16'
unknown encoding: 'iso8859_11'
unknown encoding: 'iso8859_10'
unknown encoding: 'iso8859_13'
unknown encoding: 'latin_1'
unknown encoding: 'tactis'
unknown encoding: 'koi8_r'
unknown encoding: 'utf_16_be'
unknown encoding: 'iso2022_jp_ext'
unknown encoding: 'cp858'
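
The names rejected above are Python's internal codec spellings (underscored, e.g. 'utf_8'), which are not necessarily the spellings lxml expects. A minimal sketch of one possible workaround, assuming that an alternative spelling of the same codec may be accepted: generate candidate spellings from the standard library and probe each one. The helpers `candidate_names`, `first_accepted` and `lxml_accepts` are hypothetical names introduced for illustration, not part of lxml, and whether any given spelling is accepted depends on the installed libxml2 build.

```python
import codecs
import encodings.aliases


def candidate_names(enc):
    """Yield alternative spellings of a Python codec name, original first."""
    names = [enc, enc.replace("_", "-")]
    try:
        # Python's own normalized name, e.g. 'utf-8' for 'utf_8'.
        names.append(codecs.lookup(enc).name)
    except LookupError:
        pass  # codec not loadable on this platform (e.g. 'mbcs' off Windows)
    # Aliases that Python maps to the same codec, e.g. 'utf8' -> 'utf_8'.
    names += [alias for alias, target in encodings.aliases.aliases.items()
              if target == enc]
    seen = set()
    for name in names:
        if name not in seen:
            seen.add(name)
            yield name


def first_accepted(enc, accepts):
    """Return the first spelling for which accepts(name) is true, else None."""
    for name in candidate_names(enc):
        if accepts(name):
            return name
    return None


def lxml_accepts(name):
    """Probe: True if lxml builds an HTMLParser for this encoding name."""
    import lxml.etree  # imported lazily; the helpers above need only the stdlib
    try:
        lxml.etree.HTMLParser(recover=True, encoding=name)
    except LookupError:
        return False
    return True
```

Calling `first_accepted(enc, lxml_accepts)` for each name in the list above would show which encodings are truly unsupported and which are only spelled differently. Passing the probe in as a parameter keeps the name generation testable without lxml installed.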