Skip to content

Instantly share code, notes, and snippets.

@hiepph
Created July 30, 2019 03:15
Show Gist options
  • Save hiepph/8ab99aac9c97d623b858171eeed08da0 to your computer and use it in GitHub Desktop.
Save hiepph/8ab99aac9c97d623b858171eeed08da0 to your computer and use it in GitHub Desktop.
Get unique words
# Get unique words from Vietnamese dictionary
words = File.read!("gungui98.txt")
|> String.split("\n", trim: true)
|> Enum.map(&String.downcase/1)
|> Enum.map(&String.split/1)
|> List.flatten
|> Enum.uniq
|> Enum.sort
|> Enum.join("\n")
chars = words
|> String.codepoints
|> Enum.chunk_every(1)
|> Enum.uniq
|> Enum.sort
|> Enum.join("\n")
File.write!("viet.txt", words)
File.write!("char.txt", chars)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment