@Barry1
Last active March 30, 2021 21:05
My wordcloud experience
to: pdf
title: My WordCloud Experience
subtitle: HowTo
author: Dr. Bastian Ebeling
date: 29th March 2021
lang: de-DE
smart: true
documentclass: scrartcl
colorlinks: true
hyperrefoptions: linktoc=all, pdfwindowui

References

The following ideas and experiences are based on the source tds, which is also available under tds_archive. As a native German speaker, I also read a German-language source.

Major steps

The following steps are based on the Python package wordcloud. You can find its documentation here.

Prepare text

First, extract your text from the source.
You will almost certainly want to remove so-called stopwords: words with high frequency but almost no meaning, such as a, we, you, me, and, or, how.
For doing this in Python, I know of the following three options.
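
As an illustration of the idea, here is a minimal sketch of stopword removal that uses only the Python standard library. It is not necessarily one of the options meant above. The stopword set is a tiny, hypothetical example, and the function name `remove_stopwords` is an assumption made for this sketch. Real stopword lists are much longer and language-specific.

```python
import re

# Hypothetical, deliberately tiny stopword set for illustration only.
STOPWORDS = {"a", "we", "you", "me", "and", "or", "how", "the", "to"}


def remove_stopwords(text, stopwords=STOPWORDS):
    """Return the lowercased words of `text` that are not in `stopwords`."""
    # [^\W\d_]+ matches runs of letters (including umlauts), no digits or "_".
    words = re.findall(r"[^\W\d_]+", text.lower())
    return [word for word in words if word not in stopwords]


cleaned = remove_stopwords("How you and me build a word cloud")
print(cleaned)  # ['build', 'word', 'cloud']
```

The cleaned word list can then be joined back into a string or counted, depending on what the next step expects.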

Optional step: prepare mask

If you want a non-rectangular shape, you need to supply a mask that defines the shape to use.
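
To make the idea of a mask concrete, here is a minimal sketch that builds a circular mask as a plain grid of numbers. It relies on the documented behaviour of the wordcloud package's `mask` parameter, where white entries (255) are masked out and other entries are free to draw on. The helper name `circular_mask` and the grid size are assumptions made for this sketch. In practice, a mask is often loaded from an image file instead.

```python
def circular_mask(size):
    """Return a size x size grid: 0 inside the circle, 255 (white) outside."""
    center = (size - 1) / 2
    radius = size / 2
    return [
        [0 if (x - center) ** 2 + (y - center) ** 2 <= radius ** 2 else 255
         for x in range(size)]
        for y in range(size)
    ]


mask_rows = circular_mask(9)
# Print a small preview: "#" is drawable area, "." is masked out.
for row in mask_rows:
    print("".join("#" if value == 0 else "." for value in row))
```

With NumPy installed, such a grid could be converted with `numpy.array(mask_rows, dtype=numpy.uint8)` before being passed as `mask`. A real mask would be far larger than 9 x 9 pixels.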

Build wordcloud

https://towardsdatascience.com/simple-wordcloud-in-python-2ae54a9f58e5
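
A minimal sketch of this step, assuming the wordcloud package is installed and that you already have a list of cleaned words. The helper names `word_frequencies` and `render_wordcloud`, the image size, and the background colour are assumptions chosen for illustration. `WordCloud`, `generate_from_frequencies` and `to_file` are part of the wordcloud package.

```python
from collections import Counter


def word_frequencies(words):
    """Count how often each word occurs and return a plain dict."""
    return dict(Counter(words))


def render_wordcloud(frequencies, path, mask=None):
    """Render the frequencies as a word cloud image saved at `path`."""
    # Imported here so the counting step also works without the package.
    from wordcloud import WordCloud

    cloud = WordCloud(width=800, height=400,
                      background_color="white", mask=mask)
    cloud.generate_from_frequencies(frequencies)
    cloud.to_file(path)


frequencies = word_frequencies(["cloud", "word", "cloud", "python"])
print(frequencies)  # {'cloud': 2, 'word': 1, 'python': 1}
```

Calling `render_wordcloud(frequencies, "wordcloud.png")` would then write the image, where the file name is only a placeholder. Words with a higher count are drawn larger.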

PowerPoint

For using word clouds in PowerPoint presentations, you can read the German article 4 einfache Wege, um eine Wortwolke in PowerPoint zu generieren ("4 simple ways to generate a word cloud in PowerPoint").

Online Tools

Further reading

German Reading
