@audhiaprilliant
Last active April 25, 2022 14:46
How to Automatically Build Stopwords
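# Data viz
# Assumes `df` is a pandas DataFrame with 'rank' and 'zipf_freq' columns.
import plotnine
from plotnine import (
    ggplot, aes, geom_line, geom_point, labs, xlab, ylab, theme_minimal
)

# Set the default figure size (width, height) in inches
plotnine.options.figure_size = (10, 4.8)
(
    ggplot(data = df)
    # Line tracing word frequency against rank
    + geom_line(
        aes(x = 'rank', y = 'zipf_freq', group = 1),
        size = 1,
        color = '#981220'
    )
    # Points marking each ranked word
    + geom_point(
        aes(x = 'rank', y = 'zipf_freq', group = 1),
        size = 1.5
    )
    + labs(title = 'Zipf Distribution in English Literature')
    # Axis labels are passed positionally so the call does not depend on the parameter name
    + xlab('Rank')
    + ylab('Frequency of words')
    + theme_minimal()
)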