@myui
Created December 15, 2016 11:38
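-- Per-document term frequency: one row per (docid, word).
-- tf() is assumed here to be Hivemall's term-frequency UDAF, which
-- returns a map of word -> frequency for each group.
WITH term_frequency AS (
  SELECT
    docid,
    word,
    freq
  FROM (
    SELECT
      docid,
      tf(word) AS word2freq
    FROM
      page_exploded
    GROUP BY
      docid
  ) t
  -- Flatten the word -> frequency map into individual rows.
  LATERAL VIEW explode(word2freq) t2 AS word, freq
),
-- Document frequency: number of distinct documents containing each word.
document_frequency AS (
  SELECT
    word,
    count(DISTINCT docid) AS docs
  FROM
    page_exploded
  GROUP BY
    word
),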