Skip to content

Instantly share code, notes, and snippets.

@AnasAlmasri
Last active August 14, 2019 12:45
Show Gist options
  • Save AnasAlmasri/f86a933a1bc4d46d3f309340c74e8ac7 to your computer and use it in GitHub Desktop.
Save AnasAlmasri/f86a933a1bc4d46d3f309340c74e8ac7 to your computer and use it in GitHub Desktop.
preprocesing all tweets
tweetProcessor = PreProcessTweets()
preprocessedTrainingSet = tweetProcessor.processTweets(trainingData)
preprocessedTestSet = tweetProcessor.processTweets(testDataSet)
@CryptoPynchon
Copy link

Hello Anas!

trainingData is generated by this line:

trainingData = buildTrainingSet(corpusFile, tweetDataFile)

What was the point of saving all the fetched tweets to tweetDataFile if I have to fetch and save them to a new tweetDataFile all over again every time I run buildTrainingSet?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment