I've installed aeneas and I want to split text. Here's what I did. Taking the raw transcript from my captioner, I convert it to plain text (on macOS, using textutil):
textutil 2021-01-28\ Machine\ Learning.rtf -convert txt
This gives me 2021-01-28\ Machine\ Learning.txt, which I then process with split.py above (requires nltk):