Hello, I was looking for a simple, minimal script that does exactly this: parse a VTT file, discard the timecodes, merge chronologically close lines into larger blocks, and write the result to a human-readable text file. For my use case, I prefer the way it merges multiple lines under a coarser-grained timecode.
@glasslion, thanks a lot for sharing this script!
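For anyone curious what that pipeline looks like in outline, here is a minimal sketch. It is not glasslion's actual code. The names `parse_cues`, `merge_cues`, and the `max_gap` threshold of 2 seconds are my own assumptions for illustration, and this version drops the timecodes entirely once the cues are grouped.

```python
import re

# Matches a WebVTT cue timing line such as "00:00:01.000 --> 00:00:03.500".
# The hours field is optional in WebVTT.
TIMING = re.compile(
    r"((?:\d+:)?\d{2}:\d{2}\.\d{3})\s+-->\s+((?:\d+:)?\d{2}:\d{2}\.\d{3})"
)


def to_seconds(stamp):
    """Convert "HH:MM:SS.mmm" or "MM:SS.mmm" to seconds."""
    seconds = 0.0
    for part in stamp.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def parse_cues(vtt_text):
    """Return a list of (start, end, text) tuples, one per cue."""
    cues = []
    # Cues are separated by blank lines; the "WEBVTT" header block has no
    # timing line, so it is skipped automatically.
    for block in re.split(r"\n\s*\n", vtt_text.strip()):
        lines = block.splitlines()
        for i, line in enumerate(lines):
            match = TIMING.search(line)
            if match:
                text = " ".join(l.strip() for l in lines[i + 1:] if l.strip())
                # Drop inline markup such as <c> or <00:00:01.500> tags.
                text = re.sub(r"<[^>]+>", "", text)
                if text:
                    cues.append((to_seconds(match.group(1)),
                                 to_seconds(match.group(2)), text))
                break
    return cues


def merge_cues(cues, max_gap=2.0):
    """Join consecutive cues into one paragraph while the gap stays small.

    max_gap is a hypothetical threshold in seconds: a pause longer than this
    between one cue's end and the next cue's start begins a new paragraph.
    """
    paragraphs = []
    current = []
    last_end = None
    for start, end, text in cues:
        if current and start - last_end > max_gap:
            paragraphs.append(" ".join(current))
            current = []
        current.append(text)
        last_end = end
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs
```

To write a transcript, you would read the `.vtt` file into a string, call `merge_cues(parse_cues(text))`, and join the resulting paragraphs with blank lines. Raising `max_gap` gives fewer, longer blocks.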
vtt2text.py is a nice little script by glasslion that I just found. It seems to do what I am looking for: convert a subtitle file, even closed-captioning "roll-up" style WebVTT files like mine, into a human-friendly full-page transcript.
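A note on why roll-up captions need special handling: each cue typically repeats the line that was already on screen and adds one new line, so a naive dump prints most lines twice. Below is a rough sketch of one way to deal with that. It is not taken from vtt2text.py, and the function name `dedupe_rollup` is hypothetical. It assumes you already have each cue's text split into its individual lines.

```python
def dedupe_rollup(cue_lines):
    """Drop lines that were already shown in the immediately preceding cue.

    cue_lines is a list of lists of strings, one inner list per cue.
    Caveat: a line that is genuinely spoken twice in a row, in two
    consecutive cues, is also dropped by this simple rule.
    """
    kept = []
    previous = []
    for lines in cue_lines:
        for line in lines:
            if line not in previous:
                kept.append(line)
        previous = lines
    return kept
```

For example, `dedupe_rollup([["a"], ["a", "b"], ["b", "c"]])` keeps each of the three lines once, in order.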