To get the transcripts I got them in the MS Stream website. The transcript was extracted when inspecting the file downloads in the network tab. You need to filter the network tab to list the paths that contain "transcript" and select the largest one. The response will contain the JSON which was copied to this directory in the form of transcript.MMMDD.json
.
Make sure you have Node.JS in path and then run the script:
node convertTranscript.js
The script will sort the entries in each transcript file and will write the content to Mardown file. The output will be in the form of transcript.MMMDD.json.md
.
You will need to install the depenencies first:
npm install
Then copy your OpenAI API key to the .env
file so that it looks like this:
OPENAI_API_KEY=your_api_key_here
Then run the script:
node --env-file=.env summariseTranscript.js