If you are comfortable with Python or JS, I would call the YouTube Data API. They have native APIs for this, plus a bunch of other useful data. It's also very ChatGPT friendly.
I manage some via their Captions API unless there is some nuance that I'm missing. We pay for transcription services or use Whisper and then upload them via the Captions API.
I looked into it, but it doesn't seem like they have transcript available. They only have captions which I'm not so sure are accurate. Are the captions as accurate as in the case of uploading a audio extract to whisper?
The quality varies by the video. The only difference between the two is that captions include timestamps with the transcriptions.
I'm glad you found a working solution. Check that API out if the pypi project ever goes out on you. It should be quick to get it back up and running with it.
If you are comfortable with Python or JS, I would call the YouTube Data API. They have native APIs for this, plus a bunch of other useful data. It's also very ChatGPT friendly.
They don't offer transcripts via their API unfortunately.
I manage some via their Captions API unless there is some nuance that I'm missing. We pay for transcription services or use Whisper and then upload them via the Captions API.
developers.google.com/youtube…
Well, I didn't know that and I spoke too fast!
I looked into it, but it doesn't seem like they have transcript available. They only have captions which I'm not so sure are accurate. Are the captions as accurate as in the case of uploading a audio extract to whisper?
The quality varies by the video. The only difference between the two is that captions include timestamps with the transcriptions.
I'm glad you found a working solution. Check that API out if the pypi project ever goes out on you. It should be quick to get it back up and running with it.
Appreciate it Jeff! You're amazing!