Video file to subtitles file

MRAB python at mrabarnett.plus.com
Sun Aug 30 11:25:44 EDT 2020


On 2020-08-30 07:23, Muskan Sanghai wrote:
> On Sunday, August 30, 2020 at 11:46:15 AM UTC+5:30, Chris Angelico wrote:
>> On Sun, Aug 30, 2020 at 4:11 PM Muskan Sanghai <muskan... at gmail.com> wrote: 
>> > 
>> > On Sunday, August 30, 2020 at 10:57:00 AM UTC+5:30, Christian Gollwitzer wrote: 
>> > > Am 29.08.20 um 13:51 schrieb Muskan Sanghai: 
>> > > > I want to extract subtitles from a MPEG video (which does not have any previous subtitles) 
>> > > I'm still not sure I get it. "Extract" subtitles, when they are NOT 
>> > > there? Can it be, by any chance, that you are talking about speech 
>> > > recognition? I.e., you want a software which understands the spoken word 
>> > > in the movie sound and turns that into text, which can be shown as 
>> > > subtitles? Like the "auto-generated" subtitles which youtube offers for 
>> > > some videos. 
>> > > 
>> > > If so, it is a complex task and will not work overly well. I defer to 
>> > > the experts if there are any usable speech recognitino engines for this 
>> > > task. 
>> > > 
>> > > Christian 
>> > Yes, this is what I exactly want to do. I want to create a software which understands the spoken word in the movie sound and turns that into text. 
>> >
>> I recommend looking into CMU Sphinx then. I've used that from Python. 
>> 
>> The results are highly entertaining. 
>> 
>> ChrisA
> Okay I will try it, thank you.
> 
Speech recognition works best when there's a single voice, speaking 
clearly, with little or no background noise. Movies tend not to be like 
that.

Which is why the results are "highly entertaining"...


More information about the Python-list mailing list