In one embodiment, a method and a system for generating custom video content are disclosed. The method comprises receiving a source video, wherein the source video contains a plurality of frames; transcribing audio of the source video to generate transcribed text, wherein the transcribed text correspo...
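The first two steps named above (receiving a source video and transcribing its audio) might be modeled as in the following minimal sketch. It is an illustration under stated assumptions, not the disclosed implementation: `Frame`, `SourceVideo`, `receive_source_video`, `transcribe_audio`, and `fake_speech_to_text` are hypothetical names, the byte-string representations of frames and audio are placeholders, and the speech-to-text engine is passed in as a callable because the text does not name one.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Tuple


@dataclass(frozen=True)
class Frame:
    """One frame of the source video (hypothetical representation)."""
    index: int     # position of the frame within the video
    pixels: bytes  # placeholder for the raw image data


@dataclass(frozen=True)
class SourceVideo:
    """A received source video: its frames plus its audio track."""
    frames: Tuple[Frame, ...]
    audio: bytes   # placeholder for the raw audio track


def receive_source_video(frames: Iterable[Frame], audio: bytes) -> SourceVideo:
    """Accept a source video, requiring a plurality (two or more) of frames."""
    frame_tuple = tuple(frames)
    if len(frame_tuple) < 2:
        raise ValueError("a source video must contain a plurality of frames")
    return SourceVideo(frames=frame_tuple, audio=audio)


def transcribe_audio(video: SourceVideo,
                     speech_to_text: Callable[[bytes], str]) -> str:
    """Generate transcribed text from the video's audio.

    The engine is injected (an assumption of this sketch) so that any
    speech-to-text backend can be substituted.
    """
    return speech_to_text(video.audio).strip()


def fake_speech_to_text(audio: bytes) -> str:
    """Stand-in engine for demonstration: treats the bytes as UTF-8 text."""
    return audio.decode("utf-8")
```

A caller would build the video with `receive_source_video(...)` and then pass it, together with a real speech-to-text callable in place of `fake_speech_to_text`, to `transcribe_audio`.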