Because the AI has to do the work of translating the image to text before deciding what to do with it

  • @[email protected]
    link
    fedilink
    121 day ago

    So start normalizing using ffmpeg to type in whatever you want to say, and render it as a video with just static text on white background to make it even more expensive?