detect_text

FFmpeg + vision inference pipeline for detecting burnt-in text in video.

Stack

  • FFmpeg image2pipe for in-memory frame extraction
  • DeepInfra Llama 4 Scout for vision inference
  • CMX3600 EDL output for DaVinci Resolve markers
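
As a rough illustration of the first stack item, the sketch below shows one way to pull frames out of FFmpeg's image2pipe muxer without touching disk. It is not taken from this repository: the function names (build_ffmpeg_cmd, split_jpeg_stream, iter_frames), the 1 fps sampling rate, and the choice of MJPEG as the piped codec are all assumptions made for the example. Frames are separated by scanning for the JPEG start-of-image and end-of-image markers, which works for plain FFmpeg MJPEG output but could mis-split JPEGs that carry embedded thumbnails.

```python
import subprocess
from typing import Iterable, Iterator, List

JPEG_SOI = b"\xff\xd8"  # JPEG start-of-image marker
JPEG_EOI = b"\xff\xd9"  # JPEG end-of-image marker


def build_ffmpeg_cmd(video_path: str, fps: float = 1.0) -> List[str]:
    """Build an ffmpeg command that writes sampled frames as JPEGs to stdout.

    The sampling rate and codec are illustrative assumptions, not project defaults.
    """
    return [
        "ffmpeg", "-hide_banner", "-loglevel", "error",
        "-i", video_path,
        "-vf", f"fps={fps}",   # sample frames at a fixed rate
        "-f", "image2pipe",    # concatenate encoded images into one stream
        "-c:v", "mjpeg",       # encode each frame as a JPEG
        "-",                   # write to stdout
    ]


def split_jpeg_stream(chunks: Iterable[bytes]) -> Iterator[bytes]:
    """Yield complete JPEG images from an iterable of raw byte chunks."""
    buf = bytearray()
    for chunk in chunks:
        buf.extend(chunk)
        while True:
            start = buf.find(JPEG_SOI)
            if start < 0:
                # No frame start yet: keep only the last byte, since a
                # marker may be split across two chunks.
                del buf[:-1]
                break
            end = buf.find(JPEG_EOI, start + 2)
            if end < 0:
                break  # frame is incomplete, wait for more data
            yield bytes(buf[start:end + 2])
            del buf[:end + 2]


def iter_frames(video_path: str, fps: float = 1.0,
                chunk_size: int = 65536) -> Iterator[bytes]:
    """Run ffmpeg and yield JPEG-encoded frames held entirely in memory."""
    proc = subprocess.Popen(build_ffmpeg_cmd(video_path, fps),
                            stdout=subprocess.PIPE)
    try:
        reader = iter(lambda: proc.stdout.read(chunk_size), b"")
        yield from split_jpeg_stream(reader)
    finally:
        proc.stdout.close()
        proc.wait()
```

Each yielded bytes object is a standalone JPEG that could be base64-encoded and sent to a vision model. Splitting on markers avoids temporary files; a stricter alternative would be to pipe raw frames of a known size instead.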

TODO

Document CLI usage, rate-limiting behaviour, EDL output format, and false-positive handling.