Caption videos

Use AI to caption videos with an API

Назад к коллекциям

Модели в коллекции

Сортировка: по популярности (run_count)
google/gemini-2.5-flash

Google’s hybrid “thinking” AI model optimized for speed and cost-efficiency

1 679 794 запусков
lucataco/qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

825 746 запусков
chenxwh/cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

672 583 запусков
lucataco/qwen2-vl-7b-instruct

Latest model in the Qwen family for chatting with video and image models

356 851 запусков
shreejalmaharjan-27/tiktok-short-captions

Generate Tiktok-Style Captions powered by Whisper (GPU)

228 089 запусков
lucataco/apollo-7b

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models

124 743 запусков
fictions-ai/autocaption

Automatically add captions to a video

87 714 запусков
lucataco/videollama3-7b

VideoLLaMA 3: Frontier Multimodal Foundation Models for Video Understanding

31 863 запусков
lucataco/qwen2.5-omni-7b

Qwen2.5-Omni is an end-to-end multimodal model designed to perceive diverse modalities, including text, images, audio, and video, while simultaneously generating text and natural speech responses in a streaming manner.

31 610 запусков
cuuupid/qwen2-vl-2b

SOTA open-source model for chatting with videos and the newest model in the Qwen family

657 запусков
lucataco/minicpm-v-4

MiniCPM-V 4.0 has strong image and video understanding performance

317 запусков
lucataco/bulk-video-caption

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini

179 запусков
lucataco/apollo-3b

Apollo 3B - An Exploration of Video Understanding in Large Multimodal Models

149 запусков