All models
The full list is sorted by popularity on Replicate.
Stable Diffusion fork for generating tileable outputs using the v1.5 model
The LaMa (Large Mask Inpainting) model is an advanced image inpainting system designed to address the challenges of handling large missing areas, complex geometric structures, and high-resolution images.
Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions
FLUX.1-Schnell LoRA Explorer
Img2Img model that combines 6 other img2image models
A reasoning model trained with reinforcement learning, on par with OpenAI o1
Image to image face swapping
The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
An enhanced version of Qwen-Image-Edit-2509, featuring multiple improvements, including notably better consistency
RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting
Like Ideogram v2, but faster and cheaper
The DeepFloyd IF model was initially released as a non-commercial, research-only model. Please make sure you read and abide by the license before using it.
An efficient, intelligent, and truly open-source language model
Make stickers with AI. Generates graphics with transparent backgrounds.
Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
A 7 billion parameter language model from Mistral.
Towards Photo-Realistic Image Colorization via Dual Decoders
Determines the toxicity of text-to-image prompts using a llama-13b fine-tune. Returns a [SAFETY_RANKING] between 0 (safe) and 10 (toxic)
Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.
Realistic interior design with text and image inputs
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
A model which generates text in response to an input image and prompt.
A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency
FLUX.1-Dev Multi LoRA Explorer
LLaVA v1.6: Large Language and Vision Assistant (Nous-Hermes-2-34B)
✍️✨ Prompts to auto-magically relight your images
Simple image captioning model using CLIP and GPT-2
Real-Time Open-Vocabulary Object Detection using the xl weights
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.
Google’s hybrid “thinking” AI model optimized for speed and cost-efficiency
Advanced Face Swap powered by pixalto.app
Granite-3.3-8B-Instruct is an 8-billion-parameter language model with a 128K context length, fine-tuned for improved reasoning and instruction-following capabilities.
Semantic Segmentation
Turn a face into a sticker
Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
A deep learning approach to removing the background and adding a new background image
A Strong Image Tagging Model with Segment Anything
Create photos, paintings and avatars for anyone in any style within seconds. (Stylization version)
Fast, affordable version of GPT-4.1
Inpainting using RunwayML's stable-diffusion-inpainting checkpoint
Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet
An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Generate 5s and 10s videos in 720p resolution at 30fps
Create song covers with any RVC v2 trained AI voice from audio files.