Все модели
Полный список отсортирован по популярности на Replicate.
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
The fastest image generation model tailored for local development and personal use
An 8 billion parameter language model from Meta, fine tuned for chat completions
Generate image captions
A 70 billion parameter language model from Meta, fine tuned for chat completions
Convert speech in audio to text
Return CLIP features for the clip-vit-large-patch14 model
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Practical face restoration algorithm for *old photos* or *AI-generated faces*
A simple OCR Model that can easily extract text from an image.
Real-ESRGAN with optional face correction and adjustable upscale
A text-to-image generative AI model that creates beautiful images
Google's latest image editing model in Gemini 2.5
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
multilingual-e5-large: A multi-language text embedding model
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Generate CLIP (clip-vit-large-patch14) text & image embeddings
Base version of Llama 3, an 8 billion parameter language model from Meta.
Robust face restoration algorithm for old photos / AI-generated faces
Private instance of stable-diffusion
A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
image tagger
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
This is the fastest Flux endpoint in the world.
Generate detailed images from scribbled drawings
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities
Answers questions about images
whisper-large-v3, incredibly fast, with video transcription
Detect everything with language!
Low latency, low cost version of OpenAI's GPT-4o model
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x
whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗
Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
Hyper FLUX 8-step by ByteDance
Fill in masked parts of images with Stable Diffusion
FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Ultra fast flux kontext endpoint
A 7 billion parameter language model from Meta, fine tuned for chat completions
🦙 LaMa: Resolution-robust Large Mask Inpainting with Fourier Convolutions
Remove backgrounds from images.
Real-ESRGAN for image upscaling on an A100
Turn a face into 3D, emoji, pixel art, video game, claymation or toy
openai/clip-vit-large-patch14 with Transformers
Remove background from an image
A sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image here: https://replicate.com/prunaai/p-image
State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Google's state of the art image generation and editing model 🍌🍌