Все модели
Полный список отсортирован по популярности на Replicate.
generate product images - PixMiller.com
TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM
Largest completely open sourced flow-based generation model that is capable of text-to-image generation
Create variations of an image while preserving shape and depth.
FLUX.1-dev Inpainting ControlNet model
Lightweight multimodal model for visual question answering, reasoning and captioning
Generate Mortal Kombat 1 fighters and character skins
Faster slight quality reduction compared to LTX-Video 13b
Modify a video with style transfer and prompt-based editing
Transfer the texture/style of one image onto another
Stable Diffusion fine tuned on Funko Pop
CogVLM2: Visual Language Models for Image and Video Understanding
DreamShaper V8 Model (Text2Img, Img2Img and Inpainting)
The power of flux with the model trained on VITON-HD used for try-on on categories such as upper body, lower body and full body dresses
Arbitrary Stylized Face Generation
Upscales images by a lot.
Image super-resolution with stable-diffusion V2
Stable Diffusion fine-tuned of the Codex Borgia, a 16th century Meso-American manuscript.
CLIP Guided latent k-diffusion
LoRA, fp16 Foto-Assisted-Diffusion-FAD_V0
[Non-commerical] A multi-image visual language model
Generate videos from text prompts with Kandinsky-2.2
360 Panorama SDXL image with inpainted wrapping seam
Generate images from text quickly. See https://replicate.com/afiaka87/laionide-v2 for a new checkpoint.
Flux LORA to generate images in the style of the arworks used for sowed versions of a song
A Flux LoRA trained on watercolor style photos
A cinematic model fine-tuned on SDXL
Lipsync model using MuseTalk
I fed the beast my oil paintings, made in the south of France. (version ec0d4305 is my fav)
Flux LoRA: Use "m1st1c" in your prompt to trigger this LoRA model.
https://huggingface.co/wavymulder/portraitplus
Image tagger fine-tuned on WaifuDiffusion w/ (SwinV2, SwinV2, ConvNext, and ViT)
Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer
Merge two images, with an optional third for controlnet.
Scalable Streaming Speech Synthesis with Large Language Models
Monocular metric depth estimation
Generate 16:9 Thumbnails. Use prefix - `Thumbnail in the style of TOK`
stylegan3 + clip
fine-tuned Stable Diffusion model trained on the game art from Elden Ring
Surya is a document OCR toolkit that does:
Separate Anything You Describe
Projection module trained to add vision capabilties to Llama 3 using SigLIP
Converts images or text into 512-dimensional vector embeddings.
Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
🎨 Fill in masked parts of images with FLUX.1-schnell 🖌️